Do someone think it is feasible to sort a matrix (1M rows x 100 columns) for each row in GPU? We keeping the repeating sorting every day and want to know whether the performance could be improved to 10X or 20X faster ( Currently we just bought a server with 8 GPU K40).
l*m
3 楼
please refer https://solarianprogrammer.com/2013/02/04/sorting-data-in- parallel-cpu-gpu/ In my opinion, cpu should be fast enough for the size if the sort alg and implementation is correct. CPU-GPU data copy is a big overhead for such a task
for
【在 c*****l 的大作中提到】 : Do someone think it is feasible to sort a matrix (1M rows x 100 columns) for : each row in GPU? We keeping the repeating sorting every day and want to : know whether the performance could be improved to 10X or 20X faster ( : Currently we just bought a server with 8 GPU K40).
【在 c*****l 的大作中提到】 : Do someone think it is feasible to sort a matrix (1M rows x 100 columns) for : each row in GPU? We keeping the repeating sorting every day and want to : know whether the performance could be improved to 10X or 20X faster ( : Currently we just bought a server with 8 GPU K40).