A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2014; you can also visit the original URL.
The file type is application/pdf
.
Improving GPGPU concurrency with elastic kernels
2013
Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems - ASPLOS '13
Each new generation of GPUs vastly increases the resources available to GPGPU programs. GPU programming models (like CUDA) were designed to scale to use these resources. However, we find that CUDA programs actually do not scale to utilize all available resources, with over 30% of resources going unused on average for programs of the Parboil2 suite that we used in our work. Current GPUs therefore allow concurrent execution of kernels to improve utilization. In this work, we study concurrent
doi:10.1145/2451116.2451160
dblp:conf/asplos/PaiTG13
fatcat:glstzpry65goxjx4pdqgnnqdxm