A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs
2012
Journal of Computer Science and Technology
In this paper, we present a hybrid circular queue method that can significantly boost the performance of stencil computations on GPU by carefully balancing usage of registers and shared-memory. Unlike earlier methods that rely on circular queues predominantly implemented using indirectly addressable shared memory, our hybrid method exploits a new reuse pattern spanning across the multiple time steps in stencil computations so that circular queues can be implemented by both shared memory and
doi:10.1007/s11390-012-1206-3
fatcat:r45ekbmayjayrfrmka77nqqe44