A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
P-Ray: A Software Suite for Multi-core Architecture Characterization
[chapter]
2008
Lecture Notes in Computer Science
The computer industry has transitioned into multi-core and many-core parallel systems. The CUDA programming environment from NVIDIA is an attempt to make programming many-core GPUs more accessible to programmers. However, there are still many burdens placed upon the programmer to maximize performance when using CUDA. One such burden is dealing with the complex memory hierarchy. Efficient and correct usage of the various memories is essential, making a difference of 2-17x in performance.
doi:10.1007/978-3-540-89740-8_13
fatcat:hv2aoouhcve4xc2vlff77k7q4i