A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2015; you can also visit the original URL.
The file type is application/pdf
.
Divergence Analysis and Optimizations
2011
2011 International Conference on Parallel Architectures and Compilation Techniques
The growing interest in GPU programming has brought renewed attention to the Single Instruction Multiple Data (SIMD) execution model. SIMD machines give application developers a tremendous computational power; however, the model also brings restrictions. In particular, processing elements (PEs) execute in lock-step, and may lose performance due to divergences caused by conditional branches. In face of divergences, some PEs execute, while others wait; this alternation ending when they reach a
doi:10.1109/pact.2011.63
dblp:conf/IEEEpact/CoutinhoSPM11
fatcat:yw6lopa7tngrhgntbdprunlpne