A quantitative study of irregular programs on GPUs

Martin Burtscher, Rupesh Nasre, Keshav Pingali
2012 2012 IEEE International Symposium on Workload Characterization (IISWC)  
GPUs have been used to accelerate many regular applications and, more recently, irregular applications in which the control flow and memory access patterns are data-dependent and statically unpredictable. This paper defines two measures of irregularity called control-flow irregularity and memory-access irregularity, and investigates, using performance-counter measurements, how irregular GPU kernels differ from regular kernels with respect to these measures. For a suite of 13 benchmarks, we find
more » ... that (i) irregularity at the warp level varies widely, (ii) control-flow irregularity and memory-access irregularity are largely independent of each other, and (iii) most kernels, including regular ones, exhibit some irregularity. A program's irregularity can change between different inputs, systems, and arithmetic precision but generally stays in a specific region of the irregularity space. Whereas some highly tuned implementations of irregular algorithms exhibit little irregularity, trading off extra irregularity for better locality or less work can improve overall performance.
doi:10.1109/iiswc.2012.6402918 dblp:conf/iiswc/BurtscherNP12 fatcat:7mykmlj2fjh5pdy7hryswxbnee