A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Effective automatic computation placement and dataallocation for parallelization of regular programs
2014
Proceedings of the 28th ACM international conference on Supercomputing - ICS '14
This paper proposes techniques for data allocation and computation mapping when compiling affine loop nest sequences for distributedmemory clusters. Techniques for transformation and detection of parallelism, and generation of communication sets relying on the polyhedral framework already exist. However, these recent approaches used a simple strategy to map computation to nodestypically block or block-cyclic. These mappings may lead to excess communication volume for multiple loop nests. In
doi:10.1145/2597652.2597673
dblp:conf/ics/ReddyB14
fatcat:56ct6skdwrdr7apbp55du5zjnu