A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit <a rel="external noopener" href="http://www.csa.iisc.ernet.in:80/~cplse/papers/ukbr-pact-14-1.pdf">the original URL</a>. The file type is <code>application/pdf</code>.
<i title="ACM Press">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/i2tihayjsjhmhmrqaqe2apcipy" style="color: black;">Proceedings of the 23rd international conference on Parallel architectures and compilation - PACT '14</a>
This paper deals with optimizing time-iterated computations on periodic data domains. These computations are prevalent in computational sciences, particularly in partial differential equation solvers. We propose a fully automatic technique suitable for implementation in a compiler or in a domain-specific code generator for such computations. Dependence patterns on periodic data domains prevent existing algorithms from finding tiling opportunities. Our approach augments a state-of-the-art parallelization and locality-enhancing algorithm from the polyhedral framework to allow time-tiling of stencil computations on periodic domains. Experimental results on the swim SPEC CPU2000fp benchmark show a speedup of 5× and 4.2× over the highest SPEC performance achieved by native compilers on Intel Xeon and AMD Opteron multicore SMP systems, respectively. On other representative stencil computations, our scheme provides performance similar to that achieved with no periodicity, and a very high speedup is obtained over the native compiler. We also report a mean speedup of about 1.5× over a domain-specific stencil compiler supporting limited cases of periodic boundary conditions. To the best of our knowledge, it has been infeasible to manually reproduce such optimizations on swim or any other periodic stencil, especially on a data grid of two dimensions or higher.
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2628071.2628106">doi:10.1145/2628071.2628106</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/IEEEpact/BondhugulaBCPV14.html">dblp:conf/IEEEpact/BondhugulaBCPV14</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5csjr5drv5aqvjj6buyxke3bie">fatcat:5csjr5drv5aqvjj6buyxke3bie</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170428131443/http://www.csa.iisc.ernet.in:80/~cplse/papers/ukbr-pact-14-1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/38/f5/38f5bd7a79437b97df9a174e10e89ab25d2578aa.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2628071.2628106"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>