A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Global communication optimization for tensor contraction expressions under memory constraints
Proceedings International Parallel and Distributed Processing Symposium
The accurate modeling of the electronic structure of atoms and molecules involves computationally intensive tensor contractions involving large multi-dimensional arrays. The efficient computation of complex tensor contractions usually requires the generation of temporary intermediate arrays. These intermediates could be extremely large, but they can often be generated and used in batches through appropriate loop fusion transformations. To optimize the performance of such computations on
doi:10.1109/ipdps.2003.1213121
dblp:conf/ipps/CociorvaGKBLSR03
fatcat:dz5nucpcwjeclfngh523zfkotu