Applying OOC Techniques in the Reduction to Condensed Form for Very Large Symmetric Eigenproblems on GPUs

Davor Davidovic, Enrique S. Quintana-Orti
2012 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing  
In this paper we address the reduction of a dense matrix to tridiagonal form for the solution of symmetric eigenvalue problems on a graphics processor (GPU) when the data is too large to fit into the accelerator memory. We apply outof-core techniques to a three-stage algorithm, carefully redesigning the first stage to reduce the number of data transfers between the CPU and GPU memory spaces, maintain the memory requirements on the GPU within limits, and ensure high performance by featuring a high ratio between computation and communication.
doi:10.1109/pdp.2012.54 dblp:conf/pdp/DavidovicQ12 fatcat:c5zxetewxra2ljkdbftmpa5zpa