Using Recursion to Boost ATLAS's Performance [chapter]

Paolo D'Alberto, Alexandru Nicolau
High-Performance Computing  
We investigate the performance benefits of a novel recursive formulation of Strassen's algorithm over highly tuned matrix-multiply (MM) routines, such as the widely used ATLAS for high-performance systems. We combine Strassen's recursion with a highly tuned version of ATLAS MM and present a family of recursive algorithms that achieve up to 15% speed-up over ATLAS alone. We show experimental results for seven different systems.
doi:10.1007/978-3-540-77704-5_12 dblp:conf/ishpc/DAlbertoN05 fatcat:uvgyyoiogjbcpmoozgn3hmjjg4
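
The abstract describes combining Strassen's recursion with a tuned MM routine at the leaves. A minimal sketch of that general idea follows, assuming the classic textbook Strassen formulation rather than the authors' novel one, and square matrices stored as lists of rows. The names `leaf_mm`, `strassen`, and `cutoff` are hypothetical; `leaf_mm` is a plain triple loop standing in for a tuned kernel such as ATLAS's, which this sketch does not call.

```python
def leaf_mm(A, B):
    """Plain triple-loop multiply; a stand-in for a tuned kernel (assumption)."""
    n, m, p = len(A), len(B), len(B[0])
    C = [[0] * p for _ in range(n)]
    for i in range(n):
        for k in range(m):
            a = A[i][k]
            for j in range(p):
                C[i][j] += a * B[k][j]
    return C


def add(X, Y):
    """Element-wise sum of two equally sized matrices."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def sub(X, Y):
    """Element-wise difference of two equally sized matrices."""
    return [[x - y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def split(M):
    """Split an even-sized square matrix into four quadrants."""
    h = len(M) // 2
    return ([r[:h] for r in M[:h]], [r[h:] for r in M[:h]],
            [r[:h] for r in M[h:]], [r[h:] for r in M[h:]])


def strassen(A, B, cutoff=2):
    """Classic Strassen recursion on square matrices.

    Falls back to leaf_mm when the size is at most `cutoff` or is odd,
    so the recursion only handles sizes it can halve exactly.
    """
    n = len(A)
    if n <= cutoff or n % 2:
        return leaf_mm(A, B)
    A11, A12, A21, A22 = split(A)
    B11, B12, B21, B22 = split(B)
    # Seven recursive products instead of the usual eight.
    M1 = strassen(add(A11, A22), add(B11, B22), cutoff)
    M2 = strassen(add(A21, A22), B11, cutoff)
    M3 = strassen(A11, sub(B12, B22), cutoff)
    M4 = strassen(A22, sub(B21, B11), cutoff)
    M5 = strassen(add(A11, A12), B22, cutoff)
    M6 = strassen(sub(A21, A11), add(B11, B12), cutoff)
    M7 = strassen(sub(A12, A22), add(B21, B22), cutoff)
    C11 = add(sub(add(M1, M4), M5), M7)
    C12 = add(M3, M5)
    C21 = add(M2, M4)
    C22 = add(add(sub(M1, M2), M3), M6)
    # Stitch the quadrants back into one matrix.
    top = [left + right for left, right in zip(C11, C12)]
    bottom = [left + right for left, right in zip(C21, C22)]
    return top + bottom
```

In a sketch like this, `cutoff` decides where the recursion hands over to the leaf routine. The value that pays off in practice depends on the system and on the leaf kernel, so the default here is illustrative only.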