A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Implementation of Strassen's algorithm for matrix multiplication
1996
Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '96
In this paper we report on the development of an e cient and portable implementation of Strassen's matrix multiplication algorithm. Our implementation is designed to be used in place of DGEMM, the Level 3 BLAS matrix multiplication routine. E cient performance will be obtained for all matrix sizes and shapes and the additional memory needed for temporaryvariables has beenminimized. Replacing DGEMM with our routine should provide a signi cant performance gain for large matrices while providing
doi:10.1145/369028.369096
fatcat:hls63elv5ngcbjpxe6yr7xoxni