A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Hierarchical Matrix-Matrix Multiplication Based on Multiprocessor Tasks
[chapter]
2004
Lecture Notes in Computer Science
We consider the realization of matrix-matrix multiplication and propose a hierarchical algorithm implemented in a task-parallel way using multiprocessor tasks on distributed memory. The algorithm has been designed to minimize the communication overhead while showing large locality of memory references. The task-parallel realization makes the algorithm especially suited for cluster of SMPs since tasks can then be mapped to the different cluster nodes in order to efficiently exploit the cluster
doi:10.1007/978-3-540-24687-9_1
fatcat:ogm27c5efjddrfzk7c3ec2bpzi