Variable-Size Batched Condition Number Calculation on GPUs
2018
2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
We present a kernel designed to quickly compute the condition numbers of a large collection of tiny matrices on a graphics processing unit (GPU). The matrices can differ in size, and the kernel uses pivoting to ensure a numerically stable matrix inversion. The performance assessment reveals that, in double-precision arithmetic, the new GPU kernel achieves up to 550 GFLOPs (billions of floating-point operations per second) and 800 GFLOPs on NVIDIA's P100 and V100 GPUs,
doi:10.1109/cahpc.2018.8645907
dblp:conf/sbac-pad/AnztDFG18
fatcat:hogahlzelvhnnag6hzl4xz4ikq
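
The computation described in the abstract can be illustrated with a small sequential sketch. This is not the GPU kernel from the paper: it is a plain-Python CPU illustration that inverts each matrix by Gauss-Jordan elimination with partial (row) pivoting and then forms the condition number as the product of the norms of the matrix and its inverse. The choice of the infinity norm, the pivoting variant, and all function names (`inf_norm`, `invert_with_pivoting`, `condition_number`, `batched_condition_numbers`) are assumptions made for illustration, not details taken from the paper.

```python
from typing import List

Matrix = List[List[float]]


def inf_norm(a: Matrix) -> float:
    """Infinity norm: the largest absolute row sum."""
    return max(sum(abs(x) for x in row) for row in a)


def invert_with_pivoting(a: Matrix) -> Matrix:
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    # Build the augmented system [A | I]; elimination turns it into [I | A^-1].
    aug = [
        [float(x) for x in row] + [1.0 if i == j else 0.0 for j in range(n)]
        for i, row in enumerate(a)
    ]
    for col in range(n):
        # Partial pivoting: pick the row with the largest entry in this column.
        pivot_row = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if aug[pivot_row][col] == 0.0:
            raise ZeroDivisionError("matrix is singular")
        aug[col], aug[pivot_row] = aug[pivot_row], aug[col]
        # Scale the pivot row so the pivot becomes 1.
        pivot = aug[col][col]
        aug[col] = [x / pivot for x in aug[col]]
        # Eliminate the column from every other row.
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                if factor != 0.0:
                    aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def condition_number(a: Matrix) -> float:
    """Condition number in the infinity norm: ||A|| * ||A^-1||."""
    return inf_norm(a) * inf_norm(invert_with_pivoting(a))


def batched_condition_numbers(batch: List[Matrix]) -> List[float]:
    """Process a batch whose matrices may each have a different size."""
    return [condition_number(a) for a in batch]


if __name__ == "__main__":
    batch = [
        [[2.0]],                                  # 1 x 1
        [[1.0, 2.0], [3.0, 4.0]],                 # 2 x 2
        [[0.0, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],  # 3 x 3, needs a row swap
    ]
    for matrix, kappa in zip(batch, batched_condition_numbers(batch)):
        print(len(matrix), kappa)
```

For the 2 x 2 example the infinity-norm condition number is approximately 21, since the matrix has norm 7 and its inverse has norm 3. On a GPU, each matrix in the batch would typically be assigned to its own group of threads rather than processed in a loop; the loop here only shows that the matrices are independent and may vary in size.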