Pretty Good Accuracy in Matrix Multiplication with GPUs
2010
2010 Ninth International Symposium on Parallel and Distributed Computing
With systems such as Roadrunner, there is a trend in supercomputing to offload parallel tasks to special-purpose co-processors composed of many relatively simple scalar processors. The cheaper, commodity-class equivalent of such a processor is the graphics card, which potentially offers supercomputer power within the confines of a desktop PC. Graphics cards, however, are not without problems. These range from the lack of double precision on most cards to a fairly steep drop in performance
doi:10.1109/ispdc.2010.12
dblp:conf/ispdc/BadinBDN10
fatcat:5ggvtf5ai5defpwleykhlsds4e