Filters








4 Hits in 4.8 sec

CPU-GPU hybrid bidiagonal reduction with soft error resilience

Yulu Jia, Piotr Luszczek, George Bosilca, Jack J. Dongarra
2013 Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems - ScalA '13  
In this paper, we present a design of a bidiagonal reduction algorithm that is resilient to soft errors, and we also describe its implementation on hybrid CPU-GPU architectures.  ...  The tests were performed on a Sandy Bridge CPU coupled with an NVIDIA Kepler GPU.  ...  Our fault tolerant bidiagonal reduction algorithm employs reverse computation and algorithm-based fault tolerance to detect, locate, and correct soft errors in the bidiagonal reduction on CPU-GPU hybrid  ... 
doi:10.1145/2530268.2530270 dblp:conf/sc/JiaLBD13 fatcat:fnzowqdodzhz5bl57mwkeujere

Algorithm-Based Fault Tolerance for Two-Sided Dense Matrix Factorizations

Doctoral Dissertations, Yulu Jia
2015 unpublished
We also designed algorithm-based fault tolerant algorithms for the CPU-GPU vii hybrid Hessenberg reduction algorithm and the CPU-GPU hybrid bidiagonal reduction algorithm.  ...  Solving EVP and SVP numerically involves two-sided matrix factorizations: the Hessenberg reduction, the tridiagonal reduction, and the bidiagonal reduction.  ...  This work also addresses soft errors in the Hessenberg reduction algorithm and the bidiagonal reduction algorithms on CPU-GPU hybrid algorithms.  ... 
fatcat:cs2f6ueb2bdkherfcy3udzn6xq

Welcome from the SHAXC-2 Organizing Committee Aims and Scope SHAXC-2 Organizing Committee

David Keyes, David Keyes, George Turkiyyah
2014 unpublished
The challenges of the exascale include: reducing communication, reducing synchronization, increasing concurrency to exploit specialized manycore and GPU processing elements, and incorporating resilience  ...  into the algorithms, themselves, rather than leaving the entire burden of resilience to the hardware.  ...  He is interested in optimizing dense linear algebra for hybrid-distributed memory systems equipped with GPUs.  ... 
fatcat:rdvlyeogwjbnzk6p2j6lzm2see

Scalability and fault tolerance of ABFT methods for dense matrix multiplication

Svetoslav Inkolov
2018 unpublished
As an essential basis for the results DPLASMA, a highly optimized library for distributed hybrid systems was used.  ...  Therefore, the need for exhaustive fault detection and error correction algorithms become increasingly important over the last few years.  ...  The scope for singular values is defined by reducing to condensed form where it can be done either by reduction to band-bidiagonal or reduction to proper bidiagonal.  ... 
doi:10.25365/thesis.51789 fatcat:m7t2bxc3sfc25ckuuo5npbgr2q