Combining FT-MPI with H2O: Fault-Tolerant MPI Across Administrative Boundaries
19th IEEE International Parallel and Distributed Processing Symposium
We observe increasing interest in aggregating geographically distributed, heterogeneous resources to perform large-scale computations. MPI remains the most popular programming paradigm for such applications; however, as the size of computing environments increases, fault tolerance aspects become critically important. We argue that the fault tolerance model proposed by FT-MPI fits well in geographically distributed environments, even though its current implementation is confined to a single
doi:10.1109/ipdps.2005.141 dblp:conf/ipps/KurzyniecS05 fatcat:fma5puvcibcmpmcgsmp6pwsczq