A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Efficient and flexible fault tolerance and migration of scientific simulations using CUMULVS
1998
Proceedings of the SIGMETRICS symposium on Parallel and distributed tools - SPDT '98
Many practical scienti c computer applications would bene t from a simple checkpointing mechanism that provides automatic restart or recovery in response to faults and failures, and enables dynamic load balancing and improved resource utilization using task migration. However, developing applications with such capabilities, especially in distributed, heterogeneous operating environments, is very challenging. CUMULVS is a middleware infrastructure for interacting with parallel scienti c
doi:10.1145/281035.281042
fatcat:ik5tertvezhx3fajj5moh4zixu