Recovering Internet Service Sessions from Operating System Failures

F. Sultan, A. Bohra, S. Smaldone, Yufei Pan, P. Gallard, I. Neamtiu, L. Iftode
2005 IEEE Internet Computing  
Operating system hangs, crashes, deadlocks or panics are system failures that cause loss of active client sessions in an Internet service. We describe a system that detects such failures and recovers service sessions in clusters of Internet servers. The core of our system is Backdoors, a novel system architecture that enables recovery of light-weight state associated with client service sessions present in the memory of a server, even when its OS is no longer available. We have built a
more » ... ve built a Backdoors prototype using commodity components and describe our experience with the system in recovering service sessions from multiple node failures in a complex multi-tier auction service.
doi:10.1109/mic.2005.45 fatcat:v2iy5fglsffmtkdx35hqnuaq6i