Dependable Systems [chapter]

André Schiper
2006 Lecture Notes in Computer Science  
Improving the dependability of computer systems is a critical task. In this context, the paper surveys techniques for achieving fault tolerance in distributed systems through replication. The main replication techniques are explained first. Group communication is then introduced as the communication infrastructure on which the different replication techniques can be implemented. Finally, the difficulty of implementing group communication is discussed, and the most ... algorithms are presented. Almost the same paper appears under the title "Group Communication: from practice to theory" in
doi:10.1007/11808107_2 fatcat:otlxfarnp5ekthg6xtsfwro7du