A Component-Based Methodology to Design Arbitrary Failure Detectors for Distributed Protocols

Roberto Baldoni, Jean-Michel Helary, Sara Tucci Piergiovanni
2007 10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07)  
Nowadays, there are many protocols able to cope with process crashes, but, unfortunately, a process crash represents only a particular faulty behavior. Handling tougher failures (e.g. sending omission failures, receive omission failures, arbitrary failures) is a real practical challenge due to malicious attacks or unexpected software errors. This paper proposes a component-based methodology allowing to take a protocol A resilient to crash failures and to add software components in order to
more » ... ts in order to adapt the protocol A to be resilient to more general failures than crash, without changing the code of A. On this basis, it introduces the notions of liveness failure detector and safety failure detector, two independent software components to be used by a protocol to increases its resilience respectively to liveness and safety failures of processes running the protocol. Then, the feasibility of this approach is shown, by providing an implementation of liveness failure detectors and of safety failure detectors for a protocol solving the problem of global data computation. Definition 3 A process p is correct (in a run of a protocol A) if p is live and safe in this run.
doi:10.1109/isorc.2007.6 dblp:conf/isorc/BaldoniHP07 fatcat:q6sc6tioare55cuyj32oa55pre