Application-Level Diagnostic and Membership Protocols for Generic Time-Triggered Systems

M Serafini, Péter Bokor, N Suri, J Vinter, A Ademaj, Wolfgang Brandstätter, Fulvio Tagliabò, J Koch
2011 IEEE Transactions on Dependable and Secure Computing  
We present on-line tunable diagnostic and membership protocols for generic time-triggered (TT) systems to detect crashes, send/receive omission faults and network partitions. Compared to existing diagnostic and membership protocols for TT systems, our protocols do not rely on the single-fault assumption and also tolerate non fail-silent (Byzantine) faults. They run at the application level and can be added on top of any TT system (possibly as a middleware component) without requiring
more » ... ns at the system level. The information on detected faults is accumulated using a penalty/reward algorithm to handle transient faults. After a fault is detected, the likelihood of node isolation can be adapted to different system configurations, including configurations where functions with different criticality levels are integrated. All protocols are formally verified using model checking. Using actual automotive and aerospace parameters, we also experimentally demonstrate the transient fault handling capabilities of the protocols.
doi:10.1109/tdsc.2010.23 fatcat:ko7k2iffzvhqje6ymi63luudfy