Reliability estimation of fault-tolerant systems: tools and techniques

R. Geist, K.S. Trivedi
1990 Computer  
A power has focused attention on tools and techniques we might use to accurately estimate the reliability of a proposed computing system on the basis of models derived from the design of that system. Reliability modeling of fault-tolerant computing systems has become an integral part of the system design process, especially for those systems with life-critical applications such as aircraft and spacecraft flight control. Reliability modeling has also become an important arena in which to view
more » ... classic struggle between model accuracy, that is, the extent to which a model of a system faithfully represents the system under study, and model tractability, that is, the extent to which the modeler can extract useful information from the model in a cost-effective manner. Within this arena, certain additional complexity constraints that typically render the classical modeling tools inadequate compound the difficulty in searching for solutions to this trade-off problem. One constraint is the huge disparity in state transition rates. A rate ratio (largest ratexmallest rate) of 1O'O within a single model is not uncommon, yielding "stiff' systems of differential, integral, or alge-Comparatively evaluating state-ofthe-art tools and techniques helps us estimate the reliability of fault-tolerant computing systems. We consider design limitations, efficiency, and accuracy.
doi:10.1109/2.56852 fatcat:abh4ofctkvbatkymj67ikofj64