A message system supporting fault tolerance

Anita Borg, Jim Baumbach, Sam Glazer
1983 Proceedings of the ninth ACM symposium on Operating systems principles - SOSP '83  
A simple and general design uses message-based communication to provide software tolerance of single-point hardware failures.  ...  An implementation for the Auragen 4000 series of M68000-based systems is described. The operating system, Autos™, is a distributed version of UNIX*.  ...  A solution which requires the dedication of substantial system resources solely for the support of fault tolerance is therefore unacceptable.  ... 
doi:10.1145/800217.806617 dblp:conf/sosp/BorgBG83 fatcat:brrf5ajdnzdqvmjmddwgaiisue

A message system supporting fault tolerance

Anita Borg, Jim Baumbach, Sam Glazer
1983 ACM SIGOPS Operating Systems Review  
A simple and general design uses message-based communication to provide software tolerance of single-point hardware failures.  ...  An implementation for the Auragen 4000 series of M68000-based systems is described. The operating system, Autos™, is a distributed version of UNIX*.  ...  A solution which requires the dedication of substantial system resources solely for the support of fault tolerance is therefore unacceptable.  ... 
doi:10.1145/773379.806617 fatcat:yypr3h2jvbfsnf2yiqlstdxzdi

Transparent Fault Tolerance for Web Services Based Architectures [chapter]

Vijay Dialani, Simon Miles, Luc Moreau, David De Roure, Michael Luck
2002 Lecture Notes in Computer Science  
While it is relatively easy to make an individual service fault-tolerant, improving fault-tolerance of services collaborating in multiple application scenarios is a challenging task.  ...  This increased flexibility in system composition makes it difficult to address classical distributed system issues such as fault-tolerance.  ...  Modifications to the SOAP messaging layer enable us to support both fault tolerance by message-based checkpointing and rollback, and fault tolerance by object replication.  ... 
doi:10.1007/3-540-45706-2_126 fatcat:x4tdnffng5d47mrlg5mfv5bfsm

Application-level fault tolerance in real-time embedded systems

Francisco Afonso, Carlos Silva, Adriano Tavares, Sergio Montenegro
2008 2008 International Symposium on Industrial Embedded Systems  
This work proposes and evaluates a fault tolerance framework for supporting the development of dependable applications.  ...  Redundant hardware implies the establishment of a distributed system executing a set of fault tolerance strategies by software, and may also employ some form of diversity, by using different variants or  ...  ACKNOWLEDGMENT This work has been supported by the Portuguese Foundation for Science and Technology (FCT).  ... 
doi:10.1109/sies.2008.4577690 dblp:conf/sies/AfonsoSTM08 fatcat:nv7fgtax7feolnp7edacaioxkq

Middleware Fault Tolerance Support for the BOSS Embedded Operating System

F. Afonso, C. Silva, S. Montenegro, A. Tavares
2006 2006 International Workshop on Intelligent Solutions in Embedded Systems  
This work proposes fault-tolerant support mechanisms for the BOSS embedded operating system, based on the application of proven fault tolerance strategies by middleware control software which transparently  ...  Critical embedded systems need a dependable operating system and application. Despite all efforts to prevent and remove faults in system development, residual software faults usually persist.  ...  Acknowledgments This work has been supported by the Portuguese Foundation for Science and Technology (FCT).  ... 
doi:10.1109/wises.2006.329114 dblp:conf/wises/AfonsoSMT06 fatcat:mw4nhfwa4ncefkgpo53zul6pum

Fault Tolerant Approaches for Distributed Real-time and Embedded Systems

Paul Rubel, Matthew Gillen, Joseph Loyall, Richard Schantz, Aniruddha Gokhale, Jaiganesh Balasubramanian, Aaron Paulos, Priya Narasimhan
2007 MILCOM 2007 - IEEE Military Communications Conference  
Fault tolerance (FT) is a crucial design consideration for mission-critical distributed real-time and embedded (DRE) systems, which combine the real-time characteristics of embedded platforms with the  ...  First, we describe three enhanced FT techniques that support the needs of DRE systems: a transparent approach to mixed-mode communication, auto-configuration of dynamic systems, and duplicate management  ...  , but infrequent message traffic) reduced the impact of fault tolerance on system performance.  ... 
doi:10.1109/milcom.2007.4455043 fatcat:vttdfjrfcve5tl3pdb5m7gatv4

Middleware Fault Tolerance Support for the BOSS Embedded Operating System

F. Afonso
2006 2006 International Workshop on Intelligent Solutions in Embedded Systems  
This work proposes fault-tolerant support mechanisms for the BOSS embedded operating system, based on the application of proven fault tolerance strategies by middleware control software which transparently  ...  Critical embedded systems need a dependable operating system and application. Despite all efforts to prevent and remove faults in system development, residual software faults usually persist.  ...  Acknowledgments This work has been supported by the Portuguese Foundation for Science and Technology (FCT).  ... 
doi:10.1109/wises.2006.237000 fatcat:suqnobbuzvdkhkcoky3kf3lo2y

Adaptive byzantine fault tolerance support for agent oriented systems: The BDARX

Alvi et al., Department of Computer Science and Information Technology, The University of Lahore, Lahore, Pakistan
2019 International Journal of Advanced and Applied Sciences  
Multi-agent systems (MAS) with fault tolerance capabilities have got much attention during the recent years. Many fault-tolerance mechanisms have been proposed.  ...  It deals with adaptive dynamic replication schemes to make agent systems more fault tolerant, but it does not handle Byzantine faults in MAS environments.  ...  In order to provide long term fault tolerance support in open and distributed systems, DARX has multiple dimensions including nodes selection, level of fault tolerance support, and scalability of overall  ... 
doi:10.21833/ijaas.2019.02.009 fatcat:nyjk6mn645hqne473yoz6scw4a

Gateways for Accessing Fault Tolerance Domains [chapter]

P. Narasimhan, L. E. Moser, P. M. Melliar-Smith
2000 Lecture Notes in Computer Science  
Enterprise applications can be structured as domains, where each domain contains objects that are replicated for fault tolerance, with the replication being managed by a fault tolerance infrastructure  ...  Gateways can allow unreplicated clients to benefit from the fault tolerance services of the replicated servers, without compromising replica consistency within the fault tolerance domain.  ...  A key issue in fault tolerance for CORBA will be the mechanisms to support interaction of non-fault-tolerant CORBA systems with fault-tolerant CORBA systems.  ... 
doi:10.1007/3-540-45559-0_5 fatcat:excyrb46xbdezbn6rrovyhg55a

Multi-μ

Luís Miguel Pinho, Francisco Vasques
1998 ACM SIGAda Ada Letters  
Keywords Ada 95, Real-Time Systems, Software Based Fault Tolerance, Off-The-Shelf Components.  ...  The Multi-µ architecture, based on a specifically proposed Fault Tolerance Manager (FTManager), is then described.  ...  ADA SUPPORT FOR FAULT TOLERANT SYSTEMS The Ada language doesn't provide direct support for fault tolerance mechanisms, apart from the exception mechanism, which can provide forward error recovery.  ... 
doi:10.1145/301687.289532 fatcat:ncqjlsm3mnaqpekk3itzdqtk4q

Page 32 of IEEE Transactions on Computers Vol. 52, Issue 1 [page]

2003 IEEE Transactions on Computers  
Systems that use the CORBA service approach implement the fault tolerance mechanisms as a Common Object Service on top of the ORB.  ...  The adapter objects convert CORBA’s messages into the multicast messages in the group communication system and make multiple replicas look like a single replica for use in active replication.  ... 

Eternal: a component-based framework for transparent fault-tolerant CORBA

P. Narasimhan, L. E. Moser, P. M. Melliar-Smith
2002 Software, Practice & Experience  
The Eternal system implements the new Fault-Tolerant CORBA standard.  ...  The Eternal system is a component-based middleware framework that provides transparent fault tolerance for enterprise applications, and that ensures continuous 24 × 7 operation without requiring special  ...  ACKNOWLEDGEMENTS This research has been supported by the Defense Advanced Research Projects Agency in conjunction with the Office of Naval Research and the Air Force Research Laboratory, Rome, under Contracts  ... 
doi:10.1002/spe.458 fatcat:th7oqegehzdlxfs2mklxfffeyy

Reliable multicore processors for NASA space missions

Carlos Villalpando, David Rennels, Raphael Some, Manuel Cabanas-Holmen
2011 2011 Aerospace Conference  
-Network should include support for determining if a message made it to its intended destination, either by hardware support, or overlying redundant software support.  ...  Error detection and recovery is still needed to deal with any detrimental errors that may still occur • Software Implemented Fault Tolerance (SIFT) is a tool, but needs architecture support.  ...  -Analyze the effectiveness of each method against the required performance -Some requirements may require redundant hardware and software to meet the error rate requirements, some may be tolerant of restarting  ... 
doi:10.1109/aero.2011.5747447 fatcat:lj4koqi2lfbahf6sv7tfomspea

S/W Fault-tolerant OFP System for UAVs based on Partition Computing

Eun-Hye Jeong, Jung-Guk Kim
2013 Information Engineering Research Institute procedia  
However, in the case of a small UAV, it is preferred to implement a S/W fault-tolerant system that runs primary and recovery systems together in a single flight control computer to reduce the payload.  ...  On the other hand, fault-tolerant Operational Flight Programs (OFP) for unmanned aerial vehicles have usually been implemented as primary-backup systems based on dual nodes.  ...  Acknowledgements This work was supported by the Defense Acquisition Program Administration and Agency for Defense Development under contract.  ... 
doi:10.1016/j.ieri.2013.11.036 fatcat:bmdfhff72zebdcyn4b5pckxu2q

Self-Healing Network for Scalable Fault Tolerant Runtime Environments [chapter]

Thara Angskun, Graham E. Fagg, George Bosilca, Jelena Pješivac-Grbović, Jack J. Dongarra
2007 Distributed and Parallel Systems  
This paper presents a self-healing network (SHN) for supporting scalable and fault-tolerant runtime environments.  ...  It will automatically recover itself after a failure occurs. SHN is implemented on top of a scalable fault-tolerant protocol (SFTP).  ...  The lack of scalable fault-tolerance parallel runtime environments motivates us to design and implement such a system.  ... 
doi:10.1007/978-0-387-69858-8_8 fatcat:xlev6tjxgjc2zkptwyyuwnhhle