Fault-Tolerant Computing with Heterogeneous Hardening Modes [chapter]

Florian Kriebel, Faiq Khalid, Bharath Srinivas Prabakaran, Semeen Rehman, Muhammad Shafique
2020 Embedded Systems  
AbstractFault-tolerance using (full-scale) redundancy-based techniques has been employed to detect and correct reliability errors (i.e., soft errors), but they pose significant area and power overhead. On the other hand, due to the masking and the error tolerance properties at different system layers and of different applications, respectively, reliable heterogeneous architectures have been emerged as an attractive design choice for power-efficient dependable computing platforms. This chapter
more » ... scusses the building blocks of such computing systems, based on both embedded and superscalar processors, with different reliability (fault-tolerant) modes at the architecture layer to memories like caches, for heterogeneous in-order and out-of-order processors. We provide a comprehensive reliability, i.e., soft error, vulnerability analysis of different components in in-order and out-of-order processors, e.g., caches. We also discuss different methodologies to improve the performance and power of such a system by analyzing these vulnerabilities. Moreover, we show how such heterogeneous hardware-level hardening modes can further be complemented by software-level techniques that can be realized using a reliability-driven compiler (as introduced in Chapter "Dependable Software Generation and Execution on Embedded Systems").
doi:10.1007/978-3-030-52017-5_7 fatcat:d3wgo4fcvfcldgmhl2hfido5ee