A self-healing redundancy scheme for mission/safety-critical applications

In the nanoelectronics era, multiple faults or failures in circuits and systems deployed in mission- and safety-critical applications, such as space, aerospace, nuclear etc., are known to occur. To withstand these, higher order redundancy is suggested to be used selectively in the sensitive portions...

Full description

Saved in:
Bibliographic Details
Main Authors: Balasubramanian, Padmanabhan, Maskell, Douglas Leslie
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/10356/103389
http://hdl.handle.net/10220/47290
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-103389
record_format dspace
spelling sg-ntu-dr.10356-1033892020-03-07T11:50:49Z A self-healing redundancy scheme for mission/safety-critical applications Balasubramanian, Padmanabhan Maskell, Douglas Leslie School of Computer Science and Engineering Fault Tolerance DRNTU::Engineering::Computer science and engineering Redundancy In the nanoelectronics era, multiple faults or failures in circuits and systems deployed in mission- and safety-critical applications, such as space, aerospace, nuclear etc., are known to occur. To withstand these, higher order redundancy is suggested to be used selectively in the sensitive portions of a circuit or system. In this context, the distributed minority and majority voting based redundancy (DMMR) scheme was proposed as an alternative to the N-modular redundancy (NMR) scheme for the efficient implementation of higher order redundancy. However, the DMMR scheme is not self-healing. In this paper, we present a new self-healing redundancy (SHR) scheme that can inherently correct its internal faults or failures without any external intervention, which makes it ideal for mission/safety-critical applications. To achieve the same degree of fault tolerance, the SHR scheme requires fewer function blocks than the NMR and DMMR schemes. We present the architectures of the proposed SHR scheme, discuss the system reliability, and provide the design metrics estimated for example SHR systems alongside the corresponding NMR and DMMR systems using a 32/28-nm CMOS technology. From the perspectives of fault tolerance, self-healing capability, and optimizations in the design metrics, the SHR scheme is preferable to the NMR and DMMR schemes. MOE (Min. of Education, S’pore) Published version 2019-01-02T01:56:48Z 2019-12-06T21:11:34Z 2019-01-02T01:56:48Z 2019-12-06T21:11:34Z 2018 Journal Article Balasubramanian, P., & Maskell, D. L. (2018). A self-healing redundancy scheme for mission/safety-critical applications. IEEE Access, 6, 69640-69649. doi:10.1109/ACCESS.2018.2880763 https://hdl.handle.net/10356/103389 http://hdl.handle.net/10220/47290 10.1109/ACCESS.2018.2880763 en IEEE Access © 2018 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information 10 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Fault Tolerance
DRNTU::Engineering::Computer science and engineering
Redundancy
spellingShingle Fault Tolerance
DRNTU::Engineering::Computer science and engineering
Redundancy
Balasubramanian, Padmanabhan
Maskell, Douglas Leslie
A self-healing redundancy scheme for mission/safety-critical applications
description In the nanoelectronics era, multiple faults or failures in circuits and systems deployed in mission- and safety-critical applications, such as space, aerospace, nuclear etc., are known to occur. To withstand these, higher order redundancy is suggested to be used selectively in the sensitive portions of a circuit or system. In this context, the distributed minority and majority voting based redundancy (DMMR) scheme was proposed as an alternative to the N-modular redundancy (NMR) scheme for the efficient implementation of higher order redundancy. However, the DMMR scheme is not self-healing. In this paper, we present a new self-healing redundancy (SHR) scheme that can inherently correct its internal faults or failures without any external intervention, which makes it ideal for mission/safety-critical applications. To achieve the same degree of fault tolerance, the SHR scheme requires fewer function blocks than the NMR and DMMR schemes. We present the architectures of the proposed SHR scheme, discuss the system reliability, and provide the design metrics estimated for example SHR systems alongside the corresponding NMR and DMMR systems using a 32/28-nm CMOS technology. From the perspectives of fault tolerance, self-healing capability, and optimizations in the design metrics, the SHR scheme is preferable to the NMR and DMMR schemes.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Balasubramanian, Padmanabhan
Maskell, Douglas Leslie
format Article
author Balasubramanian, Padmanabhan
Maskell, Douglas Leslie
author_sort Balasubramanian, Padmanabhan
title A self-healing redundancy scheme for mission/safety-critical applications
title_short A self-healing redundancy scheme for mission/safety-critical applications
title_full A self-healing redundancy scheme for mission/safety-critical applications
title_fullStr A self-healing redundancy scheme for mission/safety-critical applications
title_full_unstemmed A self-healing redundancy scheme for mission/safety-critical applications
title_sort self-healing redundancy scheme for mission/safety-critical applications
publishDate 2019
url https://hdl.handle.net/10356/103389
http://hdl.handle.net/10220/47290
_version_ 1681034436185948160