FAC: a fault-tolerant design approach based on approximate computing

This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the...

Full description

Saved in:
Bibliographic Details
Main Authors: Balasubramanian, Padmanabhan, Maskell, Douglas Leslie
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170487
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-170487
record_format dspace
spelling sg-ntu-dr.10356-1704872023-09-22T15:35:45Z FAC: a fault-tolerant design approach based on approximate computing Balasubramanian, Padmanabhan Maskell, Douglas Leslie School of Computer Science and Engineering Hardware & Embedded Systems Lab (HESL) Engineering::Computer science and engineering::Hardware Engineering::Electrical and electronic engineering::Electronic circuits Engineering::Electrical and electronic engineering::Integrated circuits Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies Fault Tolerance Redundancy Approximate Computing Arithmetic Circuits Digital Logic Design Low Power Design This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the processing unit may be a circuit or a system. However, TMR incurs more than 200% overhead in terms of area and power compared to a single processing unit. Alternative redundancy approaches have been proposed in the literature to mitigate these overheads associated with TMR, but they provide only partial or moderate fault tolerance. Among the alternatives, majority voting-based reduced precision redundancy (MVRPR) may be useful for error-resilient applications such as digital signal processing. While MVRPR guarantees only moderate fault tolerance, the proposed FAC is well-suited for error-resilient applications and ensures 100% tolerance to any single fault or a faulty processing unit, like TMR. In this work, we evaluate the performance of TMR, MVRPR, and FAC for a digital image processing application. The image processing results obtained demonstrate the effectiveness of FAC. Moreover, when the processing unit is implemented using a 28-nm CMOS technology, FAC achieves significant improvements over TMR, including a 15.3% reduction in delay, a 19.5% reduction in area, and a 24.7% reduction in power. Compared to MVRPR, FAC exhibits notable enhancements, with an 18% reduction in delay, a 5.4% reduction in area, and an 11.2% reduction in power. When considering the power-delay product, which reflects energy efficiency, FAC demonstrates a 36.2% reduction compared to TMR and a 27.2% reduction compared to MVRPR. When considering the power-delay-area product, which represents design efficiency, FAC achieves a 48.7% reduction compared to TMR and a 31.1% reduction compared to MVRPR. Ministry of Education (MOE) Published version This research was partially funded by the Singapore Ministry of Education (MOE), Academic Research Fund under grant numbers Tier-1 RG48/21 and Tier-1 RG127/22. 2023-09-19T01:30:30Z 2023-09-19T01:30:30Z 2023 Journal Article Balasubramanian, P. & Maskell, D. L. (2023). FAC: a fault-tolerant design approach based on approximate computing. Electronics, 12(18), 3819-. https://dx.doi.org/10.3390/electronics12183819 2079-9292 https://hdl.handle.net/10356/170487 10.3390/electronics12183819 18 12 3819 en RG48/21 RG127/22 Electronics © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Hardware
Engineering::Electrical and electronic engineering::Electronic circuits
Engineering::Electrical and electronic engineering::Integrated circuits
Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Engineering::Computer science and engineering::Computing methodologies
Fault Tolerance
Redundancy
Approximate Computing
Arithmetic Circuits
Digital Logic Design
Low Power Design
spellingShingle Engineering::Computer science and engineering::Hardware
Engineering::Electrical and electronic engineering::Electronic circuits
Engineering::Electrical and electronic engineering::Integrated circuits
Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Engineering::Computer science and engineering::Computing methodologies
Fault Tolerance
Redundancy
Approximate Computing
Arithmetic Circuits
Digital Logic Design
Low Power Design
Balasubramanian, Padmanabhan
Maskell, Douglas Leslie
FAC: a fault-tolerant design approach based on approximate computing
description This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the processing unit may be a circuit or a system. However, TMR incurs more than 200% overhead in terms of area and power compared to a single processing unit. Alternative redundancy approaches have been proposed in the literature to mitigate these overheads associated with TMR, but they provide only partial or moderate fault tolerance. Among the alternatives, majority voting-based reduced precision redundancy (MVRPR) may be useful for error-resilient applications such as digital signal processing. While MVRPR guarantees only moderate fault tolerance, the proposed FAC is well-suited for error-resilient applications and ensures 100% tolerance to any single fault or a faulty processing unit, like TMR. In this work, we evaluate the performance of TMR, MVRPR, and FAC for a digital image processing application. The image processing results obtained demonstrate the effectiveness of FAC. Moreover, when the processing unit is implemented using a 28-nm CMOS technology, FAC achieves significant improvements over TMR, including a 15.3% reduction in delay, a 19.5% reduction in area, and a 24.7% reduction in power. Compared to MVRPR, FAC exhibits notable enhancements, with an 18% reduction in delay, a 5.4% reduction in area, and an 11.2% reduction in power. When considering the power-delay product, which reflects energy efficiency, FAC demonstrates a 36.2% reduction compared to TMR and a 27.2% reduction compared to MVRPR. When considering the power-delay-area product, which represents design efficiency, FAC achieves a 48.7% reduction compared to TMR and a 31.1% reduction compared to MVRPR.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Balasubramanian, Padmanabhan
Maskell, Douglas Leslie
format Article
author Balasubramanian, Padmanabhan
Maskell, Douglas Leslie
author_sort Balasubramanian, Padmanabhan
title FAC: a fault-tolerant design approach based on approximate computing
title_short FAC: a fault-tolerant design approach based on approximate computing
title_full FAC: a fault-tolerant design approach based on approximate computing
title_fullStr FAC: a fault-tolerant design approach based on approximate computing
title_full_unstemmed FAC: a fault-tolerant design approach based on approximate computing
title_sort fac: a fault-tolerant design approach based on approximate computing
publishDate 2023
url https://hdl.handle.net/10356/170487
_version_ 1779156269145260032