FAC: a fault-tolerant design approach based on approximate computing
This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/170487 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-170487 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1704872023-09-22T15:35:45Z FAC: a fault-tolerant design approach based on approximate computing Balasubramanian, Padmanabhan Maskell, Douglas Leslie School of Computer Science and Engineering Hardware & Embedded Systems Lab (HESL) Engineering::Computer science and engineering::Hardware Engineering::Electrical and electronic engineering::Electronic circuits Engineering::Electrical and electronic engineering::Integrated circuits Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies Fault Tolerance Redundancy Approximate Computing Arithmetic Circuits Digital Logic Design Low Power Design This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the processing unit may be a circuit or a system. However, TMR incurs more than 200% overhead in terms of area and power compared to a single processing unit. Alternative redundancy approaches have been proposed in the literature to mitigate these overheads associated with TMR, but they provide only partial or moderate fault tolerance. Among the alternatives, majority voting-based reduced precision redundancy (MVRPR) may be useful for error-resilient applications such as digital signal processing. While MVRPR guarantees only moderate fault tolerance, the proposed FAC is well-suited for error-resilient applications and ensures 100% tolerance to any single fault or a faulty processing unit, like TMR. In this work, we evaluate the performance of TMR, MVRPR, and FAC for a digital image processing application. The image processing results obtained demonstrate the effectiveness of FAC. Moreover, when the processing unit is implemented using a 28-nm CMOS technology, FAC achieves significant improvements over TMR, including a 15.3% reduction in delay, a 19.5% reduction in area, and a 24.7% reduction in power. Compared to MVRPR, FAC exhibits notable enhancements, with an 18% reduction in delay, a 5.4% reduction in area, and an 11.2% reduction in power. When considering the power-delay product, which reflects energy efficiency, FAC demonstrates a 36.2% reduction compared to TMR and a 27.2% reduction compared to MVRPR. When considering the power-delay-area product, which represents design efficiency, FAC achieves a 48.7% reduction compared to TMR and a 31.1% reduction compared to MVRPR. Ministry of Education (MOE) Published version This research was partially funded by the Singapore Ministry of Education (MOE), Academic Research Fund under grant numbers Tier-1 RG48/21 and Tier-1 RG127/22. 2023-09-19T01:30:30Z 2023-09-19T01:30:30Z 2023 Journal Article Balasubramanian, P. & Maskell, D. L. (2023). FAC: a fault-tolerant design approach based on approximate computing. Electronics, 12(18), 3819-. https://dx.doi.org/10.3390/electronics12183819 2079-9292 https://hdl.handle.net/10356/170487 10.3390/electronics12183819 18 12 3819 en RG48/21 RG127/22 Electronics © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Hardware Engineering::Electrical and electronic engineering::Electronic circuits Engineering::Electrical and electronic engineering::Integrated circuits Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies Fault Tolerance Redundancy Approximate Computing Arithmetic Circuits Digital Logic Design Low Power Design |
spellingShingle |
Engineering::Computer science and engineering::Hardware Engineering::Electrical and electronic engineering::Electronic circuits Engineering::Electrical and electronic engineering::Integrated circuits Engineering::Electrical and electronic engineering::Computer hardware, software and systems Engineering::Computer science and engineering::Computing methodologies Fault Tolerance Redundancy Approximate Computing Arithmetic Circuits Digital Logic Design Low Power Design Balasubramanian, Padmanabhan Maskell, Douglas Leslie FAC: a fault-tolerant design approach based on approximate computing |
description |
This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the processing unit may be a circuit or a system. However, TMR incurs more than 200% overhead in terms of area and power compared to a single processing unit. Alternative redundancy approaches have been proposed in the literature to mitigate these overheads associated with TMR, but they provide only partial or moderate fault tolerance. Among the alternatives, majority voting-based reduced precision redundancy (MVRPR) may be useful for error-resilient applications such as digital signal processing. While MVRPR guarantees only moderate fault tolerance, the proposed FAC is well-suited for error-resilient applications and ensures 100% tolerance to any single fault or a faulty processing unit, like TMR. In this work, we evaluate the performance of TMR, MVRPR, and FAC for a digital image processing application. The image processing results obtained demonstrate the effectiveness of FAC. Moreover, when the processing unit is implemented using a 28-nm CMOS technology, FAC achieves significant improvements over TMR, including a 15.3% reduction in delay, a 19.5% reduction in area, and a 24.7% reduction in power. Compared to MVRPR, FAC exhibits notable enhancements, with an 18% reduction in delay, a 5.4% reduction in area, and an 11.2% reduction in power. When considering the power-delay product, which reflects energy efficiency, FAC demonstrates a 36.2% reduction compared to TMR and a 27.2% reduction compared to MVRPR. When considering the power-delay-area product, which represents design efficiency, FAC achieves a 48.7% reduction compared to TMR and a 31.1% reduction compared to MVRPR. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Balasubramanian, Padmanabhan Maskell, Douglas Leslie |
format |
Article |
author |
Balasubramanian, Padmanabhan Maskell, Douglas Leslie |
author_sort |
Balasubramanian, Padmanabhan |
title |
FAC: a fault-tolerant design approach based on approximate computing |
title_short |
FAC: a fault-tolerant design approach based on approximate computing |
title_full |
FAC: a fault-tolerant design approach based on approximate computing |
title_fullStr |
FAC: a fault-tolerant design approach based on approximate computing |
title_full_unstemmed |
FAC: a fault-tolerant design approach based on approximate computing |
title_sort |
fac: a fault-tolerant design approach based on approximate computing |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/170487 |
_version_ |
1779156269145260032 |