Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
Conventional convolutional neural networks (CNNs) present a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to compute CNN training and inference. This paper investigates CMC of SpCNNs and its contributing components...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Published: |
Springer
2023
|
Subjects: | |
Online Access: | http://eprints.utm.my/105057/ http://dx.doi.org/10.1007/s10489-022-03756-1 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.105057 |
---|---|
record_format |
eprints |
spelling |
my.utm.1050572024-06-30T00:44:30Z http://eprints.utm.my/105057/ Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference Rizvi, Shahriyar Masud Ab. Rahman, Ab. Al-Hadi Sheikh, Usman Ullah Fuad, Kazi Ahmed Asif Shehzad, Hafiz Muhammad Faisal TK Electrical engineering. Electronics Nuclear engineering Conventional convolutional neural networks (CNNs) present a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to compute CNN training and inference. This paper investigates CMC of SpCNNs and its contributing components analytically and then proposes a methodology to optimize CMC, under three strategies, to enhance inference performance. In this methodology, output feature map (OFM) size, OFM depth or both are progressively reduced under an accuracy constraint to compute performance-optimized CNN inference. Before conducting training or testing, it can provide designers guidelines and preliminary insights regarding techniques for optimum performance, least degradation in accuracy and a balanced performance–accuracy trade-off. This methodology was evaluated on MNIST and Fashion MNIST datasets using LeNet-5 and AlexNet architectures. When compared to state-of-the-art SpCNN models, LeNet-5 achieves up to 4.2× (batch inference) and 4.1× (single-image inference) higher throughputs and 10.5× (batch inference) and 4.2× (single-image inference) greater energy efficiency at a maximum loss of 3% in test accuracy. When compared to the baseline model used in this study, AlexNet delivers 11.6× (batch inference) and 5× (single-image inference) increased throughput and 25× (batch inference) and 8.8× (single-image inference) more energy-efficient inference with just 4.4% reduction in accuracy. Springer 2023 Article PeerReviewed Rizvi, Shahriyar Masud and Ab. Rahman, Ab. Al-Hadi and Sheikh, Usman Ullah and Fuad, Kazi Ahmed Asif and Shehzad, Hafiz Muhammad Faisal (2023) Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference. Applied Intelligence, 53 (4). pp. 4499-4523. ISSN 0924-669X http://dx.doi.org/10.1007/s10489-022-03756-1 DOI : 10.1007/s10489-022-03756-1 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
TK Electrical engineering. Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering. Electronics Nuclear engineering Rizvi, Shahriyar Masud Ab. Rahman, Ab. Al-Hadi Sheikh, Usman Ullah Fuad, Kazi Ahmed Asif Shehzad, Hafiz Muhammad Faisal Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
description |
Conventional convolutional neural networks (CNNs) present a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to compute CNN training and inference. This paper investigates CMC of SpCNNs and its contributing components analytically and then proposes a methodology to optimize CMC, under three strategies, to enhance inference performance. In this methodology, output feature map (OFM) size, OFM depth or both are progressively reduced under an accuracy constraint to compute performance-optimized CNN inference. Before conducting training or testing, it can provide designers guidelines and preliminary insights regarding techniques for optimum performance, least degradation in accuracy and a balanced performance–accuracy trade-off. This methodology was evaluated on MNIST and Fashion MNIST datasets using LeNet-5 and AlexNet architectures. When compared to state-of-the-art SpCNN models, LeNet-5 achieves up to 4.2× (batch inference) and 4.1× (single-image inference) higher throughputs and 10.5× (batch inference) and 4.2× (single-image inference) greater energy efficiency at a maximum loss of 3% in test accuracy. When compared to the baseline model used in this study, AlexNet delivers 11.6× (batch inference) and 5× (single-image inference) increased throughput and 25× (batch inference) and 8.8× (single-image inference) more energy-efficient inference with just 4.4% reduction in accuracy. |
format |
Article |
author |
Rizvi, Shahriyar Masud Ab. Rahman, Ab. Al-Hadi Sheikh, Usman Ullah Fuad, Kazi Ahmed Asif Shehzad, Hafiz Muhammad Faisal |
author_facet |
Rizvi, Shahriyar Masud Ab. Rahman, Ab. Al-Hadi Sheikh, Usman Ullah Fuad, Kazi Ahmed Asif Shehzad, Hafiz Muhammad Faisal |
author_sort |
Rizvi, Shahriyar Masud |
title |
Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
title_short |
Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
title_full |
Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
title_fullStr |
Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
title_full_unstemmed |
Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
title_sort |
computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference |
publisher |
Springer |
publishDate |
2023 |
url |
http://eprints.utm.my/105057/ http://dx.doi.org/10.1007/s10489-022-03756-1 |
_version_ |
1803334979274407936 |