Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference

Conventional convolutional neural networks (CNNs) present a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to compute CNN training and inference. This paper investigates CMC of SpCNNs and its contributing components...

Full description

Saved in:
Bibliographic Details
Main Authors: Rizvi, Shahriyar Masud, Ab. Rahman, Ab. Al-Hadi, Sheikh, Usman Ullah, Fuad, Kazi Ahmed Asif, Shehzad, Hafiz Muhammad Faisal
Format: Article
Published: Springer 2023
Subjects:
Online Access:http://eprints.utm.my/105057/
http://dx.doi.org/10.1007/s10489-022-03756-1
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.105057
record_format eprints
spelling my.utm.1050572024-06-30T00:44:30Z http://eprints.utm.my/105057/ Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference Rizvi, Shahriyar Masud Ab. Rahman, Ab. Al-Hadi Sheikh, Usman Ullah Fuad, Kazi Ahmed Asif Shehzad, Hafiz Muhammad Faisal TK Electrical engineering. Electronics Nuclear engineering Conventional convolutional neural networks (CNNs) present a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to compute CNN training and inference. This paper investigates CMC of SpCNNs and its contributing components analytically and then proposes a methodology to optimize CMC, under three strategies, to enhance inference performance. In this methodology, output feature map (OFM) size, OFM depth or both are progressively reduced under an accuracy constraint to compute performance-optimized CNN inference. Before conducting training or testing, it can provide designers guidelines and preliminary insights regarding techniques for optimum performance, least degradation in accuracy and a balanced performance–accuracy trade-off. This methodology was evaluated on MNIST and Fashion MNIST datasets using LeNet-5 and AlexNet architectures. When compared to state-of-the-art SpCNN models, LeNet-5 achieves up to 4.2× (batch inference) and 4.1× (single-image inference) higher throughputs and 10.5× (batch inference) and 4.2× (single-image inference) greater energy efficiency at a maximum loss of 3% in test accuracy. When compared to the baseline model used in this study, AlexNet delivers 11.6× (batch inference) and 5× (single-image inference) increased throughput and 25× (batch inference) and 8.8× (single-image inference) more energy-efficient inference with just 4.4% reduction in accuracy. Springer 2023 Article PeerReviewed Rizvi, Shahriyar Masud and Ab. Rahman, Ab. Al-Hadi and Sheikh, Usman Ullah and Fuad, Kazi Ahmed Asif and Shehzad, Hafiz Muhammad Faisal (2023) Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference. Applied Intelligence, 53 (4). pp. 4499-4523. ISSN 0924-669X http://dx.doi.org/10.1007/s10489-022-03756-1 DOI : 10.1007/s10489-022-03756-1
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TK Electrical engineering. Electronics Nuclear engineering
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Rizvi, Shahriyar Masud
Ab. Rahman, Ab. Al-Hadi
Sheikh, Usman Ullah
Fuad, Kazi Ahmed Asif
Shehzad, Hafiz Muhammad Faisal
Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
description Conventional convolutional neural networks (CNNs) present a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to compute CNN training and inference. This paper investigates CMC of SpCNNs and its contributing components analytically and then proposes a methodology to optimize CMC, under three strategies, to enhance inference performance. In this methodology, output feature map (OFM) size, OFM depth or both are progressively reduced under an accuracy constraint to compute performance-optimized CNN inference. Before conducting training or testing, it can provide designers guidelines and preliminary insights regarding techniques for optimum performance, least degradation in accuracy and a balanced performance–accuracy trade-off. This methodology was evaluated on MNIST and Fashion MNIST datasets using LeNet-5 and AlexNet architectures. When compared to state-of-the-art SpCNN models, LeNet-5 achieves up to 4.2× (batch inference) and 4.1× (single-image inference) higher throughputs and 10.5× (batch inference) and 4.2× (single-image inference) greater energy efficiency at a maximum loss of 3% in test accuracy. When compared to the baseline model used in this study, AlexNet delivers 11.6× (batch inference) and 5× (single-image inference) increased throughput and 25× (batch inference) and 8.8× (single-image inference) more energy-efficient inference with just 4.4% reduction in accuracy.
format Article
author Rizvi, Shahriyar Masud
Ab. Rahman, Ab. Al-Hadi
Sheikh, Usman Ullah
Fuad, Kazi Ahmed Asif
Shehzad, Hafiz Muhammad Faisal
author_facet Rizvi, Shahriyar Masud
Ab. Rahman, Ab. Al-Hadi
Sheikh, Usman Ullah
Fuad, Kazi Ahmed Asif
Shehzad, Hafiz Muhammad Faisal
author_sort Rizvi, Shahriyar Masud
title Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
title_short Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
title_full Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
title_fullStr Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
title_full_unstemmed Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
title_sort computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
publisher Springer
publishDate 2023
url http://eprints.utm.my/105057/
http://dx.doi.org/10.1007/s10489-022-03756-1
_version_ 1803334979274407936