EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evalu...
Saved in:
Main Authors: | , , , , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/167488 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-167488 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1674882023-12-15T02:25:54Z EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI Kong, Hao Luo, Xiangzhong Huai, Shuo Liu, Di Subramaniam, Ravi Makaya, Christian Lin, Qian Liu, Weichen School of Computer Science and Engineering 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE) HP-NTU Digital Manufacturing Corporate Lab Engineering::Computer science and engineering Heuristic Algorithms Computational Modeling In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evaluate the importance of each pruning unit and identify the computational redundancy in the three dimensions. Based on the evaluation strategy, we further present a heuristic pruning algorithm to progressively prune redundant units from the three dimensions for better accuracy and efficiency. Experiments demonstrate the superiority of EMNAPE over existing methods. Ministry of Education (MOE) National Research Foundation (NRF) Submitted/Accepted version This study is partially supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner, HP Inc., through the HP-NTU Digital Manufacturing Corporate Lab (I1801E0028). This work is also partially supported by the Ministry of Education, Singapore, under its Academic Research Fund Tier 2 (MOE2019-T2-1-071), and Nanyang Technological University, Singapore, under its NAP (M4082282). 2023-06-12T01:28:51Z 2023-06-12T01:28:51Z 2023 Conference Paper Kong, H., Luo, X., Huai, S., Liu, D., Subramaniam, R., Makaya, C., Lin, Q. & Liu, W. (2023). EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI. 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE). https://dx.doi.org/10.23919/DATE56975.2023.10137122 https://hdl.handle.net/10356/167488 10.23919/DATE56975.2023.10137122 en I1801E0028 MOE2019-T2-1-071 NAP (M4082282) 10.21979/N9/HGFYTJ © 2023 EDAA. Published by IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.23919/DATE56975.2023.10137122. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Heuristic Algorithms Computational Modeling |
spellingShingle |
Engineering::Computer science and engineering Heuristic Algorithms Computational Modeling Kong, Hao Luo, Xiangzhong Huai, Shuo Liu, Di Subramaniam, Ravi Makaya, Christian Lin, Qian Liu, Weichen EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI |
description |
In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evaluate the importance of each pruning unit and identify the computational redundancy in the three dimensions. Based on the evaluation strategy, we further present a heuristic pruning algorithm to progressively prune redundant units from the three dimensions for better accuracy and efficiency. Experiments demonstrate the superiority of EMNAPE over existing methods. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Kong, Hao Luo, Xiangzhong Huai, Shuo Liu, Di Subramaniam, Ravi Makaya, Christian Lin, Qian Liu, Weichen |
format |
Conference or Workshop Item |
author |
Kong, Hao Luo, Xiangzhong Huai, Shuo Liu, Di Subramaniam, Ravi Makaya, Christian Lin, Qian Liu, Weichen |
author_sort |
Kong, Hao |
title |
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI |
title_short |
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI |
title_full |
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI |
title_fullStr |
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI |
title_full_unstemmed |
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI |
title_sort |
emnape: efficient multi-dimensional neural architecture pruning for edgeai |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/167488 |
_version_ |
1787136716005441536 |