EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI

In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evalu...

Full description

Saved in:
Bibliographic Details
Main Authors: Kong, Hao, Luo, Xiangzhong, Huai, Shuo, Liu, Di, Subramaniam, Ravi, Makaya, Christian, Lin, Qian, Liu, Weichen
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/167488
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-167488
record_format dspace
spelling sg-ntu-dr.10356-1674882023-12-15T02:25:54Z EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI Kong, Hao Luo, Xiangzhong Huai, Shuo Liu, Di Subramaniam, Ravi Makaya, Christian Lin, Qian Liu, Weichen School of Computer Science and Engineering 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE) HP-NTU Digital Manufacturing Corporate Lab Engineering::Computer science and engineering Heuristic Algorithms Computational Modeling In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evaluate the importance of each pruning unit and identify the computational redundancy in the three dimensions. Based on the evaluation strategy, we further present a heuristic pruning algorithm to progressively prune redundant units from the three dimensions for better accuracy and efficiency. Experiments demonstrate the superiority of EMNAPE over existing methods. Ministry of Education (MOE) National Research Foundation (NRF) Submitted/Accepted version This study is partially supported under the RIE2020 Industry Alignment Fund – Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner, HP Inc., through the HP-NTU Digital Manufacturing Corporate Lab (I1801E0028). This work is also partially supported by the Ministry of Education, Singapore, under its Academic Research Fund Tier 2 (MOE2019-T2-1-071), and Nanyang Technological University, Singapore, under its NAP (M4082282). 2023-06-12T01:28:51Z 2023-06-12T01:28:51Z 2023 Conference Paper Kong, H., Luo, X., Huai, S., Liu, D., Subramaniam, R., Makaya, C., Lin, Q. & Liu, W. (2023). EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI. 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE). https://dx.doi.org/10.23919/DATE56975.2023.10137122 https://hdl.handle.net/10356/167488 10.23919/DATE56975.2023.10137122 en I1801E0028 MOE2019-T2-1-071 NAP (M4082282) 10.21979/N9/HGFYTJ © 2023 EDAA. Published by IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.23919/DATE56975.2023.10137122. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Heuristic Algorithms
Computational Modeling
spellingShingle Engineering::Computer science and engineering
Heuristic Algorithms
Computational Modeling
Kong, Hao
Luo, Xiangzhong
Huai, Shuo
Liu, Di
Subramaniam, Ravi
Makaya, Christian
Lin, Qian
Liu, Weichen
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
description In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evaluate the importance of each pruning unit and identify the computational redundancy in the three dimensions. Based on the evaluation strategy, we further present a heuristic pruning algorithm to progressively prune redundant units from the three dimensions for better accuracy and efficiency. Experiments demonstrate the superiority of EMNAPE over existing methods.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Kong, Hao
Luo, Xiangzhong
Huai, Shuo
Liu, Di
Subramaniam, Ravi
Makaya, Christian
Lin, Qian
Liu, Weichen
format Conference or Workshop Item
author Kong, Hao
Luo, Xiangzhong
Huai, Shuo
Liu, Di
Subramaniam, Ravi
Makaya, Christian
Lin, Qian
Liu, Weichen
author_sort Kong, Hao
title EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
title_short EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
title_full EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
title_fullStr EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
title_full_unstemmed EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
title_sort emnape: efficient multi-dimensional neural architecture pruning for edgeai
publishDate 2023
url https://hdl.handle.net/10356/167488
_version_ 1787136716005441536