EdgeCompress: coupling multi-dimensional model compression and dynamic inference for EdgeAI
Convolutional neural networks (CNNs) have demonstrated encouraging results in image classification tasks. However, the prohibitive computational cost of CNNs hinders the deployment of CNNs onto resource-constrained embedded devices. To address this issue, we propose, a comprehensive compression fram...
Saved in:
Main Authors: | Kong, Hao, Liu, Di, Huai, Shuo, Luo, Xiangzhong, Subramaniam, Ravi, Makaya, Christian, Lin, Qian, Liu, Weichen |
---|---|
Other Authors: | School of Computer Science and Engineering |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/171623 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
by: Kong, Hao, et al.
Published: (2023) -
EvoLP: self-evolving latency predictor for model compression in real-time edge systems
by: Huai, Shuo, et al.
Published: (2023) -
On hardware-aware design and optimization of edge intelligence
by: Huai, Shuo, et al.
Published: (2023) -
CRIMP: compact & reliable DNN inference on in-memory processing via crossbar-aligned compression and non-ideality adaptation
by: Huai, Shuo, et al.
Published: (2023) -
Collate: collaborative neural network learning for latency-critical edge systems
by: Huai, Shuo, et al.
Published: (2023)