AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles

Recent breakthroughs in deep neural networks (DNNs) have fueled a tremendously growing demand for bringing DNN-powered intelligence into mobile platforms. While the potential of deploying DNNs on resource-constrained platforms has been demonstrated by DNN compression techniques, the current practice...

Full description

Saved in:
Bibliographic Details
Main Authors: LIU, Sicong, DU, Junzhao, NAN, Kaiming, ZHOU, Zimu, LIU, Hui, WANG, Zhangyang, LIN, Yingyan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6648
https://ink.library.smu.edu.sg/context/sis_research/article/7651/viewcontent/tmc21_liu.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7651
record_format dspace
spelling sg-smu-ink.sis_research-76512022-01-14T03:20:28Z AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles LIU, Sicong DU, Junzhao NAN, Kaiming ZHOU, Zimu LIU, Hui WANG, Zhangyang LIN, Yingyan Recent breakthroughs in deep neural networks (DNNs) have fueled a tremendously growing demand for bringing DNN-powered intelligence into mobile platforms. While the potential of deploying DNNs on resource-constrained platforms has been demonstrated by DNN compression techniques, the current practice suffers from two limitations: 1) merely stand-alone compression schemes are investigated even though each compression technique only suit for certain types of DNN layers; and 2) mostly compression techniques are optimized for DNNs’ inference accuracy, without explicitly considering other application-driven system performance (e.g., latency and energy cost) and the varying resource availability across platforms (e.g., storage and processing capability). To this end, we propose AdaDeep, a usage-driven, automated DNN compression framework for systematically exploring the desired trade-off between performance and resource constraints, from a holistic system level. Specifically, in a layer-wise manner, AdaDeep automatically selects the most suitable combination of compression techniques and the corresponding compression hyperparameters for a given DNN. Thorough evaluations on six datasets and across twelve devices demonstrate that AdaDeep can achieve up to 18.6×18.6×18.6× latency reduction, 9.8×9.8×9.8× energy-efficiency improvement, and 37.3×37.3×37.3× storage reduction in DNNs while incurring negligible accuracy loss. Furthermore, AdaDeep also uncovers multiple novel combinations of compression techniques. 2021-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6648 info:doi/10.1109/TMC.2020.2999956 https://ink.library.smu.edu.sg/context/sis_research/article/7651/viewcontent/tmc21_liu.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Optimization Mobile Computing Mobile Handsets Mobile Applications Energy Storage Measurement Computational Modeling Artificial Intelligence and Robotics Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Optimization
Mobile Computing
Mobile Handsets
Mobile Applications
Energy Storage
Measurement
Computational Modeling
Artificial Intelligence and Robotics
Software Engineering
spellingShingle Optimization
Mobile Computing
Mobile Handsets
Mobile Applications
Energy Storage
Measurement
Computational Modeling
Artificial Intelligence and Robotics
Software Engineering
LIU, Sicong
DU, Junzhao
NAN, Kaiming
ZHOU, Zimu
LIU, Hui
WANG, Zhangyang
LIN, Yingyan
AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
description Recent breakthroughs in deep neural networks (DNNs) have fueled a tremendously growing demand for bringing DNN-powered intelligence into mobile platforms. While the potential of deploying DNNs on resource-constrained platforms has been demonstrated by DNN compression techniques, the current practice suffers from two limitations: 1) merely stand-alone compression schemes are investigated even though each compression technique only suit for certain types of DNN layers; and 2) mostly compression techniques are optimized for DNNs’ inference accuracy, without explicitly considering other application-driven system performance (e.g., latency and energy cost) and the varying resource availability across platforms (e.g., storage and processing capability). To this end, we propose AdaDeep, a usage-driven, automated DNN compression framework for systematically exploring the desired trade-off between performance and resource constraints, from a holistic system level. Specifically, in a layer-wise manner, AdaDeep automatically selects the most suitable combination of compression techniques and the corresponding compression hyperparameters for a given DNN. Thorough evaluations on six datasets and across twelve devices demonstrate that AdaDeep can achieve up to 18.6×18.6×18.6× latency reduction, 9.8×9.8×9.8× energy-efficiency improvement, and 37.3×37.3×37.3× storage reduction in DNNs while incurring negligible accuracy loss. Furthermore, AdaDeep also uncovers multiple novel combinations of compression techniques.
format text
author LIU, Sicong
DU, Junzhao
NAN, Kaiming
ZHOU, Zimu
LIU, Hui
WANG, Zhangyang
LIN, Yingyan
author_facet LIU, Sicong
DU, Junzhao
NAN, Kaiming
ZHOU, Zimu
LIU, Hui
WANG, Zhangyang
LIN, Yingyan
author_sort LIU, Sicong
title AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
title_short AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
title_full AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
title_fullStr AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
title_full_unstemmed AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
title_sort adadeep: a usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6648
https://ink.library.smu.edu.sg/context/sis_research/article/7651/viewcontent/tmc21_liu.pdf
_version_ 1770576016641622016