ZeroBN : learning compact neural networks for latency-critical edge systems

Edge devices have been widely adopted to bring deep learning applications onto low power embedded systems, mitigating the privacy and latency issues of accessing cloud servers. The increasingly computational demand of complex neural network models leads to large latency on edge devices with limited...

Full description

Saved in:

Bibliographic Details
Main Authors:	Huai, Shuo, Zhang, Lei, Liu, Di, Liu, Weichen, Subramaniam, Ravi
Other Authors:	School of Computer Science and Engineering
Format:	Conference or Workshop Item
Language:	English
Published:	2022
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence ZeroBN Compact Learning
Online Access:	https://hdl.handle.net/10356/155572
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-155572
record_format	dspace
spelling	sg-ntu-dr.10356-1555722022-03-11T01:10:53Z ZeroBN : learning compact neural networks for latency-critical edge systems Huai, Shuo Zhang, Lei Liu, Di Liu, Weichen Subramaniam, Ravi School of Computer Science and Engineering 2021 58th ACM/IEEE Design Automation Conference (DAC) HP-NTU Digital Manufacturing Corporate Lab Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence ZeroBN Compact Learning Edge devices have been widely adopted to bring deep learning applications onto low power embedded systems, mitigating the privacy and latency issues of accessing cloud servers. The increasingly computational demand of complex neural network models leads to large latency on edge devices with limited resources. Many application scenarios are real-time and have a strict latency constraint, while conventional neural network compression methods are not latency-oriented. In this work, we propose a novel compact neural networks training method to reduce the model latency on latency-critical edge systems. A latency predictor is also introduced to guide and optimize this procedure. Coupled with the latency predictor, our method can guarantee the latency for a compact model by only one training process. The experiment results show that, compared to state-of-the-art model compression methods, our approach can well-fit the 'hard' latency constraint by significantly reducing the latency with a mild accuracy drop. To satisfy a 34ms latency constraint, we compact ResNet-50 with 0.82% of accuracy drop. And for GoogLeNet, we can even increase the accuracy by 0.3% National Research Foundation (NRF) Submitted/Accepted version This research was conducted in collaboration with HP Inc. and supported by National Research Foundation (NRF) Singapore and the Singapore Government through the Industry Alignment Fund - Industry Collaboration Projects Grant (I1801E0028). 2022-03-11T01:10:53Z 2022-03-11T01:10:53Z 2021 Conference Paper Huai, S., Zhang, L., Liu, D., Liu, W. & Subramaniam, R. (2021). ZeroBN : learning compact neural networks for latency-critical edge systems. 2021 58th ACM/IEEE Design Automation Conference (DAC), 151-156. https://dx.doi.org/10.1109/DAC18074.2021.9586309 9781665432740 https://hdl.handle.net/10356/155572 10.1109/DAC18074.2021.9586309 2-s2.0-85119416118 151 156 en I1801E0028 10.21979/N9/IRNJ4I ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/DAC18074.2021.9586309. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence ZeroBN Compact Learning
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence ZeroBN Compact Learning Huai, Shuo Zhang, Lei Liu, Di Liu, Weichen Subramaniam, Ravi ZeroBN : learning compact neural networks for latency-critical edge systems
description	Edge devices have been widely adopted to bring deep learning applications onto low power embedded systems, mitigating the privacy and latency issues of accessing cloud servers. The increasingly computational demand of complex neural network models leads to large latency on edge devices with limited resources. Many application scenarios are real-time and have a strict latency constraint, while conventional neural network compression methods are not latency-oriented. In this work, we propose a novel compact neural networks training method to reduce the model latency on latency-critical edge systems. A latency predictor is also introduced to guide and optimize this procedure. Coupled with the latency predictor, our method can guarantee the latency for a compact model by only one training process. The experiment results show that, compared to state-of-the-art model compression methods, our approach can well-fit the 'hard' latency constraint by significantly reducing the latency with a mild accuracy drop. To satisfy a 34ms latency constraint, we compact ResNet-50 with 0.82% of accuracy drop. And for GoogLeNet, we can even increase the accuracy by 0.3%
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Huai, Shuo Zhang, Lei Liu, Di Liu, Weichen Subramaniam, Ravi
format	Conference or Workshop Item
author	Huai, Shuo Zhang, Lei Liu, Di Liu, Weichen Subramaniam, Ravi
author_sort	Huai, Shuo
title	ZeroBN : learning compact neural networks for latency-critical edge systems
title_short	ZeroBN : learning compact neural networks for latency-critical edge systems
title_full	ZeroBN : learning compact neural networks for latency-critical edge systems
title_fullStr	ZeroBN : learning compact neural networks for latency-critical edge systems
title_full_unstemmed	ZeroBN : learning compact neural networks for latency-critical edge systems
title_sort	zerobn : learning compact neural networks for latency-critical edge systems
publishDate	2022
url	https://hdl.handle.net/10356/155572
_version_	1728433417925689344

ZeroBN : learning compact neural networks for latency-critical edge systems

Similar Items