EvoLP: self-evolving latency predictor for model compression in real-time edge systems

EvoLP: self-evolving latency predictor for model compression in real-time edge systems

Edge devices are increasingly utilized for deploying deep learning applications on embedded systems. The real-time nature of many applications and the limited resources of edge devices necessitate latency-targeted neural network compression. However, measuring latency on real devices is challenging...

Full description

Saved in:

Bibliographic Details
Main Authors:	Huai, Shuo, Kong, Hao, Li, Shiqing, Luo, Xiangzhong, Subramaniam, Ravi, Makaya, Christian, Lin, Qian, Liu, Weichen
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering Predictive Models Hardware
Online Access:	https://hdl.handle.net/10356/171636
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

Latency-constrained DNN architecture learning for edge systems using zerorized batch normalization
by: Huai, Shuo, et al.
Published: (2023)

Collate: collaborative neural network learning for latency-critical edge systems
by: Huai, Shuo, et al.
Published: (2023)

EdgeCompress: coupling multi-dimensional model compression and dynamic inference for EdgeAI
by: Kong, Hao, et al.
Published: (2023)

ZeroBN : learning compact neural networks for latency-critical edge systems
by: Huai, Shuo, et al.
Published: (2022)

CRIMP: compact & reliable DNN inference on in-memory processing via crossbar-aligned compression and non-ideality adaptation
by: Huai, Shuo, et al.
Published: (2023)

On hardware-aware design and optimization of edge intelligence
by: Huai, Shuo, et al.
Published: (2023)

EMNAPE: efficient multi-dimensional neural architecture pruning for EdgeAI
by: Kong, Hao, et al.
Published: (2023)

Smart scissor: coupling spatial redundancy reduction and CNN compression for embedded hardware
by: Kong, Hao, et al.
Published: (2023)

EvoPass: Evolvable graphical password against shoulder-surfing attacks
by: YU, Xingjie, et al.
Published: (2017)

Crossbar-aligned & integer-only neural network compression for efficient in-memory acceleration
by: Huai, Shuo, et al.
Published: (2023)

Of super-evos and non-evos: Imagining karmic law in the 23rd century
by: Garcia, Leni
Published: (2009)

SonLP: Social network link prediction by principal component regression
by: Bao, Z., et al.
Published: (2016)

SurgeNAS: a comprehensive surgery on hardware-aware differentiable neural architecture search
by: Luo, Xiangzhong, et al.
Published: (2023)

Bringing AI to edge : from deep learning's perspective
by: Liu, Di, et al.
Published: (2022)

Towards efficient convolutional neural network for embedded hardware via multi-dimensional pruning
by: Kong, Hao, et al.
Published: (2023)

EDLAB : a benchmark for edge deep learning accelerators
by: Kong, Hao, et al.
Published: (2022)

Hybrid competitive-cooperative coevolution of decentralized controller in EvoTanks
by: Tan, Lawrence C.
Published: (2008)

An efficient gustavson-based sparse matrix-matrix multiplication accelerator on embedded FPGAs
by: Li, Shiqing, et al.
Published: (2023)

MUGNoC: a software-configured multicast-unicast-gather NoC for accelerating CNN dataflows
by: Chen, Hui, et al.
Published: (2023)

EdgeNAS: discovering efficient neural architectures for edge systems
by: Luo, Xiangzhong, et al.
Published: (2023)

HOX gene promoter prediction and inter-genomic comparison: An evo-devo study
by: Endriga, Maria A., et al.
Published: (2010)

Surveillance video analysis using compressive sensing with low latency
by: Jiang, H., et al.
Published: (2014)

Enabling efficient edge intelligence: a hardware-software codesign approach
by: Huai, Shuo
Published: (2023)

Reduced worst-case communication latency using single-cycle multi-hop traversal network-on-chip
by: Chen, Peng, et al.
Published: (2021)

HSCoNAS : hardware-software co-design of efficient DNNs via neural architecture search
by: Luo, Xiangzhong, et al.
Published: (2022)

JALAD : joint accuracy- and latency-aware deep structure decoupling for edge-cloud execution
by: Li, Hongshan, et al.
Published: (2020)

Low-latency compression of mocap data using learned spatial decorrelation transform
by: Hou, Junhui, et al.
Published: (2018)

Efficient FPGA-based sparse matrix-vector multiplication with data reuse-aware compression
by: Li, Shiqing, et al.
Published: (2023)

Biological factors and mannerisms as predictors for science achievement
by: Diego, Artemio A.
Published: (1990)

Neural network with genetically evolved algorithms for stocks prediction
by: Phua, P.K.H., et al.
Published: (2013)

A convergence predictor model for consensus-based decentralised energy markets
by: Pareek, Parikshit, et al.
Published: (2024)

LAMP: load-balanced multipath parallel transmission in point-to-point NoCs
by: Chen, Hui, et al.
Published: (2022)

High dimensional optical data - varifocal multiview imaging, compression and evaluation
by: Wu, Kejun, et al.
Published: (2024)

Second lumbrical-interossei latency difference: A strong predictor of median neuropathy at the wrist in uremic patients
by: Sharma, V.K., et al.
Published: (2011)

Second lumbrical-interossei latency difference: A strong predictor of median neuropathy at the wrist in uremic patients.
by: Sharma, V.K., et al.
Published: (2016)

Low-Dimensional Models for Compressed Sensing and Prediction of Large-Scale Traffic Data
by: Mitrovic, Nikola, et al.
Published: (2016)

A connectionist model of data compression in memory
by: Iyer, L.R., et al.
Published: (2014)

A practical low-power memristor-based analog neural branch predictor
by: Wang, J., et al.
Published: (2014)

The healthcare analytics landscape in Singapore: Evolving to deliver better care
by: TAN, Kar Way, et al.
Published: (2020)

Multi-agent trajectory prediction with heterogeneous edge-enhanced graph attention network
by: Mo, Xiaoyu, et al.
Published: (2022)