EvoLP: self-evolving latency predictor for model compression in real-time edge systems

Edge devices are increasingly utilized for deploying deep learning applications on embedded systems. The real-time nature of many applications and the limited resources of edge devices necessitate latency-targeted neural network compression. However, measuring latency on real devices is challenging...

全面介紹

Saved in:
書目詳細資料
Main Authors: Huai, Shuo, Kong, Hao, Li, Shiqing, Luo, Xiangzhong, Subramaniam, Ravi, Makaya, Christian, Lin, Qian, Liu, Weichen
其他作者: School of Computer Science and Engineering
格式: Article
語言:English
出版: 2023
主題:
在線閱讀:https://hdl.handle.net/10356/171636
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!