Adaptive loss-aware quantization for multi-bit networks
We investigate compressing deep neural networks by quantizing their weights and activations into multiple binary bases. The resulting multi-bit networks (MBNs) accelerate inference and reduce storage for deployment on low-resource mobile and embedded platforms. We propose Adapti...
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: | Institutional Knowledge at Singapore Management University, 2020 |
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/5251 https://ink.library.smu.edu.sg/context/sis_research/article/6254/viewcontent/cvpr20_qu.pdf |