TAB : unified and optimized ternary, binary and mixed-precision neural network inference on the edge
Ternary Neural Networks (TNNs) and mixed-precision Ternary Binary Networks (TBNs) have demonstrated higher accuracy compared to Binary Neural Networks (BNNs) while providing fast, low-power and memory-efficient inference. Related works have improved the accuracy of TNNs and TBNs, but overlooked thei...
Saved in:
Main Authors: | Zhu, Shien, Duong, Luan H. K., Liu, Weichen |
---|---|
Other Authors: | School of Computer Science and Engineering |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/155648 https://doi.org/10.21979/N9/RZ75BY |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
XOR-Net : an efficient computation pipeline for binary neural network inference on edge devices
by: Zhu, Shien, et al.
Published: (2020) -
EdgeCompress: coupling multi-dimensional model compression and dynamic inference for EdgeAI
by: Kong, Hao, et al.
Published: (2023) -
FAT: an in-memory accelerator with fast addition for ternary weight neural networks
by: Zhu, Shien, et al.
Published: (2022) -
EdgeNAS: discovering efficient neural architectures for edge systems
by: Luo, Xiangzhong, et al.
Published: (2023) -
iMAT: energy-efficient in-memory acceleration for ternary neural networks with sparse dot product
by: Zhu, Shien, et al.
Published: (2023)