Adaptive loss-aware quantization for multi-bit networks
We investigate the compression of deep neural networks by quantizing their weights and activations into multiple binary bases, known as multi-bit networks (MBNs), which accelerate inference and reduce storage for deployment on low-resource mobile and embedded platforms. We propose Adapti...
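The abstract describes approximating full-precision weights with a weighted sum of binary bases, W ≈ Σᵢ αᵢ Bᵢ with Bᵢ ∈ {−1, +1}ⁿ. A minimal sketch of this idea, using a simple greedy residual scheme (an illustrative assumption, not the paper's ALQ algorithm; all names here are hypothetical):

```python
import numpy as np

def binary_bases(w, num_bases):
    """Greedily approximate w by sum_i alpha_i * b_i with b_i in {-1, +1}^n.

    Each step binarizes the current residual; given b = sign(r), the
    least-squares-optimal scale is alpha = mean(|r|). Illustrative only.
    """
    residual = np.asarray(w, dtype=np.float64).copy()
    alphas, bases = [], []
    for _ in range(num_bases):
        b = np.sign(residual)
        b[b == 0] = 1.0                  # sign(0) -> +1 to keep entries in {-1, +1}
        alpha = np.abs(residual).mean()  # optimal scale for this binary basis
        alphas.append(alpha)
        bases.append(b)
        residual -= alpha * b
    return np.array(alphas), np.stack(bases)

w = np.array([0.9, -0.4, 0.1, -0.8])
alphas, bases = binary_bases(w, 3)
approx = (alphas[:, None] * bases).sum(axis=0)  # multi-bit reconstruction of w
```

With more bases the reconstruction error shrinks, while the dot products needed at inference reduce to cheap sign operations plus a few scalings per basis.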
Main Authors: QU, Zhongnan; ZHOU, Zimu; CHENG, Yun; THIELE, Lothar
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2020
Online Access: https://ink.library.smu.edu.sg/sis_research/5251 https://ink.library.smu.edu.sg/context/sis_research/article/6254/viewcontent/cvpr20_qu.pdf
Institution: Singapore Management University
Similar Items
- p-Meta: Towards on-device deep model adaptation
  by: QU, Zhongnan, et al.
  Published: (2022)
- Bounds on the optimal quantization performance of dynamically quantized linear systems with bounded noise
  by: Ling, Q., et al.
  Published: (2014)
- Numerical studies on quantized vortex dynamics in superfluidity and superconductivity
  by: TANG, Qinglin
  Published: (2013)
- Adaptive vertex quantization for mesh compression
  by: Qiu, Z.M., et al.
  Published: (2014)
- Certified quantization strategy synthesis for neural networks
  by: ZHANG, Yedi, et al.
  Published: (2024)