Transform Quantization for CNN Compression.

Sean I Young Wang Zhe David Taubman Bernd Girod

IEEE Trans Pattern Anal Mach Intell

Published: September 2022

In this paper, we compress convolutional neural network (CNN) weights post-training via transform quantization. Previous CNN quantization techniques tend to ignore the joint statistics of weights and activations, producing sub-optimal CNN performance at a given quantization bit-rate, or consider their joint statistics during training only and do not facilitate efficient compression of already trained CNN models. We optimally transform (decorrelate) and quantize the weights post-training using a rate-distortion framework to improve compression at any given quantization bit-rate. Transform quantization unifies quantization and dimensionality reduction (decorrelation) techniques in a single framework to facilitate low bit-rate compression of CNNs and efficient inference in the transform domain. We first introduce a theory of rate and distortion for CNN quantization and pose optimum quantization as a rate-distortion optimization problem. We then show that this problem can be solved using optimal bit-depth allocation following decorrelation by the optimal End-to-end Learned Transform (ELT) we derive in this paper. Experiments demonstrate that transform quantization advances the state of the art in CNN compression in both retrained and non-retrained quantization scenarios. In particular, we find that transform quantization with retraining is able to compress CNN models such as AlexNet, ResNet and DenseNet to very low bit-rates (1-2 bits).

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2021.3084839	DOI Listing

Publication Analysis

Top Keywords

transform quantization

quantization

transform

cnn

cnn compression

weights post-training

cnn quantization

joint statistics

quantization bit-rate

cnn models

Similar Publications

Integrated fusion approach for multi-class heart disease classification through ECG and PCG signals with deep hybrid neural networks.

Sci Rep

March 2025

Department of Cardiology, National Institue of Medical Science, NIMS University, Jaipur, Rajasthan, India.

Shivalila Hangaragi N Neelima Katarina Jegdic Amitesh Nagarwal

Detection and classification of cardiovascular diseases are crucial for early diagnosis and prediction of heart-related conditions. Existing methods rely on either electrocardiogram or phonocardiogram signals, resulting in higher false positive rates. Solely ECG misses the murmurs associated with the narrowing of the blood vessels caused by abnormalities in the heart.

View Article and Find Full Text PDF

Similar Publications

IMAGE CLASSIFICATION-DRIVEN SPEECH DISORDER DETECTION USING DEEP LEARNING TECHNIQUE.

SLAS Technol

March 2025

Department of Documents and Archive, Center of Documents and Administrative Communication, King Faisal University, Al Hofuf, 31982, Al-Ahsa, Saudi Arabia.

Nasser Ali Aljarallah Ashit Kumar Dutta Abdul Rahaman Wahab Sait

Speech disorders affect an individual's ability to generate sounds or utilize the voice appropriately. Neurological, developmental, physical, and trauma may cause speech disorders. Speech impairments influence communication, social interaction, education, and quality of life.

View Article and Find Full Text PDF

Similar Publications

Interpretable Optimization-Inspired Deep Network for Off-Grid Frequency Estimation.

IEEE Trans Neural Netw Learn Syst

February 2025

Pingping Pan Yunjian Zhang You Li Yishan Ye Wei He

The accuracy of on-grid frequency estimation methods suffers from the quantization error of discrete grids. In this article, a deep unfolded network for off-grid frequency estimation is proposed, dubbed OGFreq. In the OGFreq, there exist two kinds of variables.

View Article and Find Full Text PDF

Similar Publications

Energy-Adaptive Bitstream-Layer Model for Perceptual Quality Assessment of V-PCC Encoded 3D Point Clouds.

IEEE Trans Image Process

January 2025

Wusi Sang Honglei Su Qi Liu Hui Yuan Zhou Wang

The scope of point cloud (PC) applications is expanding. We propose a no-reference bitstream-layer quality assessment model that eliminates the need for full decoding of the PC, providing quality evaluation scores during the V-PCC decoding process. Specifically, we illustrate the relationship between content diversity (CD) and perceptual coding distortion in lossless geometric coding.

View Article and Find Full Text PDF

Similar Publications

Practical Compact Deep Compressed Sensing.

IEEE Trans Pattern Anal Mach Intell

November 2024

Bin Chen Jian Zhang

Recent years have witnessed the success of deep networks in compressed sensing (CS), which allows for a significant reduction in sampling cost and has gained growing attention since its inception. In this paper, we propose a new practical and compact network dubbed PCNet for general image CS. Specifically, in PCNet, a novel collaborative sampling operator is designed, which consists of a deep conditional filtering step and a dual-branch fast sampling step.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!