Transform Quantization for CNN Compression.

IEEE Trans Pattern Anal Mach Intell

Published: September 2022

In this paper, we compress convolutional neural network (CNN) weights post-training via transform quantization. Previous CNN quantization techniques tend to ignore the joint statistics of weights and activations, producing sub-optimal CNN performance at a given quantization bit-rate, or consider their joint statistics during training only and do not facilitate efficient compression of already trained CNN models. We optimally transform (decorrelate) and quantize the weights post-training using a rate-distortion framework to improve compression at any given quantization bit-rate. Transform quantization unifies quantization and dimensionality reduction (decorrelation) techniques in a single framework to facilitate low bit-rate compression of CNNs and efficient inference in the transform domain. We first introduce a theory of rate and distortion for CNN quantization and pose optimum quantization as a rate-distortion optimization problem. We then show that this problem can be solved using optimal bit-depth allocation following decorrelation by the optimal End-to-end Learned Transform (ELT) we derive in this paper. Experiments demonstrate that transform quantization advances the state of the art in CNN compression in both retrained and non-retrained quantization scenarios. In particular, we find that transform quantization with retraining is able to compress CNN models such as AlexNet, ResNet and DenseNet to very low bit-rates (1-2 bits).
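The bit-depth allocation step mentioned in the abstract can be illustrated with the classical transform-coding model. This is a minimal sketch, not the paper's derivation: it assumes the weights have already been decorrelated, that each transform component i has a known variance, and that its quantization distortion follows the textbook high-rate model D_i = var_i * 2^(-2 * b_i). The function names `allocate_bits` and `total_distortion` and the example variances are hypothetical, and the ELT itself is not reproduced here.

```python
def allocate_bits(variances, total_bits):
    """Greedy integer bit allocation (illustrative assumption, not the paper's rule).

    Uses the high-rate distortion model D_i = var_i * 2**(-2 * b_i).
    """
    bits = [0] * len(variances)
    for _ in range(total_bits):
        # Modeled distortion of each component at its current bit-depth.
        dist = [v * 2.0 ** (-2 * b) for v, b in zip(variances, bits)]
        # One extra bit removes 3/4 of a component's distortion, so the
        # component with the largest distortion gives the largest reduction.
        i = max(range(len(dist)), key=dist.__getitem__)
        bits[i] += 1
    return bits


def total_distortion(variances, bits):
    """Sum of modeled per-component distortions."""
    return sum(v * 2.0 ** (-2 * b) for v, b in zip(variances, bits))


# Hypothetical variances of four decorrelated components.
variances = [16.0, 4.0, 1.0, 0.25]
bits = allocate_bits(variances, total_bits=8)
uniform = [2, 2, 2, 2]  # same 8-bit budget, spread evenly

print("allocated bits:", bits)
print("distortion (allocated):", total_distortion(variances, bits))
print("distortion (uniform):  ", total_distortion(variances, uniform))
```

Under this model, high-variance components receive more bits than low-variance ones, and the resulting total distortion is lower than with a uniform split of the same budget. That is the intuition behind allocating bit-depths after decorrelation.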


Source
http://dx.doi.org/10.1109/TPAMI.2021.3084839


Similar Publications

Detection and classification of cardiovascular diseases are crucial for early diagnosis and prediction of heart-related conditions. Existing methods rely on either electrocardiogram (ECG) or phonocardiogram signals alone, resulting in higher false positive rates. ECG on its own misses the murmurs associated with the narrowing of the blood vessels caused by abnormalities in the heart.


Image Classification-Driven Speech Disorder Detection Using Deep Learning Technique.

SLAS Technol

March 2025

Department of Documents and Archive, Center of Documents and Administrative Communication, King Faisal University, Al Hofuf, 31982, Al-Ahsa, Saudi Arabia.

Speech disorders affect an individual's ability to generate sounds or use the voice appropriately. Neurological, developmental, physical, and trauma-related factors may cause speech disorders. Speech impairments influence communication, social interaction, education, and quality of life.


The accuracy of on-grid frequency estimation methods suffers from the quantization error of discrete grids. In this article, a deep unfolded network for off-grid frequency estimation, dubbed OGFreq, is proposed. OGFreq involves two kinds of variables.


The scope of point cloud (PC) applications is expanding. We propose a no-reference bitstream-layer quality assessment model that eliminates the need for full decoding of the PC, providing quality evaluation scores during the V-PCC decoding process. Specifically, we illustrate the relationship between content diversity (CD) and perceptual coding distortion in lossless geometric coding.


Practical Compact Deep Compressed Sensing.

IEEE Trans Pattern Anal Mach Intell

November 2024

Recent years have witnessed the success of deep networks in compressed sensing (CS), which allows for a significant reduction in sampling cost and has gained growing attention since its inception. In this paper, we propose a new practical and compact network dubbed PCNet for general image CS. Specifically, in PCNet, a novel collaborative sampling operator is designed, which consists of a deep conditional filtering step and a dual-branch fast sampling step.

