Deep Neural Network Compression by In-Parallel Pruning-Quantization.

IEEE Trans Pattern Anal Mach Intell

Published: March 2020

Deep neural networks enable state-of-the-art accuracy on visual recognition tasks such as image classification and object detection. However, modern networks contain millions of learned connections, and the current trend is towards deeper and more densely connected architectures. This poses a challenge to the deployment of state-of-the-art networks on resource-constrained systems, such as smartphones or mobile robots. In general, a more efficient utilization of computation resources would assist in deployment scenarios from embedded platforms to computing clusters running ensembles of networks. In this paper, we propose a deep network compression algorithm that performs weight pruning and quantization jointly, and in parallel with fine-tuning. Our approach takes advantage of the complementary nature of pruning and quantization and recovers from premature pruning errors, which is not possible with two-stage approaches. In experiments on ImageNet, CLIP-Q (Compression Learning by In-Parallel Pruning-Quantization) improves the state-of-the-art in network compression on AlexNet, VGGNet, GoogLeNet, and ResNet. We additionally demonstrate that CLIP-Q is complementary to efficient network architecture design by compressing MobileNet and ShuffleNet, and that CLIP-Q generalizes beyond convolutional networks by compressing a memory network for visual question answering.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2018.2886192DOI Listing

Publication Analysis

Top Keywords

network compression
12
deep neural
8
in-parallel pruning-quantization
8
pruning quantization
8
network
5
networks
5
neural network
4
compression
4
compression in-parallel
4
pruning-quantization deep
4

Similar Publications

Background: Astroblastoma is an extremely rare tumor of the central nervous system, and its origin and validity as a different entity are still being debated. Because of its rarity and similarities to other glial neoplasms, it is often misdiagnosed, impacting treatment and outcomes.

Observations: Astroblastoma is very rare and mainly affects children and young adults.

View Article and Find Full Text PDF

We measure the response of open-cell polyurethane foams filled with a dense suspension of fumed silica particles in polyethylene glycol at compression speeds spanning several orders of magnitude. The gradual compressive stress increase of the composite material indicates the existence of shear rate gradients in the interstitial suspension caused by wide distributions in pore sizes in the disordered foam network. The energy dissipated during compression scales with an effective internal shear rate, allowing for the collapse of three data sets for different pore-size foams.

View Article and Find Full Text PDF

Development of a machine learning algorithm to identify cauda equina compression on MRI scans.

World Neurosurg

January 2025

Department of Neurosurgery, Manchester Centre for Clinical Neurosciences, Salford Royal Hospital, M6 8HD, Manchester, England, United Kingdom.

Objective: Cauda Equina Syndrome (CES) poses significant neurological risks if untreated. Diagnosis relies on clinical and radiological features. As the symptoms are often non specific and common, the diagnosis is usually made after a MRI scan.

View Article and Find Full Text PDF

As the Internet becomes increasingly popular, the number of users connected to it grows significantly. Consequently, the packet processing speed of network systems, such as routers, must be enhanced. IP lookup is a critical task used to find the next hop address by searching for the longest prefix match in the forwarding information base (FIB).

View Article and Find Full Text PDF

Middle ear barotrauma (MEBT) is the most common complication in providing hyperbaric oxygen therapy (HBO). This study explored the impact of altering the shape of the time-pressure curve with the aim of reducing the occurrence of MEBT and optimizing the HBO experience during the pressurization process. Four distinct mathematically derived protocols-Constant Pressure Difference (CPD), Constant Volume Difference (CVD), Constant Ratio (CR), and Inverted Constant Ratio (ICR)-were investigated using computer simulations on a simple ear model.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!