This paper aims at accelerating and compressing deep neural networks to deploy CNN models into small devices like mobile phones or embedded gadgets. We focus on filter level pruning, i.e., the whole filter will be discarded if it is less important. An effective and unified framework, ThiNet (stands for "Thin Net"), is proposed in this paper. We formally establish filter pruning as an optimization problem, and reveal that we need to prune filters based on statistics computed from its next layer, not the current layer, which differentiates ThiNet from existing methods. We also propose "gcos" (Group COnvolution with Shuffling), a more accurate group convolution scheme, to further reduce the pruned model size. Experimental results demonstrate the effectiveness of our method, which has advanced the state-of-the-art. Moreover, we show that the original VGG-16 model can be compressed into a very small model (ThiNet-Tiny) with only 2.66 MB model size, but still preserve AlexNet level accuracy. This small model is evaluated on several benchmarks with different vision tasks (e.g., classification, detection, segmentation), and shows excellent generalization ability.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2018.2858232DOI Listing

Publication Analysis

Top Keywords

group convolution
8
model size
8
small model
8
model
5
thinet pruning
4
pruning cnn
4
cnn filters
4
filters thinner
4
thinner net
4
net paper
4

Similar Publications

Lymphadenopathy is associated with lymph node abnormal size or consistency due to many causes. We employed the deep convolutional neural network ResNet-34 to detect and classify CT images from patients with abdominal lymphadenopathy and healthy controls. We created a single database containing 1400 source CT images for patients with abdominal lymphadenopathy (n = 700) and healthy controls (n = 700).

View Article and Find Full Text PDF

Aim: To develop and validate an artificial intelligence (AI)-powered tool based on convolutional neural network (CNN) for automatic segmentation of root canals in single-rooted teeth using cone-beam computed tomography (CBCT).

Methodology: A total of 69 CBCT scans were retrospectively recruited from a hospital database and acquired from two devices with varying protocols. These scans were randomly assigned to the training (n = 31, 88 teeth), validation (n = 8, 15 teeth) and testing (n = 30, 120 teeth) sets.

View Article and Find Full Text PDF

Introduction: Porcine reproductive and respiratory syndrome virus (PRRSV) is a major pathogen that has caused severe economic losses in the swine industry. Screening key host immune-related genetic factors in the porcine alveolar macrophages (PAMs) is critical to improve the anti-virial ability in pigs.

Methods: In this study, an model was set to evaluate the anti-PRRSV effect of tylvalosin tartrates.

View Article and Find Full Text PDF

In Vivo Confocal Microscopy for Automated Detection of Meibomian Gland Dysfunction: A Study Based on Deep Convolutional Neural Networks.

J Imaging Inform Med

January 2025

Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, National Clinical Research Center for Eye Disease, Shanghai, 200080, China.

The objectives of this study are to construct a deep convolutional neural network (DCNN) model to diagnose and classify meibomian gland dysfunction (MGD) based on the in vivo confocal microscope (IVCM) images and to evaluate the performance of the DCNN model and its auxiliary significance for clinical diagnosis and treatment. We extracted 6643 IVCM images from the three hospitals' IVCM database as the training set for the DCNN model and 1661 IVCM images from the other two hospitals' IVCM database as the test set to examine the performance of the model. Construction of the DCNN model was performed using DenseNet-169.

View Article and Find Full Text PDF

Scaling of ventral hippocampal activity during anxiety.

J Neurosci

January 2025

Laboratory of Systems Neuroscience, Department of Physiology, University of Bern, Bern, Switzerland.

The hippocampus supports a multiplicity of functions, with the dorsal region contributing to spatial representations and memory, and the ventral hippocampus (vH) being primarily involved in emotional processing. While spatial encoding has been extensively investigated, how the vH activity is tuned to emotional states, e.g.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!