GAN Compression: Efficient Architectures for Interactive Conditional GANs.

Muyang Li Ji Lin Yaoyao Ding Zhijian Liu Jun-Yan Zhu Song Han

IEEE Trans Pattern Anal Mach Intell

Published: December 2022

Conditional Generative Adversarial Networks (cGANs) have enabled controllable image synthesis for many vision and graphics applications. However, recent cGANs are 1-2 orders of magnitude more compute-intensive than modern recognition CNNs. For example, GauGAN consumes 281G MACs per image, compared to 0.44G MACs for MobileNet-v3, making it difficult for interactive deployment. In this work, we propose a general-purpose compression framework for reducing the inference time and model size of the generator in cGANs. Directly applying existing compression methods yields poor performance due to the difficulty of GAN training and the differences in generator architectures. We address these challenges in two ways. First, to stabilize GAN training, we transfer knowledge of multiple intermediate representations of the original model to its compressed model and unify unpaired and paired learning. Second, instead of reusing existing CNN designs, our method finds efficient architectures via neural architecture search. To accelerate the search process, we decouple the model training and search via weight sharing. Experiments demonstrate the effectiveness of our method across different supervision settings, network architectures, and learning methods. Without losing image quality, we reduce the computation of CycleGAN by 21×, Pix2pix by 12×, MUNIT by 29×, and GauGAN by 9×, paving the way for interactive image synthesis.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2021.3126742	DOI Listing

Publication Analysis

Top Keywords

efficient architectures

image synthesis

gan training

gan compression

compression efficient

architectures

architectures interactive

interactive conditional

conditional gans

gans conditional

Similar Publications

ClinClip: a Multimodal Language Pre-training model integrating EEG data for enhanced English medical listening assessment.

Front Neurosci

January 2025

The Basic Department, The Tourism College of Changchun University, Changchun, China.

Guangyu Sun

Introduction: In the field of medical listening assessments,accurate transcription and effective cognitive load management are critical for enhancing healthcare delivery. Traditional speech recognition systems, while successful in general applications often struggle in medical contexts where the cognitive state of the listener plays a significant role. These conventional methods typically rely on audio-only inputs and lack the ability to account for the listener's cognitive load, leading to reduced accuracy and effectiveness in complex medical environments.

View Article and Find Full Text PDF

Similar Publications

Unconventional Photocapacitor Utilizing Metal-Organic Dye Capable of Operating in Low Intensity Light.

ACS Appl Mater Interfaces

January 2025

Department of Chemistry, Sardar Patel University, Vallabh Vidyanagar 388120, Gujarat, India.

Karan Surana Darshna B Kanani Sanjay N Bariya Yash G Kapdi Ashita Sharma

The development of devices capable of storing energy harnessed from photons is on the rise, owing to the increasing global energy demand for smart systems. The majority of reports in this field cover the use of integrated type devices, which houses a separate photovoltaic module and supercapacitor or battery. Herein, we are reporting a photocapacitor with a simple two-electrode design, capable of operating without a conventional electrolyte or metal ions.

View Article and Find Full Text PDF

Similar Publications

Photodynamic and photothermal bacteria targeting nanosystems for synergistically combating bacteria and biofilms.

J Nanobiotechnology

January 2025

Shanghai Ninth People's Hospital, College of Stomatology, Shanghai Jiao Tong University School of Medicine, Shanghai Jiao Tong University, 639 Zhizaoju Road, Shanghai, 200011, China.

Xiao Wang Wenxuan Shi Yu Jin Zhuoyuan Li Tanjun Deng

The escalating hazards posed by bacterial infections underscore the imperative for pioneering advancements in next-generation antibacterial modalities and treatments. Present therapeutic methodologies are frequently impeded by the constraints of insufficient biofilm infiltration and the absence of precision in pathogen-specific targeting. In this current study, we have used chlorin e6 (Ce6), zeolitic imidazolate framework-8 (ZIF-8), polydopamine (PDA), and UBI peptide to formulate an innovative nanosystem meticulously engineered to confront bacterial infections and effectually dismantle biofilm architectures through the concerted mechanism of photodynamic therapy (PDT)/photothermal therapy (PTT) therapies, including in-depth research, especially for oral bacteria and oral biofilm.

View Article and Find Full Text PDF

Similar Publications

Unlocking the Power of 3D Convolutional Neural Networks for COVID-19 Detection: A Comprehensive Review.

J Imaging Inform Med

January 2025

College of Science and Engineering, Hamad Bin Khalifa University, Ar-Rayyan, Qatar.

Ademola E Ilesanmi Taiwo Ilesanmi Babatunde Ajayi Gbenga A Gbotoso Samir Brahim Belhaouari

The advent of three-dimensional convolutional neural networks (3D CNNs) has revolutionized the detection and analysis of COVID-19 cases. As imaging technologies have advanced, 3D CNNs have emerged as a powerful tool for segmenting and classifying COVID-19 in medical images. These networks have demonstrated both high accuracy and rapid detection capabilities, making them crucial for effective COVID-19 diagnostics.

View Article and Find Full Text PDF

Similar Publications

TOPSIS prefabricated building construction evaluation based on interval-valued Pythagorean fuzzy numbers based on prospect theory.

Sci Rep

January 2025

Institute of Architectural Engineering, Shanghai Zhongqiao Vocational and Technical University, Shanghai, 201514, China.

Lixin Chang Norhaiza Nordin Shiwei Zhao Xinhua Gu Ye Zhao

Prefabricated buildings have a series of advantages such as high efficiency, energy savings, and environmental protection, and are being strongly promoted by the Chinese government. However, due to the late start of prefabricated buildings in China, the installation process of prefabricated components is relatively complex, leading to difficulties in quality and safety control. A novel evaluation methodology integrating the technique for order preference by similarity to ideal solution (TOPSIS) with prospect theory and interval-valued Pythagorean fuzzy numbers (IVPFNs) is proposed.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!