Conditional Generative Adversarial Networks (cGANs) have enabled controllable image synthesis for many vision and graphics applications. However, recent cGANs are 1-2 orders of magnitude more compute-intensive than modern recognition CNNs. For example, GauGAN consumes 281G MACs per image, compared to 0.44G MACs for MobileNet-v3, making it difficult for interactive deployment. In this work, we propose a general-purpose compression framework for reducing the inference time and model size of the generator in cGANs. Directly applying existing compression methods yields poor performance due to the difficulty of GAN training and the differences in generator architectures. We address these challenges in two ways. First, to stabilize GAN training, we transfer knowledge of multiple intermediate representations of the original model to its compressed model and unify unpaired and paired learning. Second, instead of reusing existing CNN designs, our method finds efficient architectures via neural architecture search. To accelerate the search process, we decouple the model training and search via weight sharing. Experiments demonstrate the effectiveness of our method across different supervision settings, network architectures, and learning methods. Without losing image quality, we reduce the computation of CycleGAN by 21×, Pix2pix by 12×, MUNIT by 29×, and GauGAN by 9×, paving the way for interactive image synthesis.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2021.3126742DOI Listing

Publication Analysis

Top Keywords

efficient architectures
8
image synthesis
8
gan training
8
gan compression
4
compression efficient
4
architectures
4
architectures interactive
4
interactive conditional
4
conditional gans
4
gans conditional
4

Similar Publications

Introduction: In the field of medical listening assessments,accurate transcription and effective cognitive load management are critical for enhancing healthcare delivery. Traditional speech recognition systems, while successful in general applications often struggle in medical contexts where the cognitive state of the listener plays a significant role. These conventional methods typically rely on audio-only inputs and lack the ability to account for the listener's cognitive load, leading to reduced accuracy and effectiveness in complex medical environments.

View Article and Find Full Text PDF

The development of devices capable of storing energy harnessed from photons is on the rise, owing to the increasing global energy demand for smart systems. The majority of reports in this field cover the use of integrated type devices, which houses a separate photovoltaic module and supercapacitor or battery. Herein, we are reporting a photocapacitor with a simple two-electrode design, capable of operating without a conventional electrolyte or metal ions.

View Article and Find Full Text PDF

Photodynamic and photothermal bacteria targeting nanosystems for synergistically combating bacteria and biofilms.

J Nanobiotechnology

January 2025

Shanghai Ninth People's Hospital, College of Stomatology, Shanghai Jiao Tong University School of Medicine, Shanghai Jiao Tong University, 639 Zhizaoju Road, Shanghai, 200011, China.

The escalating hazards posed by bacterial infections underscore the imperative for pioneering advancements in next-generation antibacterial modalities and treatments. Present therapeutic methodologies are frequently impeded by the constraints of insufficient biofilm infiltration and the absence of precision in pathogen-specific targeting. In this current study, we have used chlorin e6 (Ce6), zeolitic imidazolate framework-8 (ZIF-8), polydopamine (PDA), and UBI peptide to formulate an innovative nanosystem meticulously engineered to confront bacterial infections and effectually dismantle biofilm architectures through the concerted mechanism of photodynamic therapy (PDT)/photothermal therapy (PTT) therapies, including in-depth research, especially for oral bacteria and oral biofilm.

View Article and Find Full Text PDF

The advent of three-dimensional convolutional neural networks (3D CNNs) has revolutionized the detection and analysis of COVID-19 cases. As imaging technologies have advanced, 3D CNNs have emerged as a powerful tool for segmenting and classifying COVID-19 in medical images. These networks have demonstrated both high accuracy and rapid detection capabilities, making them crucial for effective COVID-19 diagnostics.

View Article and Find Full Text PDF

Prefabricated buildings have a series of advantages such as high efficiency, energy savings, and environmental protection, and are being strongly promoted by the Chinese government. However, due to the late start of prefabricated buildings in China, the installation process of prefabricated components is relatively complex, leading to difficulties in quality and safety control. A novel evaluation methodology integrating the technique for order preference by similarity to ideal solution (TOPSIS) with prospect theory and interval-valued Pythagorean fuzzy numbers (IVPFNs) is proposed.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!