Conditional Generative Adversarial Networks (cGANs) have enabled controllable image synthesis for many vision and graphics applications. However, recent cGANs are 1-2 orders of magnitude more compute-intensive than modern recognition CNNs. For example, GauGAN consumes 281G MACs per image, compared to 0.44G MACs for MobileNet-v3, making it difficult for interactive deployment. In this work, we propose a general-purpose compression framework for reducing the inference time and model size of the generator in cGANs. Directly applying existing compression methods yields poor performance due to the difficulty of GAN training and the differences in generator architectures. We address these challenges in two ways. First, to stabilize GAN training, we transfer knowledge of multiple intermediate representations of the original model to its compressed model and unify unpaired and paired learning. Second, instead of reusing existing CNN designs, our method finds efficient architectures via neural architecture search. To accelerate the search process, we decouple the model training and search via weight sharing. Experiments demonstrate the effectiveness of our method across different supervision settings, network architectures, and learning methods. Without losing image quality, we reduce the computation of CycleGAN by 21×, Pix2pix by 12×, MUNIT by 29×, and GauGAN by 9×, paving the way for interactive image synthesis.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2021.3126742 | DOI Listing |
Front Neurosci
January 2025
The Basic Department, The Tourism College of Changchun University, Changchun, China.
Introduction: In the field of medical listening assessments,accurate transcription and effective cognitive load management are critical for enhancing healthcare delivery. Traditional speech recognition systems, while successful in general applications often struggle in medical contexts where the cognitive state of the listener plays a significant role. These conventional methods typically rely on audio-only inputs and lack the ability to account for the listener's cognitive load, leading to reduced accuracy and effectiveness in complex medical environments.
View Article and Find Full Text PDFACS Appl Mater Interfaces
January 2025
Department of Chemistry, Sardar Patel University, Vallabh Vidyanagar 388120, Gujarat, India.
The development of devices capable of storing energy harnessed from photons is on the rise, owing to the increasing global energy demand for smart systems. The majority of reports in this field cover the use of integrated type devices, which houses a separate photovoltaic module and supercapacitor or battery. Herein, we are reporting a photocapacitor with a simple two-electrode design, capable of operating without a conventional electrolyte or metal ions.
View Article and Find Full Text PDFJ Nanobiotechnology
January 2025
Shanghai Ninth People's Hospital, College of Stomatology, Shanghai Jiao Tong University School of Medicine, Shanghai Jiao Tong University, 639 Zhizaoju Road, Shanghai, 200011, China.
The escalating hazards posed by bacterial infections underscore the imperative for pioneering advancements in next-generation antibacterial modalities and treatments. Present therapeutic methodologies are frequently impeded by the constraints of insufficient biofilm infiltration and the absence of precision in pathogen-specific targeting. In this current study, we have used chlorin e6 (Ce6), zeolitic imidazolate framework-8 (ZIF-8), polydopamine (PDA), and UBI peptide to formulate an innovative nanosystem meticulously engineered to confront bacterial infections and effectually dismantle biofilm architectures through the concerted mechanism of photodynamic therapy (PDT)/photothermal therapy (PTT) therapies, including in-depth research, especially for oral bacteria and oral biofilm.
View Article and Find Full Text PDFJ Imaging Inform Med
January 2025
College of Science and Engineering, Hamad Bin Khalifa University, Ar-Rayyan, Qatar.
The advent of three-dimensional convolutional neural networks (3D CNNs) has revolutionized the detection and analysis of COVID-19 cases. As imaging technologies have advanced, 3D CNNs have emerged as a powerful tool for segmenting and classifying COVID-19 in medical images. These networks have demonstrated both high accuracy and rapid detection capabilities, making them crucial for effective COVID-19 diagnostics.
View Article and Find Full Text PDFSci Rep
January 2025
Institute of Architectural Engineering, Shanghai Zhongqiao Vocational and Technical University, Shanghai, 201514, China.
Prefabricated buildings have a series of advantages such as high efficiency, energy savings, and environmental protection, and are being strongly promoted by the Chinese government. However, due to the late start of prefabricated buildings in China, the installation process of prefabricated components is relatively complex, leading to difficulties in quality and safety control. A novel evaluation methodology integrating the technique for order preference by similarity to ideal solution (TOPSIS) with prospect theory and interval-valued Pythagorean fuzzy numbers (IVPFNs) is proposed.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!