Given an input face photo, the goal of caricature generation is to produce stylized, exaggerated caricatures that share the same identity as the photo. It requires simultaneous style transfer and shape exaggeration with rich diversity, and meanwhile preserving the identity of the input. To address this challenging problem, we propose a novel framework called Multi-Warping GAN (MW-GAN), including a style network and a geometric network that are designed to conduct style transfer and geometric exaggeration respectively. We bridge the gap between the style/landmark space and their corresponding latent code spaces by a dual way design, so as to generate caricatures with arbitrary styles and geometric exaggeration, which can be specified either through random sampling of latent code or from a given caricature sample. Besides, we apply identity preserving loss to both image space and landmark space, leading to a great improvement in quality of generated caricatures. Experiments show that caricatures generated by MW-GAN have better quality than existing methods.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2021.3118984DOI Listing

Publication Analysis

Top Keywords

geometric exaggeration
12
multi-warping gan
8
caricature generation
8
style transfer
8
latent code
8
mw-gan multi-warping
4
gan caricature
4
generation multi-style
4
geometric
4
multi-style geometric
4

Similar Publications

As a non-contact method, vision-based measurement for vibration extraction and modal parameter identification has attracted much attention. In most cases, artificial textures are crucial elements for visual tracking, and this feature limits the application of vision-based vibration measurement on textureless targets. As a computation technique for visualizing subtle variations in videos, the video magnification technique can analyze modal responses and visualize modal shapes, but the efficiency is low, and the processing results contain clipping artifacts.

View Article and Find Full Text PDF
Article Synopsis
  • Genetic diseases linked to craniofacial malformations often result in issues with the craniocervical joint (CCJ), affecting mobility and potentially causing severe complications like headaches or spinal cord damage.
  • The study aimed to uncover the relationship between the anatomy and function of the CCJ, using data from nine cadaveric adults to analyze how bone shape influences movement.
  • Findings indicated that bone geometry significantly affects both primary and secondary movements of the CCJ, which may inform surgical approaches for correcting anomalies in children.
View Article and Find Full Text PDF

Caricaturing Shapes in Visual Memory.

Psychol Sci

July 2024

Department of Psychological and Brain Sciences, Johns Hopkins University.

When representing high-level stimuli, such as faces and animals, we tend to emphasize salient features-such as a face's prominent cheekbones or a bird's pointed beak. Such leaves traces in memory, which exaggerates these distinctive qualities. How broadly does this phenomenon extend? Here, in six experiments ( = 700 adults), we explored how memory automatically caricatures basic units of visual processing-simple geometric shapes-even without task-related demands to do so.

View Article and Find Full Text PDF

Morphological studies typically avoid using osteological samples that derive from captive animals because it is assumed that their morphology is not representative of wild populations. Rearing environments indeed differ between wild and captive individuals. For example, mechanical properties of the diets provided to captive animals can be drastically different from the food present in their natural habitats, which could impact cranial morphology and dental health.

View Article and Find Full Text PDF

A practical guide to calculating vocal tract length and scale-invariant formant patterns.

Behav Res Methods

September 2024

ENES Bioacoustics Research Laboratory, CRNL Center for Research in Neuroscience in Lyon, University of Saint Étienne, 42023, St-Étienne, France.

Formants (vocal tract resonances) are increasingly analyzed not only by phoneticians in speech but also by behavioral scientists studying diverse phenomena such as acoustic size exaggeration and articulatory abilities of non-human animals. This often involves estimating vocal tract length acoustically and producing scale-invariant representations of formant patterns. We present a theoretical framework and practical tools for carrying out this work, including open-source software solutions included in R packages soundgen and phonTools.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!