Given an input face photo, the goal of caricature generation is to produce stylized, exaggerated caricatures that share the same identity as the photo. It requires simultaneous style transfer and shape exaggeration with rich diversity, and meanwhile preserving the identity of the input. To address this challenging problem, we propose a novel framework called Multi-Warping GAN (MW-GAN), including a style network and a geometric network that are designed to conduct style transfer and geometric exaggeration respectively. We bridge the gap between the style/landmark space and their corresponding latent code spaces by a dual way design, so as to generate caricatures with arbitrary styles and geometric exaggeration, which can be specified either through random sampling of latent code or from a given caricature sample. Besides, we apply identity preserving loss to both image space and landmark space, leading to a great improvement in quality of generated caricatures. Experiments show that caricatures generated by MW-GAN have better quality than existing methods.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TIP.2021.3118984 | DOI Listing |
Sensors (Basel)
July 2024
College of Engineering, Anhui Agricultural University, Hefei 230036, China.
As a non-contact method, vision-based measurement for vibration extraction and modal parameter identification has attracted much attention. In most cases, artificial textures are crucial elements for visual tracking, and this feature limits the application of vision-based vibration measurement on textureless targets. As a computation technique for visualizing subtle variations in videos, the video magnification technique can analyze modal responses and visualize modal shapes, but the efficiency is low, and the processing results contain clipping artifacts.
View Article and Find Full Text PDFJ Morphol
July 2024
Arts et Métiers - Institute of Technology, Institut de Biomécanique Humaine Georges Charpak, Paris, France.
Psychol Sci
July 2024
Department of Psychological and Brain Sciences, Johns Hopkins University.
When representing high-level stimuli, such as faces and animals, we tend to emphasize salient features-such as a face's prominent cheekbones or a bird's pointed beak. Such leaves traces in memory, which exaggerates these distinctive qualities. How broadly does this phenomenon extend? Here, in six experiments ( = 700 adults), we explored how memory automatically caricatures basic units of visual processing-simple geometric shapes-even without task-related demands to do so.
View Article and Find Full Text PDFJ Anat
June 2024
Centre for Human Evolution Research, Natural History Museum, London, UK.
Morphological studies typically avoid using osteological samples that derive from captive animals because it is assumed that their morphology is not representative of wild populations. Rearing environments indeed differ between wild and captive individuals. For example, mechanical properties of the diets provided to captive animals can be drastically different from the food present in their natural habitats, which could impact cranial morphology and dental health.
View Article and Find Full Text PDFBehav Res Methods
September 2024
ENES Bioacoustics Research Laboratory, CRNL Center for Research in Neuroscience in Lyon, University of Saint Étienne, 42023, St-Étienne, France.
Formants (vocal tract resonances) are increasingly analyzed not only by phoneticians in speech but also by behavioral scientists studying diverse phenomena such as acoustic size exaggeration and articulatory abilities of non-human animals. This often involves estimating vocal tract length acoustically and producing scale-invariant representations of formant patterns. We present a theoretical framework and practical tools for carrying out this work, including open-source software solutions included in R packages soundgen and phonTools.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!