Toward Unbiased High-Quality Portraits through Latent-Space Evaluation.

J Imaging

Department of Control and Computer Engineering, Politecnico di Torino, 10129 Torino, Italy.

Published: June 2024

Images, texts, voices, and signals can be synthesized by latent spaces in a multidimensional vector, which can be explored without the hurdles of noise or other interfering factors. In this paper, we present a practical use case that demonstrates the power of latent space in exploring complex realities such as image space. We focus on DaVinciFace, an AI-based system that explores the StyleGAN2 space to create a high-quality portrait for anyone in the style of the Renaissance genius Leonardo da Vinci. The user enters one of their portraits and receives the corresponding Da Vinci-style portrait as an output. Since most of Da Vinci's artworks depict young and beautiful women (e.g., "La Belle Ferroniere", "Beatrice de' Benci"), we investigate the ability of DaVinciFace to account for other social categorizations, including gender, race, and age. The experimental results evaluate the effectiveness of our methodology on 1158 portraits acting on the vector representations of the latent space to produce high-quality portraits that retain the facial features of the subject's social categories, and conclude that sparser vectors have a greater effect on these features. To objectively evaluate and quantify our results, we solicited human feedback via a crowd-sourcing campaign. Analysis of the human feedback showed a high tolerance for the loss of important identity features in the resulting portraits when the Da Vinci style is more pronounced, with some exceptions, including Africanized individuals.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11278512PMC
http://dx.doi.org/10.3390/jimaging10070157DOI Listing

Publication Analysis

Top Keywords

high-quality portraits
8
latent space
8
human feedback
8
portraits
5
unbiased high-quality
4
portraits latent-space
4
latent-space evaluation
4
evaluation images
4
images texts
4
texts voices
4

Similar Publications

Toward Unbiased High-Quality Portraits through Latent-Space Evaluation.

J Imaging

June 2024

Department of Control and Computer Engineering, Politecnico di Torino, 10129 Torino, Italy.

Images, texts, voices, and signals can be synthesized by latent spaces in a multidimensional vector, which can be explored without the hurdles of noise or other interfering factors. In this paper, we present a practical use case that demonstrates the power of latent space in exploring complex realities such as image space. We focus on DaVinciFace, an AI-based system that explores the StyleGAN2 space to create a high-quality portrait for anyone in the style of the Renaissance genius Leonardo da Vinci.

View Article and Find Full Text PDF

This paper attempts to provide an overview of the history of Japanese psychopathology by presenting concise portraits of the second generation of Japanese psychopathologists, whose era is considered to be the heyday of Japanese psychopathology. Meanwhile, we also consider the historical background of the psychiatric reform movement in Japan that influenced many second-generation psychopathologists. First, the paper briefly discusses the emergence of the first-generation of psychopathologists through the adoption of German-centered psychiatry after the Meiji era.

View Article and Find Full Text PDF

Deepfake smiles matter less-the psychological and neural impact of presumed AI-generated faces.

Sci Rep

September 2023

Department of Psychology, Faculty of Life Sciences, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099, Berlin, Germany.

High-quality AI-generated portraits ("deepfakes") are becoming increasingly prevalent. Understanding the responses they evoke in perceivers is crucial in assessing their societal implications. Here we investigate the impact of the belief that depicted persons are real or deepfakes on psychological and neural measures of human face perception.

View Article and Find Full Text PDF

Using portraits to quantify the changes of generalized social trust in European history: A replication study.

PLoS One

September 2023

Département d'études Cognitives, Institut Jean Nicod, ENS, EHESS, CNRS, PSL Research University, Paris, France.

A portrait is an exercise of impression management: the sitter can choose the impression she or he wants to create in the eyes of others': competence, trustworthiness, dominance, etc. Indirectly, this choice informs us about the qualities that were specifically valued at the time the portrait was created. In a previous paper, we have shown that cues of perceived trustworthiness in portraits increased in time during the modern period in Europe, meaning that people probably granted more importance to be seen as a trustworthy person.

View Article and Find Full Text PDF

With the development of three-dimensional (3D) light-field display technology, 3D scenes with correct location information and depth information can be perceived without wearing any external device. Only 2D stylized portrait images can be generated with traditional portrait stylization methods and it is difficult to produce high-quality stylized portrait content for 3D light-field displays. 3D light-field displays require the generation of content with accurate depth and spatial information, which is not achievable with 2D images alone.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!