Heatmap regression (HR) has become one of the mainstream approaches for face alignment and has obtained promising results under constrained environments. However, when a face image suffers from large pose variations, heavy occlusions and complicated illuminations, the performances of HR methods degrade greatly due to the low resolutions of the generated landmark heatmaps and the exclusion of important high-order information that can be used to learn more discriminative features. To address the alignment problem for faces with extremely large poses and heavy occlusions, this paper proposes a heatmap subpixel regression (HSR) method and a multi-order cross geometry-aware (MCG) model, which are seamlessly integrated into a novel multi-order high-precision hourglass network (MHHN). The HSR method is proposed to achieve high-precision landmark detection by a well-designed subpixel detection loss (SDL) and subpixel detection technology (SDT). At the same time, the MCG model is able to use the proposed multi-order cross information to learn more discriminative representations for enhancing facial geometric constraints and context information. To the best of our knowledge, this is the first study to explore heatmap subpixel regression for robust and high-precision face alignment. The experimental results from challenging benchmark datasets demonstrate that our approach outperforms state-of-the-art methods in the literature.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2020.3032029DOI Listing

Publication Analysis

Top Keywords

face alignment
12
multi-order high-precision
8
high-precision hourglass
8
hourglass network
8
heavy occlusions
8
learn discriminative
8
heatmap subpixel
8
subpixel regression
8
hsr method
8
multi-order cross
8

Similar Publications

In current alloplastic total temporomandibular joint replacements (TMJRs) typically the lateral pterygoid muscle (LPM) insertion is sacrificed, affecting joint function. This study assesses a novel additively manufactured TMJR (CADskills BV, Gent, Belgium) designed to enable LPM reinsertion through a scaffold feature on the implant. Thirteen TMJRs were implanted in Swifter crossbreed sheep, with follow-up CT scans after 288 days to evaluate LPM reintegration.

View Article and Find Full Text PDF

The human visual system possesses a remarkable ability to detect and process faces across diverse contexts, including the phenomenon of face pareidolia--seeing faces in inanimate objects. Despite extensive research, it remains unclear why the visual system employs such broadly tuned face detection capabilities. We hypothesized that face pareidolia results from the visual system's optimization for recognizing both faces and objects.

View Article and Find Full Text PDF

Today, most research evaluation frameworks are designed to assess mature projects with well-defined data and clearly articulated outcomes. Yet, few, if any, are equipped to evaluate the promise of early-stage research, which is inherently characterized by limited evidence, high uncertainty, and evolving objectives. These early-stage projects require nuanced assessments that can adapt to incomplete information, project maturity, and shifting research questions.

View Article and Find Full Text PDF

Global health prioritizes improving health and achieving equity in health for all people worldwide. It encompasses a wide range of efforts, including disease prevention and treatment, health promotion, healthcare delivery, and addressing health disparities across borders. Short-term medical and surgical missions often contribute to the global health landscape, especially in low and lower-middle income countries.

View Article and Find Full Text PDF

Detection of circular permutations by Protein Language Models.

Comput Struct Biotechnol J

December 2024

School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, Shandong 250300, China.

Protein circular permutations are crucial for understanding protein evolution and functionality. Traditional detection methods face challenges: sequence-based approaches struggle with detecting distant homologs, while structure-based approaches are limited by the need for structure generation and often treat proteins as rigid bodies. Protein Language Model-based alignment tools have shown advantages in utilizing sequence information to overcome the challenges of detecting distant homologs without requiring structural input.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!