Unsupervised landmark learning is the task of learning semantic keypoint-like representations without the use of expensive input keypoint annotations. A popular approach is to factorize an image into a pose and appearance data stream, then to reconstruct the image from the factorized components. The pose representation should capture a set of consistent and tightly localized landmarks in order to facilitate reconstruction of the input image. Ultimately, we wish for our learned landmarks to focus on the foreground object of interest. However, the reconstruction task of the entire image forces the model to allocate landmarks to model the background. Using a motion-based foreground assumption, this work explores the effects of factorizing the reconstruction task into separate foreground and background reconstructions in an unsupervised way, allowing the model to condition only the foreground reconstruction on the unsupervised landmarks. Our experiments demonstrate that the proposed factorization results in landmarks that are focused on the foreground object of interest when measured against ground-truth foreground masks. Furthermore, the rendered background quality is also improved as ill-suited landmarks are no longer forced to model this content. We demonstrate this improvement via improved image fidelity in a video-prediction task. Code is available at https://github.com/NVIDIA/UnsupervisedLandmarkLearning.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2021.3055560DOI Listing

Publication Analysis

Top Keywords

pose appearance
8
foreground object
8
object interest
8
reconstruction task
8
landmarks
6
foreground
6
image
5
unsupervised
4
unsupervised disentanglement
4
disentanglement pose
4

Similar Publications

This study determined the concentrations and seasonal variations of phthalate esters (PAEs) in water and sediment samples of the receiving stream within the vicinity of the Obafemi Awolowo University, Ile-Ife dumpsite. The objective of this study was to evaluate the pollution status of the study area by determining the levels of PAEs in water and sediment samples. This assessment aimed to understand the presence and extent of phthalate ester pollution in the study area.

View Article and Find Full Text PDF

Road traffic accidents pose a significant global health concern, with an alarming 1.19 million fatalities reported in 2021. Traditionally, strategies to address this challenge have relied on expert input and subjective evaluations.

View Article and Find Full Text PDF

Healthcare-associated infections (HAI), particularly those involving multi-drug resistant organisms (MDRO), pose a significant public health threat. Understanding the transmission of these pathogens in short-term acute care hospitals (STACH) is crucial for effective control. Mathematical and computational models play a key role in studying transmission but often overlook the influence of long-term care facilities (LTCFs) and the broader community on transmission.

View Article and Find Full Text PDF

A novel adaptive lightweight multimodal efficient feature inference network ALME-FIN for EEG emotion recognition.

Cogn Neurodyn

December 2025

School of Mechatronical Engineering, Beijing Institute of Technology, No. 5 Zhongguancun South Street, Haidian District, Beijing, 100081 China.

Enhancing the accuracy of emotion recognition models through multimodal learning is a common approach. However, challenges such as insufficient modal feature learning in multimodal inference and scarcity of sample data continue to pose obstacles that need to be overcome. Therefore, we propose a novel adaptive lightweight multimodal efficient feature inference network (ALME-FIN).

View Article and Find Full Text PDF

Gastric cancer (GC) ranks among the top five most diagnosed cancers globally, with particularly high incidence and mortality rates observed in Asian regions. Despite certain advancements achieved through early screening and treatment strategies in many countries, GC continues to pose a significant public health challenge. Approximately 20% of patients infected with develop precancerous lesions, among which metaplasia is the most critical.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!