Despite remarkable success in a variety of computer vision applications, it is well known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalize imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations such as rotation and brightness change. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kinds of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variation, (2) generating perturbed images by changing the factors that compose the representations of the images, and (3) constraining the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains, which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state of the art.
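The invariance idea in step (3) can be illustrated with a minimal sketch. The abstract does not give the exact form of the DIR loss, so everything below is an assumption for illustration only: the function name `invariance_regularized_loss`, the use of a KL divergence as the consistency term, and the `weight` parameter are hypothetical, and the logits stand in for classifier outputs on an image and on its style-perturbed counterpart.

```python
import math

def softmax(logits):
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    """Standard classification loss for a single example."""
    return -math.log(probs[label])

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q); eps guards against log(0)."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def invariance_regularized_loss(logits_orig, logits_perturbed, label, weight=1.0):
    """Hypothetical objective: classification loss on the original image plus a
    penalty when the prediction changes on the perturbed image.
    This is an assumed form, not the paper's published DIR loss."""
    p = softmax(logits_orig)
    q = softmax(logits_perturbed)
    return cross_entropy(p, label) + weight * kl_divergence(p, q)

# Hypothetical logits for one image and a style-perturbed version of it.
same = invariance_regularized_loss([2.0, 0.5, -1.0], [2.0, 0.5, -1.0], label=0)
changed = invariance_regularized_loss([2.0, 0.5, -1.0], [-1.0, 0.5, 2.0], label=0)
print(same < changed)
```

When the two predictions agree, the penalty vanishes and only the classification term remains; the more the prediction shifts under the perturbation, the larger the total loss, which is the behavior an invariance regularizer is meant to have.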

Source: http://dx.doi.org/10.1109/TIP.2023.3321511
