We consider the problem of predicting a response Y from a set of covariates X when test- and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain unchanged under arbitrary interventions on the covariates, but they are not always optimal in the above sense. For example, for linear models and bounded interventions, alternative solutions have been shown to be minimax prediction optimal. We introduce the formal framework of distribution generalization that allows us to analyze the above problem in partially observed nonlinear models for both direct interventions on X and interventions that occur indirectly via exogenous variables A. It takes into account that, in practice, minimax solutions need to be identified from data. Our framework allows us to characterize under which class of interventions the causal function is minimax optimal. We prove sufficient conditions for distribution generalization and present corresponding impossibility results. We propose a practical method, NILE, that achieves distribution generalization in a nonlinear IV setting with linear extrapolation. We prove consistency and present empirical results.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2021.3094760DOI Listing

Publication Analysis

Top Keywords

distribution generalization
16
framework distribution
8
interventions
6
causal
5
causal framework
4
distribution
4
generalization
4
generalization consider
4
consider problem
4
problem predicting
4

Similar Publications

Distribution of opioid analgesics by community racial/ethnic and socioeconomic profiles, 2011-2021.

Pain

January 2025

Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, United States.

Rapid declines in opioid analgesics dispensed in American communities since 2011 raise concerns about inadequate access to effective pain management among patients for whom opioid therapies are appropriate, especially for those living in racial/ethnic minority and socioeconomically deprived communities. Using 2011 to 2021 national data from the Automated Reports and Consolidated Ordering System and generalized linear models, this study examined quarterly per capita distribution of oxycodone, hydrocodone, and morphine (in oral morphine milligram equivalents [MMEs]) by communities' racial/ethnic and socioeconomic profiles. Communities (defined by 3-digit-zip codes areas) were classified as "majority White" (≥50% self-reported non-Hispanic White population) vs "majority non-White.

View Article and Find Full Text PDF

Polymer material innovations for a green hydrogen economy.

Chem Commun (Camb)

January 2025

Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 1 Pesek Road, Singapore 627833, Republic of Singapore.

Polymeric materials are ubiquitous in modern life. Similar to many other technological applications, polymer materials are essential in advancing the green hydrogen economy, offering solutions for hydrogen production, storage, transport, and utilization. In production, polymeric proton exchange membranes in water electrolysers enable efficient green hydrogen generation using renewable energy.

View Article and Find Full Text PDF

Using genetic data to infer evolutionary distances between molecular sequence pairs based on a Markov substitution model is a common procedure in phylogenetics, in particular for selecting a good starting tree to improve upon. Many evolutionary patterns can be accurately modelled using substitution models that are available in closed form, including the popular general time reversible model (GTR) for DNA data. For more complex biological phenomena, such as variations in lineage-specific evolutionary rates over time (heterotachy), other approaches such as the GTR with rate variation (GTR ) are required, but do not admit analytical solutions and do not automatically allow for likelihood calculations crucial for Bayesian analysis.

View Article and Find Full Text PDF

Background: Given the increasing prevalence of antiplatelet agent use and the lack of high-quality evidence, the CAPTAIN trial aimed to investigate the safety and provide recommendations on continuing acetylsalicylic acid perioperatively in patients undergoing elective laparoscopic totally extraperitoneal inguinal hernia repair (LIHR).

Methods: The CAPTAIN trial was a multicentre, surgeon blind, randomized controlled trial conducted from April 2016 to April 2023. Patients undergoing LIHR were eligible for inclusion.

View Article and Find Full Text PDF

The types and quantities of microorganisms in activated sludge are directly related to the stability and efficiency of sewage treatment systems. This paper proposes a sludge microorganism detection method based on microscopic phase contrast image optimisation and deep learning. Firstly, a dataset containing eight types of microorganisms is constructed, and an augmentation strategy based on single and multisamples processing is designed to address the issues of sample deficiency and uneven distribution.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!