Machine learning in causal inference for epidemiology.

Eur J Epidemiol

Cancer Epidemiology Unit, Department of Medical Sciences, University of Turin and CPO Piedmont, Via Santena 7, Turin, 10126, Italy.

Published: October 2024

In causal inference, parametric models are usually employed to estimate the effect of interest. However, they rely on the assumption that the model is correctly specified; when it is not, effect estimates are biased. Correct model specification is challenging, especially in high-dimensional settings. Incorporating machine learning (ML) into causal analyses may reduce the bias arising from model misspecification, since ML methods do not require the analyst to specify a functional form for the relationship between variables. However, when ML predictions are plugged directly into a predefined formula for the effect of interest, there is a risk of introducing "plug-in bias" into the effect measure. To overcome this problem and to achieve useful asymptotic properties, new estimators have been proposed that combine the predictive potential of ML with the ability of traditional statistical methods to make inferences about population parameters. For epidemiologists interested in taking advantage of ML for causal inference investigations, we provide an overview of three estimators that represent the current state of the art: Targeted Maximum Likelihood Estimation (TMLE), Augmented Inverse Probability Weighting (AIPW) and Double/Debiased Machine Learning (DML).
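
As an illustration of the kind of estimator the abstract describes, the following is a minimal sketch of a cross-fitted AIPW estimate of the average treatment effect (ATE) on simulated data. It is not taken from the paper: the data-generating process, the assumed true effect of 1.0, the function names (`simulate`, `fit_binned`, `aipw_crossfit`) and the binned-average learner standing in for an ML method are all assumptions chosen to keep the example dependency-free.

```python
import math
import random

TRUE_ATE = 1.0  # assumed effect, used only to generate the toy data


def simulate(n, seed=0):
    """Toy data: confounder X, binary exposure A, continuous outcome Y."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = rng.random()
        a = 1 if rng.random() < 0.2 + 0.6 * x else 0  # exposure depends on X
        y = TRUE_ATE * a + 2.0 * x * x + rng.gauss(0.0, 1.0)
        rows.append((x, a, y))
    return rows


def _mean(values, default):
    return sum(values) / len(values) if values else default


def fit_binned(train, n_bins):
    """Flexible stand-in learner: per-bin averages of A and of Y by exposure arm."""
    bin_of = lambda x: min(int(x * n_bins), n_bins - 1)
    overall = {a: _mean([y for _, ai, y in train if ai == a], 0.0) for a in (0, 1)}
    p_overall = _mean([a for _, a, _ in train], 0.5)
    prop, out = [], []
    for b in range(n_bins):
        cell = [(a, y) for x, a, y in train if bin_of(x) == b]
        prop.append(_mean([a for a, _ in cell], p_overall))
        out.append({a: _mean([y for ai, y in cell if ai == a], overall[a])
                    for a in (0, 1)})
    # Propensity is clipped away from 0 and 1 to keep the weights bounded.
    g = lambda x: min(max(prop[bin_of(x)], 0.01), 0.99)
    m = lambda a, x: out[bin_of(x)][a]
    return g, m


def aipw_crossfit(data, n_bins=10, n_folds=2):
    """Cross-fitted AIPW estimate of the ATE with an influence-function SE."""
    n = len(data)
    psi = []
    for k in range(n_folds):
        train = [row for i, row in enumerate(data) if i % n_folds != k]
        test = [row for i, row in enumerate(data) if i % n_folds == k]
        g, m = fit_binned(train, n_bins)  # nuisance models learned off-fold
        for x, a, y in test:
            gx, m1, m0 = g(x), m(1, x), m(0, x)
            # Plug-in contrast plus inverse-probability-weighted residual corrections.
            psi.append(m1 - m0 + a * (y - m1) / gx - (1 - a) * (y - m0) / (1 - gx))
    est = sum(psi) / n
    var = sum((p - est) ** 2 for p in psi) / (n - 1)
    return est, math.sqrt(var / n)


if __name__ == "__main__":
    est, se = aipw_crossfit(simulate(4000))
    print(f"AIPW ATE = {est:.3f}, "
          f"95% CI = ({est - 1.96 * se:.3f}, {est + 1.96 * se:.3f})")
```

In practice the binned averages would be replaced by any flexible learner. What separates this from a plain plug-in estimate (the average of `m1 - m0` alone) is the pair of weighted residual terms and the fitting of the nuisance models on data not used for evaluation.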

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11599438
http://dx.doi.org/10.1007/s10654-024-01173-x
