Four covariate selection approaches were compared: a directed acyclic graph (DAG) full model and 3 DAG and change-in-estimate combined procedures. Twenty-five scenarios with case-control samples were generated from 10 simulated populations in order to address the performance of these covariate selection procedures in the presence of confounders of various strengths and under DAG misspecification with omission of confounders or inclusion of nonconfounders. Performance was evaluated by standard error, bias, square root of the mean-squared error, and 95% confidence interval coverage. In most scenarios, the DAG full model without further covariate selection performed as well as or better than the other procedures when the DAGs were correctly specified, as well as when confounders were omitted. Model reduction by using change-in-estimate procedures showed potential gains in precision when the DAGs included nonconfounders, but underestimation of regression-based standard error might cause reduction in 95% confidence interval coverage. For modeling binary outcomes in a case-control study, the authors recommend construction of a "conservative" DAG, determination of all potential confounders, and then change-in-estimate procedures to simplify this full model. The authors advocate that, under the conditions investigated, the selection of final model should be based on changes in precision: Adopt the reduced model if its standard error (derived from logistic regression) is substantially smaller; otherwise, the full DAG-based model is appropriate.

Download full-text PDF

Source
http://dx.doi.org/10.1093/aje/kwp035DOI Listing

Publication Analysis

Top Keywords

covariate selection
16
full model
12
standard error
12
directed acyclic
8
dag full
8
95% confidence
8
confidence interval
8
interval coverage
8
change-in-estimate procedures
8
model
7

Similar Publications

Background Selection between open thyroidectomy (OT) and minimally invasive (endoscopic/robotic) thyroidectomy (MT) for patients with thyroid cancer has been a subject of considerable debate. Comprehensive analysis of the short-term outcomes of endoscopic thyroidectomy (ET), robotic thyroidectomy (RT) and OT for thyroid cancer using a large-scale dataset is important. Methods This cohort study evaluated the outcomes of patients receiving ET, RT vs OT for thyroid cancer from January 1, 2003, to December 31, 2022.

View Article and Find Full Text PDF

High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates.

Am J Epidemiol

January 2025

Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.

Multiple imputation (MI) models can be improved with auxiliary covariates (AC), but their performance in high-dimensional data remains unclear. We aimed to develop and compare high-dimensional MI (HDMI) methods using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation with acute kidney injury as outcome and simulated 100 cohorts with a null treatment effect, incorporating creatinine labs, atrial fibrillation (AFib), and other investigator-derived confounders in the outcome generation.

View Article and Find Full Text PDF

Predictive models are indeed useful for causal inference.

Ecology

January 2025

Department of Natural Resources and the Environment, Cornell University, Ithaca, New York, USA.

The subject of investigating causation in ecology has been widely discussed in recent years, especially by advocates of a structural causal model (SCM) approach. Some of these advocates have criticized the use of predictive models and model selection for drawing inferences about causation. We argue that the comparison of model-based predictions with observations is a key step in hypothetico-deductive (H-D) science and remains a valid approach for assessing causation.

View Article and Find Full Text PDF

Background: Sarcopenia, a prevalent muscle disorder in the older adults, is characterized by accelerated loss of muscle mass and function, contributing to increased risks of falls, functional decline, and mortality. The relationship between dietary oxidative balance score (DOBS) and sarcopenia, however, remains unclear.

Methods: We conducted a cross-sectional analysis of the National Health and Nutritional Examination Survey (NHANES) 2011-2018 cohort, which included 8,240 participants, aged 47.

View Article and Find Full Text PDF

Background: Cluster randomized trials, which often enroll a small number of clusters, can benefit from constrained randomization, selecting a final randomization scheme from a set of known, balanced randomizations. Previous literature has addressed the suitability of adjusting the analysis for the covariates that were balanced in the design phase when the outcome is continuous or binary. Here we extended this work to time-to-event outcomes by comparing two model-based tests and a newly derived permutation test.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!