Selective inference for effect modification via the lasso.

J R Stat Soc Series B Stat Methodol

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, New York, USA.

Published: April 2022

Effect modification occurs when the effect of the treatment on an outcome varies according to the level of other covariates and often has important implications in decision-making. When there are tens or hundreds of covariates, it becomes necessary to use the observed data to select a simpler model for effect modification and then make valid statistical inference. We propose a two-stage procedure to solve this problem. First, we use Robinson's transformation to decouple the nuisance parameters from the treatment effect of interest and use machine learning algorithms to estimate the nuisance parameters. Next, after plugging in the estimates of the nuisance parameters, we use the lasso to choose a low-complexity model for effect modification. Compared to a full model consisting of all the covariates, the selected model is much more interpretable. Compared to the univariate subgroup analyses, the selected model greatly reduces the number of false discoveries. We show that the conditional selective inference for the selected model is asymptotically valid given the rate assumptions in classical semiparametric regression. Extensive simulation studies are conducted to verify the asymptotic results and an epidemiological application is used to demonstrate the method.
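The two-stage procedure described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the simulated data, the polynomial least-squares nuisance learner (a stand-in for the machine learning algorithms mentioned above), the helper names `fit_predict` and `lasso_cd`, and the penalty level `lam=0.08` are all hypothetical choices. The sketch stops at model selection; the conditional selective inference step for the selected model is not shown.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 2000, 10

# Hypothetical simulated data: only covariate 0 modifies the treatment effect.
X = rng.normal(size=(n, p))
propensity = np.clip(0.5 + 0.15 * X[:, 0], 0.1, 0.9)
T = rng.binomial(1, propensity).astype(float)
delta = 1.0 + 2.0 * X[:, 0]            # true effect modification
eta = X[:, 1] + X[:, 2] ** 2           # baseline (nuisance) part of the outcome
Y = eta + T * delta + rng.normal(size=n)


def fit_predict(X, target):
    """Stand-in nuisance learner (assumption): least squares on [1, X, X^2]."""
    basis = np.column_stack([np.ones(len(X)), X, X ** 2])
    coef, *_ = np.linalg.lstsq(basis, target, rcond=None)
    return basis @ coef


def soft_threshold(z, lam):
    return np.sign(z) * max(abs(z) - lam, 0.0)


def lasso_cd(A, r, lam, n_iter=200):
    """Coordinate descent for (1/2n)||r - A b||^2 + lam * sum_{j>=1} |b_j|.

    Column 0 of A (the intercept term) is left unpenalized.
    """
    n_obs, n_col = A.shape
    b = np.zeros(n_col)
    col_sq = (A ** 2).sum(axis=0) / n_obs
    resid = r.copy()
    for _ in range(n_iter):
        for j in range(n_col):
            if col_sq[j] == 0.0:
                continue
            resid += A[:, j] * b[j]            # partial residual without column j
            rho = A[:, j] @ resid / n_obs
            if j == 0:
                b[j] = rho / col_sq[j]
            else:
                b[j] = soft_threshold(rho, lam) / col_sq[j]
            resid -= A[:, j] * b[j]
    return b


# Stage 1: Robinson's transformation. Residualize outcome and treatment on X
# so that the nuisance functions E[Y | X] and E[T | X] drop out.
ry = Y - fit_predict(X, Y)
rt = T - fit_predict(X, T)

# Stage 2: lasso for ry ~ rt * (alpha + X beta), selecting effect modifiers.
A = rt[:, None] * np.column_stack([np.ones(n), X])
beta = lasso_cd(A, ry, lam=0.08)
selected = [j for j in range(p) if beta[j + 1] != 0.0]
print("selected effect modifiers:", selected)
```

Residualizing both the outcome and the treatment is what decouples the nuisance functions from the effect-modification coefficients, so the lasso only has to choose among the columns `rt * X_j`. Confidence intervals computed naively on the selected columns would ignore the selection step, which is the problem the paper's conditional inference addresses.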


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9491375
DOI: http://dx.doi.org/10.1111/rssb.12483


Similar Publications

We study kernel-based estimation methods for partially linear varying coefficient additive hazards models, where the effects of one type of covariate can be modified by another. Existing kernel estimation methods for varying coefficient models often use a "local" approach, where only a small local neighborhood of subjects is used for estimating the varying coefficient functions. Such a local approach, however, is generally inefficient, as information about some non-varying nuisance parameter from subjects outside the neighborhood is discarded.


Estimating evolutionary and demographic parameters via ARG-derived IBD.

PLoS Genet

January 2025

Melbourne Integrative Genomics, School of Mathematics & Statistics, University of Melbourne, Victoria, Australia.

Inference of evolutionary and demographic parameters from a sample of genome sequences often proceeds by first inferring identical-by-descent (IBD) genome segments. By exploiting efficient data encoding based on the ancestral recombination graph (ARG), we obtain three major advantages over current approaches: (i) no need to impose a length threshold on IBD segments, (ii) IBD can be defined without the hard-to-verify requirement of no recombination, and (iii) computation time can be reduced with little loss of statistical efficiency using only the IBD segments from a set of sequence pairs that scales linearly with sample size. We first demonstrate powerful inferences when true IBD information is available from simulated data.


Anaerobic co-digestion is emerging as an option for wastewater biosolids management. Variations in treatment parameters can impact odour emissions and, in turn, odour nuisance reduces community acceptance and alternatives for beneficial reuse of biosolids via land application. This study assessed odour emissions from digested sludge and biosolids resulting from the anaerobic co-digestion of wastewater sludge with beverage rejects (beer and cola) and food wastes.


Call for Decision Support for Electrocardiographic Alarm Administration Among Neonatal Intensive Care Unit Staff: Multicenter, Cross-Sectional Survey.

J Med Internet Res

December 2024

Shanghai Engineering Research Center of Intelligence Pediatrics, Shanghai Children's Medical Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.

Background: Previous studies have shown that electrocardiographic (ECG) alarms have high sensitivity and low specificity, have underreported adverse events, and may cause neonatal intensive care unit (NICU) staff fatigue or alarm ignoring. Moreover, prolonged noise stimuli in hospitalized neonates can disrupt neonatal development.

Objective: The aim of the study is to conduct a nationwide, multicenter, large-sample cross-sectional survey to identify current practices and investigate the decision-making requirements of health care providers regarding ECG alarms.


We present methods for estimating loss-based measures of the performance of a prediction model in a target population that differs from the source population in which the model was developed, in settings where outcome and covariate data are available from the source population but only covariate data are available on a simple random sample from the target population. Prior work adjusting for differences between the two populations has used various weighting estimators with inverse odds or density ratio weights. Here, we develop more robust estimators for the target population risk (expected loss) that can be used with data-adaptive (e.
