Variable selection when missing values are present: a case study.

Stat Methods Med Res

Department of Public Health, Oregon State University, Corvallis, OR, USA.

Published: August 2011

We consider variable selection when missing values are present in the predictor variables. We compare using complete cases with multiple imputation using backward selection (backwards stepping) and least angle regression. These are studied using a data set from a rheumatological disease (myositis). We find that the coefficients are slightly different and the estimated standard errors are smaller in the complete cases (not a surprise). This seems to be due to the fact that because the estimated residual variance is small the complete cases are more homogeneous than the full data cases.

Download full-text PDF

Source
http://dx.doi.org/10.1177/0962280209358003DOI Listing

Publication Analysis

Top Keywords

complete cases
12
variable selection
8
selection missing
8
missing values
8
values case
4
case study
4
study consider
4
consider variable
4
values predictor
4
predictor variables
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!