Multivariate imputation using chained equations (MICE) is a popular algorithm for imputing missing data that entails specifying multivariate models through conditional distributions. For imputing missing continuous variables, two common imputation methods are the use of parametric imputation using a linear model and predictive mean matching. When imputing missing binary variables, the default approach is parametric imputation using a logistic regression model. In the R implementation of MICE, the use of predictive mean matching can be substantially faster than using logistic regression as the imputation model for missing binary variables. However, there is a paucity of research into the statistical performance of predictive mean matching for imputing missing binary variables. Our objective was to compare the statistical performance of predictive mean matching with that of logistic regression for imputing missing binary variables. Monte Carlo simulations were used to compare the statistical performance of predictive mean matching with that of logistic regression for imputing missing binary outcomes when the analysis model of scientific interest was a multivariable logistic regression model. We varied the size of the analysis samples (n = 250, 500, 1,000, 5,000, and 10,000) and the prevalence of missing data (5%-50% in increments of 5%). In general, the statistical performance of predictive mean matching was virtually identical to that of logistic regression for imputing missing binary variables when the analysis model was a logistic regression model. This was true across a wide range of scenarios defined by sample size and the prevalence of missing data. In conclusion, predictive mean matching can be used to impute missing binary variables. The use of predictive mean matching to impute missing binary variables can result in a substantial reduction in computer processing time when conducting simulations of multiple imputation.
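To make the idea of predictive mean matching for a binary variable concrete, the following is a minimal Python sketch and not the procedure used in the study or in the R implementation of MICE. It makes several simplifying assumptions: a single covariate, a linear model fitted by ordinary least squares to the observed cases, no draw of the regression coefficients from their posterior distribution, and a single imputed dataset rather than multiple imputations. The function name `pmm_impute_binary` and its parameters `k` and `seed` are hypothetical.

```python
import math
import random


def pmm_impute_binary(x, y, k=5, seed=None):
    """Fill None entries of binary y by a simplified predictive mean matching.

    Assumptions (illustrative only): one covariate x, an OLS linear model
    fitted to the observed cases, and k nearest donors by predicted value.
    """
    rng = random.Random(seed)
    obs = [i for i, v in enumerate(y) if v is not None]
    mis = [i for i, v in enumerate(y) if v is None]
    if not obs:
        raise ValueError("need at least one observed value to find donors")

    # Ordinary least squares of y on x, using observed cases only.
    n = len(obs)
    mean_x = sum(x[i] for i in obs) / n
    mean_y = sum(y[i] for i in obs) / n
    sxx = sum((x[i] - mean_x) ** 2 for i in obs)
    sxy = sum((x[i] - mean_x) * (y[i] - mean_y) for i in obs)
    slope = sxy / sxx if sxx > 0 else 0.0
    intercept = mean_y - slope * mean_x

    # Predicted means for every case, observed or missing.
    pred = [intercept + slope * xi for xi in x]

    out = list(y)
    for i in mis:
        # Donors: the k observed cases whose predictions are closest.
        donors = sorted(obs, key=lambda j: abs(pred[j] - pred[i]))[:k]
        # Copy the observed value of one randomly chosen donor, so the
        # imputed value is always a value that actually occurs (0 or 1).
        out[i] = y[rng.choice(donors)]
    return out


if __name__ == "__main__":
    # Hypothetical demo data: binary outcome from a logistic model,
    # with about 30% of outcomes set to missing completely at random.
    gen = random.Random(42)
    x = [gen.gauss(0, 1) for _ in range(250)]
    y = [1 if gen.random() < 1 / (1 + math.exp(-xi)) else 0 for xi in x]
    y_missing = [None if gen.random() < 0.3 else v for v in y]
    completed = pmm_impute_binary(x, y_missing, k=5, seed=1)
    print(sum(v is None for v in y_missing), "values imputed")
```

Because each imputed value is copied from an observed donor, the completed variable stays binary without any logistic model being fitted, which is consistent with the abstract's point that predictive mean matching can be applied to binary variables.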

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10683343
DOI: http://dx.doi.org/10.1177/09622802231198795
