AI Article Synopsis

  • Multiple imputation and maximum likelihood estimation are two common methods for handling missing data, but improper multiple imputation can act similarly to a stochastic expectation-maximization approach.
  • The article suggests that traditional model selection tools like Akaike's Information Criterion (AIC) and Bayesian Information Criterion (BIC) can help effectively choose the best imputation model, which is crucial to avoid bias in analysis.
  • Simulations show that not only can incorrect imputation lead to biased parameter estimates, but also overfitting the imputation model can have negative effects, highlighting the need for careful model selection in imputation strategies.

Article Abstract

Multiple imputation and maximum likelihood estimation (via the expectation-maximization algorithm) are two well-known methods readily used for analyzing data with missing values. While these two methods are often considered as being distinct from one another, multiple imputation (when using improper imputation) is actually equivalent to a stochastic expectation-maximization approximation to the likelihood. In this article, we exploit this key result to show that familiar likelihood-based approaches to model selection, such as Akaike's information criterion (AIC) and the Bayesian information criterion (BIC), can be used to choose the imputation model that best fits the observed data. Poor choice of imputation model is known to bias inference, and while sensitivity analysis has often been used to explore the implications of different imputation models, we show that the data can be used to choose an appropriate imputation model via conventional model selection tools. We show that BIC can be consistent for selecting the correct imputation model in the presence of missing data. We verify these results empirically through simulation studies, and demonstrate their practicality on two classical missing data examples. An interesting result we saw in simulations was that not only can parameter estimates be biased by misspecifying the imputation model, but also by overfitting the imputation model. This emphasizes the importance of using model selection not just to choose the appropriate type of imputation model, but also to decide on the appropriate level of imputation model complexity.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248419PMC
http://dx.doi.org/10.1002/sim.8915DOI Listing

Publication Analysis

Top Keywords

imputation model
32
imputation
13
multiple imputation
12
missing data
12
model selection
12
model
11
choose appropriate
8
data
6
selecting model
4
model multiple
4

Similar Publications

Background And Objectives: Although a substantial amount of research has focused on negative aspects of caregiving, less research has been conducted investigating positive aspects of providing informal care. The aim of this study was to investigate the longitudinal association between caregiving satisfaction and psychological distress in informal carers of dependent older people, and whether this relationship is mediated by caregiver burden.

Research Design And Methods: Prospective longitudinal study with a probabilistic sample of 332 caregivers of older relatives, with data collected at baseline and at 1-year follow-up.

View Article and Find Full Text PDF

Epidemiological and genetic factors affecting severe epizootic hemorrhagic disease in Spanish Holstein cattle during the Southern Europe outbreak of 2023.

J Dairy Sci

January 2025

Confederación de Asociaciones de Frisona Española (CONAFE), Ctra. de Andalucía km 23600 Valdemoro, 28340 Madrid, Spain.

Epizootic hemorrhagic disease (EHD) is a non-contagious viral infection that can cause important economic losses in dairy farms. This study aimed to identify epidemiological and genetic factors influencing the susceptibility and severity of EHD in Holstein dairy cattle during the 2023 outbreak in Spain. Data from 2852 animals in 7 affected farms from 5 Spanish provinces were used.

View Article and Find Full Text PDF

AI-based models to predict decompensation on traumatic brain injury patients.

Comput Biol Med

January 2025

INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal; FCTUC - Faculty of Sciences and Technology of the University of Coimbra, Coimbra, Portugal. Electronic address:

Traumatic Brain Injury (TBI) is a form of brain injury caused by external forces, resulting in temporary or permanent impairment of brain function. Despite advancements in healthcare, TBI mortality rates can reach 30%-40% in severe cases. This study aims to assist clinical decision-making and enhance patient care for TBI-related complications by employing Artificial Intelligence (AI) methods and data-driven approaches to predict decompensation.

View Article and Find Full Text PDF

Background: Time-restricted eating (TRE) manages weight effectively, but choosing how long and what time window remain debatable. Although an 8:00 a.m.

View Article and Find Full Text PDF

Introduction: The Big Five Inventory (BFI) is a popular measure that evaluates personality on the Big-Five model. Apart from its utilization across cultures, the literature did not reveal any meta-analysis for the reliability of the different versions of the BFI and its translations. The current study carried out a reliability generalization meta-analysis (REGEMA) to establish the reliability of the BFI across cultures and languages.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!