Censored observations are common in biomedical data sets. Although a large amount of research has been devoted to estimation and inference for data with censored responses, very little has focused on proper statistical procedures when predictors are censored. In this paper, we consider statistical methods for dealing with multiple predictors subject to detection limits within the context of generalized linear models. We investigate and adapt several conventional methods and develop a new multiple imputation approach for analyzing data sets with predictors censored due to detection limits. We establish the consistency and asymptotic normality of the proposed multiple imputation estimator and suggest a computationally simple and consistent variance estimator. We also demonstrate that the conditional mean imputation method often leads to inconsistent estimates in generalized linear models, while several other methods are either computationally intensive or lead to parameter estimates that are biased or more variable than those from the proposed multiple imputation estimator. In an extensive simulation study, we assess the bias and variability of the different approaches within the context of a logistic regression model and compare variance estimation methods for the proposed multiple imputation estimator. Lastly, we apply several methods to analyze the data set from a recently conducted GenIMS study.
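
To make the idea of imputing values below a detection limit more concrete, the following is a minimal sketch of two generic building blocks, not the estimator proposed in the paper. It assumes the censored predictor is normally distributed with known mean and standard deviation (a strong simplification), draws replacements from the part of that distribution below the limit, and pools the results with Rubin's rules, a standard combining rule that may differ from the variance estimator the authors suggest. The function names and the toy target quantity (the predictor's mean rather than a regression coefficient) are hypothetical choices made for illustration.

```python
import random
from statistics import NormalDist, mean, variance


def draw_below_limit(mu, sigma, limit, rng):
    """Draw one value from Normal(mu, sigma) restricted to values <= limit.

    Uses inverse-CDF sampling: pick a uniform probability inside the
    censored region, then map it back through the normal quantile function.
    mu and sigma are ASSUMED known here; in practice they would be estimated.
    """
    dist = NormalDist(mu, sigma)
    p_limit = dist.cdf(limit)              # probability mass below the limit
    u = rng.uniform(0.0, p_limit)
    u = min(max(u, 1e-300), 1.0 - 1e-12)   # inv_cdf needs 0 < u < 1
    return min(dist.inv_cdf(u), limit)     # guard against rounding past the limit


def pool_rubin(estimates, variances):
    """Combine per-imputation estimates and variances with Rubin's rules."""
    m = len(estimates)
    pooled = mean(estimates)
    within = mean(variances)               # average within-imputation variance
    between = variance(estimates)          # between-imputation variance (m - 1 divisor)
    total = within + (1.0 + 1.0 / m) * between
    return pooled, total


def impute_and_pool(values, limit, mu, sigma, n_imputations=20, seed=0):
    """Toy analysis: estimate the mean of a predictor with censored entries.

    `values` is a list in which None marks a measurement below the limit.
    """
    rng = random.Random(seed)
    estimates, variances = [], []
    for _ in range(n_imputations):
        completed = [
            v if v is not None else draw_below_limit(mu, sigma, limit, rng)
            for v in values
        ]
        estimates.append(mean(completed))
        variances.append(variance(completed) / len(completed))
    return pool_rubin(estimates, variances)
```

Calling `impute_and_pool([0.3, None, 1.2, None, -0.1], limit=-0.5, mu=0.0, sigma=1.0)` returns a pooled estimate and its total variance. A regression analysis would additionally need the imputation model to condition on the outcome and the other covariates, which this sketch deliberately leaves out.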

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4526268
DOI: http://dx.doi.org/10.1007/s12561-013-9099-4
