Handling missing covariate data in clinical studies in haematology.

Best Pract Res Clin Haematol

Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands; DKMS Clinical Trials Unit, Dresden, Germany.

Published: June 2023

Missing data are frequently encountered across studies in clinical haematology. Failure to handle these missing values in an appropriate manner can complicate the interpretation of a study's findings, as estimates presented may be biased and/or imprecise. In the present work, we first provide an overview of current methods for handling missing covariate data, along with their advantages and disadvantages. Furthermore, a systematic review is presented, exploring both contemporary reporting of missing values in major haematological journals, and the methods used for handling them. A principal finding was that the method of handling missing data was explicitly specified in a minority of articles (in 76 out of 195 articles reporting missing values, 39%). Among these, complete case analysis and the missing indicator method were the most common approaches to dealing with missing values, with more complex methods such as multiple imputation being extremely rare (in 7 out of 195 articles). An example analysis (with associated code) is also provided using hematopoietic stem cell transplantation data, illustrating the different approaches to handling missing values. We conclude with various recommendations regarding the reporting and handling of missing values for future studies in clinical haematology.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.beha.2023.101477DOI Listing

Publication Analysis

Top Keywords

missing values
24
handling missing
20
missing
10
missing covariate
8
covariate data
8
missing data
8
studies clinical
8
clinical haematology
8
methods handling
8
reporting missing
8

Similar Publications

Introduction: High-Flow Nasal Therapy (HFNT) is an innovative non-invasive form of respiratory support. Compared to standard oxygen therapy (SOT), there is an equipoise regarding the effect of HFNT on patient-centred outcomes among those at high risk of developing postoperative pulmonary complications after undergoing cardiac surgery. The NOTACS trial aims to determine the clinical and cost-effectiveness of HFNT compared to SOT within 90 days of surgery in the United Kingdom, Australia, and New Zealand.

View Article and Find Full Text PDF

Background: Accurate detection of hepatocellular carcinoma (HCC) in multiphasic contrast CT is essential for effective treatment and surgical planning. However, the variety of CT images, the misdiagnosis and missed diagnosis, and the inconsistent diagnosis among different radiologists pose challenges to accurate detection which demands sufficient clinical experience and can be time-consuming and labor-intensive.

Purpose: To evaluate the detection performance of an artificial intelligence (AI)-assisted platform for HCC by the external validation dataset.

View Article and Find Full Text PDF

Enhancing Diagnostic Accuracy of Lung Nodules in Chest Computed Tomography Using Artificial Intelligence: Retrospective Analysis.

J Med Internet Res

January 2025

Department of Health Policy and Management, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, United States.

Background: Uncertainty in the diagnosis of lung nodules is a challenge for both patients and physicians. Artificial intelligence (AI) systems are increasingly being integrated into medical imaging to assist diagnostic procedures. However, the accuracy of AI systems in identifying and measuring lung nodules on chest computed tomography (CT) scans remains unclear, which requires further evaluation.

View Article and Find Full Text PDF

Quantitative measurements produced by mass spectrometry proteomics experiments offer a direct way to explore the role of proteins in molecular mechanisms. However, analysis of such data is challenging due to the large proportion of missing values. A common strategy to address this issue is to utilize an imputed dataset, which often introduces systematic bias into down-stream analyses if the imputation errors are ignored.

View Article and Find Full Text PDF

Motivation: Missing values are prevalent in high-throughput measurements due to various experimental or analytical reasons. Imputation, the process of replacing missing values in a dataset with estimated values, plays an important role in multivariate and machine learning analyses. The three missingness patterns, including missing completely at random, missing at random, and missing not at random, describe unique dependencies between the missing and observed data.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!