The adoption of electronic health records (EHRs) has made patient data increasingly accessible, precipitating the development of various clinical decision support systems and data-driven models to help physicians. However, missing data are common in EHR-derived datasets, which can introduce significant uncertainty, if not invalidating the use of a predictive model. Machine learning (ML)-based imputation methods have shown promise in various domains for the task of estimating values and reducing uncertainty to the point that a predictive model can be employed. We introduce Autopopulus, a novel framework that enables the design and evaluation of various autoencoder architectures for efficient imputation on large datasets. Autopopulus implements existing autoencoder methods as well as a new technique that outputs a range of estimated values (rather than point estimates), and demonstrates a workflow that helps users make an informed decision on an appropriate imputation method. To further illustrate Autopopulus' utility, we use it to identify not only which imputation methods can most accurately impute on a large clinical dataset, but to also identify the imputation methods that enable downstream predictive models to achieve the best performance for prediction of chronic kidney disease (CKD) progression.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8862635PMC
http://dx.doi.org/10.1109/EMBC46164.2021.9630135DOI Listing

Publication Analysis

Top Keywords

imputation methods
12
autopopulus novel
8
novel framework
8
imputation large
8
large clinical
8
predictive model
8
identify imputation
8
imputation
6
framework autoencoder
4
autoencoder imputation
4

Similar Publications

Background: The global prevalence of lower urinary tract symptoms suggestive of benign prostate hyperplasia (LUTS/BPH) escalates, with obesity recognized as a major contributing factor. However, the association between the relative fat mass (RFM) and LUTS/BPH remains unexplored. This 7-year follow-up study aimed to investigate the cross-sectional and longitudinal relationships between RFM and LUTS/BPH.

View Article and Find Full Text PDF

Background: The accuracy of low birth weight (LBW) and small for gestational age (SGA) in administrative healthcare records is crucial for perinatal studies but has few validity studies.

Methods: Using 1999-2010 MAX linked to birth certificates (BC), we identified mother-infant dyads (≥30 days enrollment after delivery, with valid gestational age (GA) and birth weight (BW)). LBW and SGA were identified based on ICD-9-CM codes.

View Article and Find Full Text PDF

Objective: This study examines the impact of dose step-down in patients with rheumatoid arthritis (RA) who achieved sustained disease control with baricitinib 4-mg once-daily up to 96-weeks.

Methods: Patients who completed a baricitinib phase 3 study could enter a long-term extension (LTE). In the LTE, patients who received baricitinib 4-mg for ≥15 months and maintained clinical disease activity index (CDAI) low disease activity (LDA) or remission (REM) were blindly randomized to continue 4-mg or taper to 2-mg.

View Article and Find Full Text PDF

Background: A greater understanding of the effectiveness of digital self-management programs and their ability to support longer-term weight loss is needed.

Objective: This study aimed to explore the total weight loss and patterns of weight loss of CSIRO (Commonwealth Scientific and Industrial Research Organisation) Total Wellbeing Diet Online members during their first 12 months of membership and examine the patterns of platform use associated with greater weight loss.

Methods: Participants were Australian adults who joined the program between October 2014 and June 2022 and were classified as longer-term members, meaning they completed at least 12 weeks of the program, had baseline and 12-week weight data, and had a paid membership of ≥1 year (N=24,035).

View Article and Find Full Text PDF

Importance: Mental health issues among young people are increasingly concerning. Conventional psychological interventions face challenges, including limited staffing, time commitment, and low completion rates.

Objective: To evaluate the effect of a low-intensity online intervention on young people in Hong Kong experiencing moderate or greater mental distress.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!