Background: Predicting outcomes of patients with COVID-19 at an early stage is crucial for optimised clinical care and resource management, especially during a pandemic. Although multiple machine learning models have been proposed to address this issue, because of their requirements for extensive data preprocessing and feature engineering, they have not been validated or implemented outside of their original study site. Therefore, we aimed to develop accurate and transferrable predictive models of outcomes on hospital admission for patients with COVID-19.

Methods: In this study, we developed recurrent neural network-based models (CovRNN) to predict the outcomes of patients with COVID-19 by use of available electronic health record data on admission to hospital, without the need for specific feature selection or missing data imputation. CovRNN was designed to predict three outcomes: in-hospital mortality, need for mechanical ventilation, and prolonged hospital stay (>7 days). For in-hospital mortality and mechanical ventilation, CovRNN produced time-to-event risk scores (survival prediction; evaluated by the concordance index) and all-time risk scores (binary prediction; area under the receiver operating characteristic curve [AUROC] was the main metric); we only trained a binary classification model for prolonged hospital stay. For binary classification tasks, we compared CovRNN against traditional machine learning algorithms: logistic regression and light gradient boost machine. Our models were trained and validated on the heterogeneous, deidentified data of 247 960 patients with COVID-19 from 87 US health-care systems derived from the Cerner Real-World COVID-19 Q3 Dataset up to September 2020. We held out the data of 4175 patients from two hospitals for external validation. The remaining 243 785 patients from the 85 health systems were grouped into training (n=170 626), validation (n=24 378), and multi-hospital test (n=48 781) sets. Model performance was evaluated in the multi-hospital test set. The transferability of CovRNN was externally validated by use of deidentified data from 36 140 patients derived from the US-based Optum deidentified COVID-19 electronic health record dataset (version 1015; from January, 2007, to Oct 15, 2020). Exact dates of data extraction were masked by the databases to ensure patient data safety.

Findings: CovRNN binary models achieved AUROCs of 93·0% (95% CI 92·6-93·4) for the prediction of in-hospital mortality, 92·9% (92·6-93·2) for the prediction of mechanical ventilation, and 86·5% (86·2-86·9) for the prediction of a prolonged hospital stay, outperforming light gradient boost machine and logistic regression algorithms. External validation confirmed AUROCs in similar ranges (91·3-97·0% for in-hospital mortality prediction, 91·5-96·0% for the prediction of mechanical ventilation, and 81·0-88·3% for the prediction of prolonged hospital stay). For survival prediction, CovRNN achieved a concordance index of 86·0% (95% CI 85·1-86·9) for in-hospital mortality and 92·6% (92·2-93·0) for mechanical ventilation.

Interpretation: Trained on a large, heterogeneous, real-world dataset, our CovRNN models showed high prediction accuracy and transferability through consistently good performances on multiple external datasets. Our results show the feasibility of a COVID-19 predictive model that delivers high accuracy without the need for complex feature engineering.

Funding: Cancer Prevention and Research Institute of Texas.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9023005PMC
http://dx.doi.org/10.1016/S2589-7500(22)00049-8DOI Listing

Publication Analysis

Top Keywords

in-hospital mortality
20
patients covid-19
16
mechanical ventilation
16
prolonged hospital
16
hospital stay
16
outcomes patients
12
electronic health
12
health record
12
prediction
10
covrnn
9

Similar Publications

Prognostic Implications of Cardiac Geometry in Cirrhosis: Findings From a Large Cohort.

Liver Int

February 2025

General Practice Ward/International Medical Center Ward, General Practice Medical Center, West China Hospital, Sichuan University, Chengdu, China.

Background And Aims: Cirrhosis is characterised by hyperdynamic circulation, which contributes to cirrhotic cardiomyopathy (CCM). However, the expert consensus on CCM did not initially include cardiac structure because of scant evidence. Therefore, this study investigated the associations of cardiac chamber geometry with mortality and CCM.

View Article and Find Full Text PDF

Background: Type 2 diabetes mellitus (T2DM) poses an escalating public health challenge among adolescents and young adults worldwide. Despite the rising incidence, comprehensive data on the burden and trends of T2DM in this demographic remain scarce. This study aims to evaluate the burden of T2DM among individuals aged 10-24 years globally, regionally, and nationally from 1990 to 2021.

View Article and Find Full Text PDF

Hepatocellular carcinoma (HCC) is a primary malignant neoplasm exhibiting a high mortality rate. Taxifolin is a naturally occurring flavonoid compound that exhibits a range of pharmacological properties. The effects of taxifolin on HCC remain largely unexplored.

View Article and Find Full Text PDF

Sepsis is a major cause of morbidity and mortality worldwide. Among the various types of end-organ damage associated with sepsis, hepatic injury is linked to significantly higher mortality rates compared to dysfunction in other organ systems. This study aimed to investigate potential biomarkers of hepatic injury in sepsis patients through a multi-center, case-control approach.

View Article and Find Full Text PDF

Glioma is characterized by high heterogeneity and poor prognosis. Attempts have been made to understand its diversity in both genetic expressions and radiomic characteristics, while few integrated the two omics in predicting survival of glioma. This study was intended to investigate the connection between glioma imaging and genome, and examine its predictive value in glioma mortality risk and tumor immune microenvironment (TIME).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!