Artificial Intelligence for Prognostic Scores in Oncology: a Benchmarking Study.

Front Artif Intell

Data Science, Pharmaceutical Research and Early Development Informatics (pREDi), Roche Innovation Center Munich (RICM), Penzberg, Germany.

Published: April 2021

Prognostic scores are important tools in oncology to facilitate clinical decision-making based on patient characteristics. To date, classic survival analysis using Cox proportional hazards regression has been employed in the development of these prognostic scores. With the advance of analytical models, this study aimed to determine if more complex machine-learning algorithms could outperform classical survival analysis methods. In this benchmarking study, two datasets were used to develop and compare different prognostic models for overall survival in pan-cancer populations: a nationwide EHR-derived de-identified database for training and in-sample testing and the OAK (phase III clinical trial) dataset for out-of-sample testing. A real-world database comprised 136K first-line treated cancer patients across multiple cancer types and was split into a 90% training and 10% testing dataset, respectively. The OAK dataset comprised 1,187 patients diagnosed with non-small cell lung cancer. To assess the effect of the covariate number on prognostic performance, we formed three feature sets with 27, 44 and 88 covariates. In terms of methods, we benchmarked ROPRO, a prognostic score based on the Cox model, against eight complex machine-learning models: regularized Cox, Random Survival Forests (RSF), Gradient Boosting (GB), DeepSurv (DS), Autoencoder (AE) and Super Learner (SL). The C-index was used as the performance metric to compare different models. For in-sample testing on the real-world database the resulting C-index [95% CI] values for RSF 0.720 [0.716, 0.725], GB 0.722 [0.718, 0.727], DS 0.721 [0.717, 0.726] and lastly, SL 0.723 [0.718, 0.728] showed significantly better performance as compared to ROPRO 0.701 [0.696, 0.706]. Similar results were derived across all feature sets. However, for the out-of-sample validation on OAK, the stronger performance of the more complex models was not apparent anymore. Consistently, the increase in the number of prognostic covariates did not lead to an increase in model performance. The stronger performance of the more complex models did not generalize when applied to an out-of-sample dataset. We hypothesize that future research may benefit by adding multimodal data to exploit advantages of more complex models.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8086599PMC
http://dx.doi.org/10.3389/frai.2021.625573DOI Listing

Publication Analysis

Top Keywords

prognostic scores
12
complex models
12
benchmarking study
8
survival analysis
8
complex machine-learning
8
in-sample testing
8
testing real-world
8
real-world database
8
number prognostic
8
feature sets
8

Similar Publications

Introduction: The Naples prognostic score (NPS) is a novel indicator of nutritional and inflammatory statuses in cancer patients. Development of atrial fibrillation after cardiac surgery (POAF) is a common complication that increases the incidence of adverse events. Numerous studies have investigated predictors of POAF.

View Article and Find Full Text PDF

Introduction: Gastric cancer (GC) is a leading cause of cancer-related deaths worldwide, with delayed diagnosis often limiting effective treatment options. This study introduces a novel, non-invasive radiomics-based approach utilizing [18F] FDG PET/CT to predict VEGF status and survival in GC patients. The ability to non-invasively assess these parameters can significantly influence therapeutic decisions and outcomes.

View Article and Find Full Text PDF

Background: Predicting mortality and morbidity poses a significant challenge to physicians, leading to the development of various scoring systems. Among these, the hemoglobin, albumin, lymphocyte and platelet (HALP) score evaluates a patient's nutritional and immune status. The primary aim of this study was to determine the predictive effect of the HALP score on 30-day and 1-year mortality in elderly patients with proximal femoral fractures (PFFs).

View Article and Find Full Text PDF

Introduction: Accurate prognostication in comatose survivors of cardiac arrest is a challenging and high-stakes endeavor. We sought to determine whether internal EEG subparameters extracted by the Bispectral Index (BIS) monitor, a device commonly used to estimate depth-of-anesthesia intraoperatively, could be repurposed to predict recovery of consciousness after cardiac arrest.

Methods: In this retrospective cohort study, we trained a 3-layer neural network to predict recovery of consciousness to the point of command following versus not based on 48 hours of continuous EEG recordings in 315 comatose patients admitted to a single US academic medical center after cardiac arrest (Derivation cohort: N=181; Validation cohort: N=134).

View Article and Find Full Text PDF

The prognostic value of differentially expressed senescence-related genes(DESRGs) in ST-segment elevation myocardial infarction(STEMI) patients is unclear. We used GEO2R to identify DESRGs from GSE60993 and performed functional enrichment analysis. We built an optimal prognostic model with LASSO penalized Cox regression via GSE49925.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!