Assessing Discriminative Performance at External Validation of Clinical Prediction Models.

Daan Nieboer Tjeerd van der Ploeg Ewout W Steyerberg

PLoS One

Department of Public Health, Erasmus MC-University medical center, Rotterdam, the Netherlands.

Published: August 2016

Introduction: External validation studies are essential to study the generalizability of prediction models. Recently a permutation test, focusing on discrimination as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures to judge any changes in c-statistic from development to external validation setting.

Methods: We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study we developed a prediction model with logistic regression on a development set and validated them in the validation set. We concentrated on two scenarios: 1) the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2) the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development set. Furthermore we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury.

Results: The permutation test indicated that the validation and development set were homogenous in scenario 1 (in almost all simulated samples) and heterogeneous in scenario 2 (in 17%-39% of simulated samples). Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2.

Conclusion: The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation population. To correctly interpret the c-statistic found at external validation it is crucial to disentangle case-mix differences from incorrect regression coefficients.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4755533	PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0148820	PLOS

Publication Analysis

Top Keywords

external validation

permutation test

development set

prediction models

prediction model

validation set

validation

c-statistic proposed

benchmark values

values c-statistic

Similar Publications

A Simple Machine Learning-Based Quantitative Structure-Activity Relationship Model for Predicting pIC Inhibition Values of FLT3 Tyrosine Kinase.

Pharmaceuticals (Basel)

January 2025

Centro de Química Médica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago 7780272, Chile.

Jackson J Alcázar Ignacio Sánchez Cristian Merino Bruno Monasterio Gaspar Sajuria

Acute myeloid leukemia (AML) presents significant therapeutic challenges, particularly in cases driven by mutations in the FLT3 tyrosine kinase. This study aimed to develop a robust and user-friendly machine learning-based quantitative structure-activity relationship (QSAR) model to predict the inhibitory potency (pIC values) of FLT3 inhibitors, addressing the limitations of previous models in dataset size, diversity, and predictive accuracy. Using a dataset which was 14 times larger than those employed in prior studies (1350 compounds with 1269 molecular descriptors), we trained a random forest regressor, chosen due to its superior predictive performance and resistance to overfitting.

View Article and Find Full Text PDF

Similar Publications

Prediction Model for POstoperative atriaL fibrillAtion in caRdIac Surgery: The POLARIS Score.

J Clin Med

January 2025

Division of Cardiac Surgery, Spedali Civili di Brescia, University of Brescia, 25123 Brescia, Italy.

Fabrizio Rosati Massimo Baudo Cesare Tomasi Giacomo Scotti Sergio Pirola

: New-onset postoperative atrial fibrillation (POAF) is the most common complication after cardiac surgery, occurring approximately in one-third of the patients. This study considered all-comer patients who underwent cardiac surgery to build a predictive model for POAF. : A total of 3467 (Center 1) consecutive patients were used as a derivation cohort to build the model.

View Article and Find Full Text PDF

Similar Publications

The External Validation of a Multivariable Prediction Model for Recurrent Pelvic Organ Prolapse After Native Tissue Repair: A Prospective Cohort Study.

J Clin Med

January 2025

Department of Obstetrics and Gynecology, Zuyderland Medical Center, Henri Dunantstraat 5, 6419 PC Heerlen, The Netherlands.

Imke Kessels Sander van Kuijk Tineke Vergeldt Iris van Gestel Wilbert Spaans

: A prediction model for anatomical cystocele recurrence after native tissue repair was developed and internally validated in 2016. This model estimates a patients' individual risk of recurrence and can be used for counseling. Before implementation in urogynecological clinical practice, external validation is needed.

View Article and Find Full Text PDF

Similar Publications

Machine Learning-Based Radiomics Analysis for Identifying KRAS Mutations in Non-Small-Cell Lung Cancer from CT Images: Challenges, Insights and Implications.

Life (Basel)

January 2025

Institute for Diagnostic and Interventional Radiology, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany.

Mirjam Schöneck Nicolas Rehbach Lars Lotter-Becker Thorsten Persigehl Simon Lennartz

Kirsten Rat Sarcoma viral oncogene homolog (KRAS) is a frequently occurring mutation in non-small-cell lung cancer (NSCLC) and influences cancer treatment and disease progression. In this study, a machine learning (ML) pipeline was applied to radiomic features extracted from public and internal CT images to identify KRAS mutations in NSCLC patients. Both datasets were analyzed using parametric ( test) and non-parametric statistical tests (Mann-Whitney U test) and dimensionality reduction techniques.

View Article and Find Full Text PDF

Similar Publications

Development of a Predictive Model of Occult Cancer After a Venous Thromboembolism Event Using Machine Learning: The CLOVER Study.

Medicina (Kaunas)

December 2024

Department of Internal Medicine, Hospital Universitario Infanta Leonor-Virgen de la Torre, 28031 Madrid, Spain.

Anabel Franco-Moreno Elena Madroñal-Cerezo Cristina Lucía de Ancos-Aracil Ana Isabel Farfán-Sedano Nuria Muñoz-Rivas

: Venous thromboembolism (VTE) can be the first manifestation of an underlying cancer. This study aimed to develop a predictive model to assess the risk of occult cancer between 30 days and 24 months after a venous thrombotic event using machine learning (ML). : We designed a case-control study nested in a cohort of patients with VTE included in a prospective registry from two Spanish hospitals between 2005 and 2021.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!