Introduction: External validation studies are essential to study the generalizability of prediction models. Recently a permutation test, focusing on discrimination as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures to judge any changes in c-statistic from development to external validation setting.

Methods: We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study we developed a prediction model with logistic regression on a development set and validated them in the validation set. We concentrated on two scenarios: 1) the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2) the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development set. Furthermore we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury.

Results: The permutation test indicated that the validation and development set were homogenous in scenario 1 (in almost all simulated samples) and heterogeneous in scenario 2 (in 17%-39% of simulated samples). Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2.

Conclusion: The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation population. To correctly interpret the c-statistic found at external validation it is crucial to disentangle case-mix differences from incorrect regression coefficients.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4755533PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0148820PLOS

Publication Analysis

Top Keywords

external validation
16
permutation test
16
development set
16
prediction models
12
prediction model
12
validation set
12
validation
10
c-statistic proposed
8
benchmark values
8
values c-statistic
8

Similar Publications

A Simple Machine Learning-Based Quantitative Structure-Activity Relationship Model for Predicting pIC Inhibition Values of FLT3 Tyrosine Kinase.

Pharmaceuticals (Basel)

January 2025

Centro de Química Médica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago 7780272, Chile.

Acute myeloid leukemia (AML) presents significant therapeutic challenges, particularly in cases driven by mutations in the FLT3 tyrosine kinase. This study aimed to develop a robust and user-friendly machine learning-based quantitative structure-activity relationship (QSAR) model to predict the inhibitory potency (pIC values) of FLT3 inhibitors, addressing the limitations of previous models in dataset size, diversity, and predictive accuracy. Using a dataset which was 14 times larger than those employed in prior studies (1350 compounds with 1269 molecular descriptors), we trained a random forest regressor, chosen due to its superior predictive performance and resistance to overfitting.

View Article and Find Full Text PDF

: New-onset postoperative atrial fibrillation (POAF) is the most common complication after cardiac surgery, occurring approximately in one-third of the patients. This study considered all-comer patients who underwent cardiac surgery to build a predictive model for POAF. : A total of 3467 (Center 1) consecutive patients were used as a derivation cohort to build the model.

View Article and Find Full Text PDF

: A prediction model for anatomical cystocele recurrence after native tissue repair was developed and internally validated in 2016. This model estimates a patients' individual risk of recurrence and can be used for counseling. Before implementation in urogynecological clinical practice, external validation is needed.

View Article and Find Full Text PDF

Kirsten Rat Sarcoma viral oncogene homolog (KRAS) is a frequently occurring mutation in non-small-cell lung cancer (NSCLC) and influences cancer treatment and disease progression. In this study, a machine learning (ML) pipeline was applied to radiomic features extracted from public and internal CT images to identify KRAS mutations in NSCLC patients. Both datasets were analyzed using parametric ( test) and non-parametric statistical tests (Mann-Whitney U test) and dimensionality reduction techniques.

View Article and Find Full Text PDF

: Venous thromboembolism (VTE) can be the first manifestation of an underlying cancer. This study aimed to develop a predictive model to assess the risk of occult cancer between 30 days and 24 months after a venous thrombotic event using machine learning (ML). : We designed a case-control study nested in a cohort of patients with VTE included in a prospective registry from two Spanish hospitals between 2005 and 2021.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!