Introduction: External validation studies are essential to study the generalizability of prediction models. Recently, a permutation test focusing on discrimination, as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures for judging changes in the c-statistic from the development to the external validation setting.
Methods: We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study, we developed a prediction model with logistic regression on a development set and validated it in a validation set. We concentrated on two scenarios: 1) the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2) the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development sets. Furthermore, we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury.
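The second scenario can be illustrated with a minimal sketch. This is not the authors' simulation: the single predictor, the sample size of 5000, the intercept of -1, the slope of 1 and the predictor standard deviations of 1.0 and 0.5 are all illustrative assumptions. The code fits a logistic model on a development set, applies it to a validation set with a narrower case-mix but the same true predictor effect, and reports the c-statistic and the standard deviation of the linear predictor in each set.

```python
import math
import random
import statistics


def simulate(n, sd_x, intercept, slope, rng):
    """Draw x ~ N(0, sd_x) and a binary outcome from a logistic model."""
    xs = [rng.gauss(0.0, sd_x) for _ in range(n)]
    ys = [1 if rng.random() < 1.0 / (1.0 + math.exp(-(intercept + slope * x))) else 0
          for x in xs]
    return xs, ys


def fit_logistic(xs, ys, iterations=25):
    """Fit intercept and slope of a one-predictor logistic model by Newton-Raphson."""
    a, b = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            w = p * (1.0 - p)
            g0 += y - p            # score for the intercept
            g1 += (y - p) * x      # score for the slope
            h00 += w               # information matrix entries
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b


def c_statistic(scores, outcomes):
    """Probability that a random event scores higher than a random non-event (ties count 1/2)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0 + 1.0  # 1-based rank shared by tied scores
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    n_events = sum(outcomes)
    n_nonevents = len(outcomes) - n_events
    rank_sum = sum(r for r, y in zip(ranks, outcomes) if y == 1)
    return (rank_sum - n_events * (n_events + 1) / 2.0) / (n_events * n_nonevents)


rng = random.Random(2016)
# Assumed settings: identical true effects, narrower predictor spread at validation.
dev_x, dev_y = simulate(5000, 1.0, -1.0, 1.0, rng)
val_x, val_y = simulate(5000, 0.5, -1.0, 1.0, rng)

intercept_hat, slope_hat = fit_logistic(dev_x, dev_y)
dev_lp = [intercept_hat + slope_hat * x for x in dev_x]
val_lp = [intercept_hat + slope_hat * x for x in val_x]

c_dev, c_val = c_statistic(dev_lp, dev_y), c_statistic(val_lp, val_y)
sd_dev, sd_val = statistics.stdev(dev_lp), statistics.stdev(val_lp)
print(f"development: c = {c_dev:.3f}, SD of linear predictor = {sd_dev:.3f}")
print(f"validation:  c = {c_val:.3f}, SD of linear predictor = {sd_val:.3f}")
```

Under these assumptions the fitted slope stays close to the true value, yet the c-statistic is lower in the validation set simply because the linear predictor is less spread out there.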
Results: The permutation test indicated that the validation and development sets were homogeneous in scenario 1 (in almost all simulated samples) and heterogeneous in scenario 2 (in 17%-39% of simulated samples). Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2.
Conclusion: The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation populations. To correctly interpret the c-statistic found at external validation, it is crucial to disentangle case-mix differences from incorrect regression coefficients.
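One way to separate case-mix from incorrect coefficients is a case-mix-corrected benchmark: the c-statistic the model would be expected to reach in the validation set if its coefficients were correct there. The sketch below assumes this benchmark is obtained by repeatedly simulating outcomes from the model's own predicted risks for the validation patients. The abstract does not spell out the computation, and the model coefficients (-1 and 1), the sample size of 4000 and the 50 repeats are hypothetical.

```python
import math
import random


def c_statistic(scores, outcomes):
    """Probability that a random event scores higher than a random non-event (ties count 1/2)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0 + 1.0  # 1-based rank shared by tied scores
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    n_events = sum(outcomes)
    n_nonevents = len(outcomes) - n_events
    rank_sum = sum(r for r, y in zip(ranks, outcomes) if y == 1)
    return (rank_sum - n_events * (n_events + 1) / 2.0) / (n_events * n_nonevents)


def case_mix_corrected_c(linear_predictors, repeats, rng):
    """Average c-statistic when outcomes are drawn from the model's own predicted risks.

    Assumes every simulated outcome vector contains both events and non-events.
    """
    risks = [1.0 / (1.0 + math.exp(-lp)) for lp in linear_predictors]
    total = 0.0
    for _ in range(repeats):
        simulated = [1 if rng.random() < p else 0 for p in risks]
        total += c_statistic(linear_predictors, simulated)
    return total / repeats


rng = random.Random(7)
# Hypothetical model carried over from development: logit(risk) = -1 + 1 * x.
intercept, slope = -1.0, 1.0
# Hypothetical validation set: narrow case-mix, true effects equal to the model's.
val_x = [rng.gauss(0.0, 0.5) for _ in range(4000)]
val_lp = [intercept + slope * x for x in val_x]
val_y = [1 if rng.random() < 1.0 / (1.0 + math.exp(-lp)) else 0 for lp in val_lp]

observed_c = c_statistic(val_lp, val_y)
benchmark_c = case_mix_corrected_c(val_lp, 50, rng)
print(f"observed c = {observed_c:.3f}, case-mix-corrected benchmark = {benchmark_c:.3f}")
```

When the observed c-statistic is close to this benchmark, a drop relative to the development setting points to case-mix rather than to incorrect coefficients; an observed value clearly below the benchmark would instead suggest that the coefficients do not hold in the new setting.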
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4755533 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0148820 | PLOS |
Pharmaceuticals (Basel)
January 2025
Centro de Química Médica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago 7780272, Chile.
Acute myeloid leukemia (AML) presents significant therapeutic challenges, particularly in cases driven by mutations in the FLT3 tyrosine kinase. This study aimed to develop a robust and user-friendly machine learning-based quantitative structure-activity relationship (QSAR) model to predict the inhibitory potency (pIC50 values) of FLT3 inhibitors, addressing the limitations of previous models in dataset size, diversity, and predictive accuracy. Using a dataset which was 14 times larger than those employed in prior studies (1350 compounds with 1269 molecular descriptors), we trained a random forest regressor, chosen due to its superior predictive performance and resistance to overfitting.
J Clin Med
January 2025
Division of Cardiac Surgery, Spedali Civili di Brescia, University of Brescia, 25123 Brescia, Italy.
New-onset postoperative atrial fibrillation (POAF) is the most common complication after cardiac surgery, occurring in approximately one-third of patients. This study considered all-comer patients who underwent cardiac surgery to build a predictive model for POAF. A total of 3467 (Center 1) consecutive patients were used as a derivation cohort to build the model.
J Clin Med
January 2025
Department of Obstetrics and Gynecology, Zuyderland Medical Center, Henri Dunantstraat 5, 6419 PC Heerlen, The Netherlands.
A prediction model for anatomical cystocele recurrence after native tissue repair was developed and internally validated in 2016. This model estimates a patient's individual risk of recurrence and can be used for counseling. Before implementation in urogynecological clinical practice, external validation is needed.
Life (Basel)
January 2025
Institute for Diagnostic and Interventional Radiology, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany.
Kirsten Rat Sarcoma viral oncogene homolog (KRAS) is a frequently occurring mutation in non-small-cell lung cancer (NSCLC) and influences cancer treatment and disease progression. In this study, a machine learning (ML) pipeline was applied to radiomic features extracted from public and internal CT images to identify KRAS mutations in NSCLC patients. Both datasets were analyzed using parametric (t-test) and non-parametric (Mann-Whitney U test) statistical tests and dimensionality reduction techniques.
Medicina (Kaunas)
December 2024
Department of Internal Medicine, Hospital Universitario Infanta Leonor-Virgen de la Torre, 28031 Madrid, Spain.
Venous thromboembolism (VTE) can be the first manifestation of an underlying cancer. This study aimed to develop a predictive model to assess the risk of occult cancer between 30 days and 24 months after a venous thrombotic event using machine learning (ML). We designed a case-control study nested in a cohort of patients with VTE included in a prospective registry from two Spanish hospitals between 2005 and 2021.