A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 1034
Function: getPubMedXML

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3152
Function: GetPubMedArticleOutput_2016

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls. | LitMetric

Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls.

Radiol Artif Intell

Department of Computer Science, University of Calgary, Calgary, Canada (F.M., K.O.); Department of Radiology, Massachusetts General Hospital, Boston, Mass (R.G.); Augmented Intelligence & Precision Health Laboratory (AIPHL), Department of Radiology and the Research Institute of the McGill University Health Centre, McGill University, Montreal, Canada (C.R., R.F.); Montreal Imaging Experts, Montreal, Canada (C.R., R.F.); Division of Pathology, Jewish General Hospital, Montreal, Canada (A.S.); and Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurologic Diseases, University of Florida College of Medicine, UF Health Shands Hospital, 1600 SW Archer Rd, Gainesville, FL 32610-0374 (R.F.).

Published: January 2023

Purpose: To investigate the impact of the following three methodological pitfalls on model generalizability: () violation of the independence assumption, () model evaluation with an inappropriate performance indicator or baseline for comparison, and () batch effect.

Materials And Methods: The authors used retrospective CT, histopathologic analysis, and radiography datasets to develop machine learning models with and without the three methodological pitfalls to quantitatively illustrate their effect on model performance and generalizability. F1 score was used to measure performance, and differences in performance between models developed with and without errors were assessed using the Wilcoxon rank sum test when applicable.

Results: Violation of the independence assumption by applying oversampling, feature selection, and data augmentation before splitting data into training, validation, and test sets seemingly improved model F1 scores by 71.2% for predicting local recurrence and 5.0% for predicting 3-year overall survival in head and neck cancer and by 46.0% for distinguishing histopathologic patterns in lung cancer. Randomly distributing data points for a patient across datasets superficially improved the F1 score by 21.8%. High model performance metrics did not indicate high-quality lung segmentation. In the presence of a batch effect, a model built for pneumonia detection had an F1 score of 98.7% but correctly classified only 3.86% of samples from a new dataset of healthy patients.

Conclusion: Machine learning models developed with these methodological pitfalls, which are undetectable during internal evaluation, produce inaccurate predictions; thus, understanding and avoiding these pitfalls is necessary for developing generalizable models. Random Forest, Diagnosis, Prognosis, Convolutional Neural Network (CNN), Medical Image Analysis, Generalizability, Machine Learning, Deep Learning, Model Evaluation Published under a CC BY 4.0 license.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9885377PMC
http://dx.doi.org/10.1148/ryai.220028DOI Listing

Publication Analysis

Top Keywords

machine learning
16
methodological pitfalls
16
learning models
12
three methodological
12
generalizability machine
8
violation independence
8
independence assumption
8
model evaluation
8
model performance
8
models developed
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!