Background Radiomics is the extraction of predefined mathematic features from medical images for the prediction of variables of clinical interest. While some studies report superlative accuracy of radiomic machine learning (ML) models, the published methodology is often incomplete, and the results are rarely validated in external testing data sets. Purpose To characterize the type, prevalence, and statistical impact of methodologic errors present in radiomic ML studies. Materials and Methods Radiomic ML publications were reviewed for the presence of performance-inflating methodologic flaws. Common flaws were subsequently reproduced with randomly generated features interpolated from publicly available radiomic data sets to demonstrate the precarious nature of reported findings. Results In an assessment of radiomic ML publications, the authors uncovered two general categories of data analysis errors: inconsistent partitioning and unproductive feature associations. In simulations, the authors demonstrated that inconsistent partitioning augments radiomic ML accuracy by 1.4 times from unbiased performance and that correcting for flawed methodologic results in areas under the receiver operating characteristic curve approaching a value of 0.5 (random chance). With use of randomly generated features, the authors illustrated that unproductive associations between radiomic features and gene sets can imply false causality for biologic phenomenon. Conclusion Radiomic machine learning studies may contain methodologic flaws that undermine their validity. This study provides a review template to avoid such flaws. © RSNA, 2022 . See also the editorial by Jacobs in this issue.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068883 | PMC |
http://dx.doi.org/10.1148/radiol.220715 | DOI Listing |
Epigenomics
December 2024
Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, Australia.
Aims: Clustering algorithms have been widely applied to tumor DNA methylation datasets to define methylation-based cancer subtypes. This study aimed to evaluate the agreement between subtypes obtained from common clustering strategies.
Materials & Methods: We used tumor DNA methylation data from 409 women with breast cancer from the Melbourne Collaborative Cohort Study (MCCS) and 781 breast tumors from The Cancer Genome Atlas (TCGA).
Res Sq
October 2024
Harvard Medical School, Massachusetts Eye and Ear.
The middle ear transforms sound from low-impedance external air to high-impedance cochlear fluid. However, the human stapes - the input to the cochlea - has been reported to have minimal or no motion above ~ 4 kHz. For decades, this lack of observed high-frequency stapes motion has been puzzling, as it is inconsistent with our ability to hear up to 20 kHz.
View Article and Find Full Text PDFCancer Med
November 2024
Department of Radiation Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China.
J Act Sedentary Sleep Behav
October 2024
University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS 66160 USA.
Faraday Discuss
September 2024
Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois, USA.
Active Thermochemical Tables (ATcT) were successfully used to resolve the existing inconsistencies related to the thermochemistry of glycine, based on statistically analyzing and solving a thermochemical network that includes >3350 chemical species interconnected by nearly 35 000 thermochemically-relevant determinations from experiment and high-level theory. The current ATcT results for the 298.15 K enthalpies of formation are -394.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!