Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models.

Radiology

From the Athinoula A. Martinos Center for Biomedical Imaging (M.G., K.C., J.B.P., K.V.H., S.R.A., P.S., J.K.C.) and Department of Radiology (J.K.C.), Massachusetts General Brigham, 13th St, Building 149, Room 2301, Charlestown, MA 02129; Case Western School of Medicine, Cleveland, Ohio (M.G.); Harvard-MIT Division of Health Sciences and Technology, Cambridge, Mass (J.B.P., K.V.H.); Harvard Graduate Program in Biophysics, Harvard Medical School, Harvard University, Cambridge, Mass (S.R.A.); Geisel School of Medicine at Dartmouth, Dartmouth College, Hanover, NH (S.R.A.); and Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Tex (C.D.F.).

Published: April 2023

Background Radiomics is the extraction of predefined mathematic features from medical images for the prediction of variables of clinical interest. While some studies report superlative accuracy of radiomic machine learning (ML) models, the published methodology is often incomplete, and the results are rarely validated in external testing data sets. Purpose To characterize the type, prevalence, and statistical impact of methodologic errors present in radiomic ML studies. Materials and Methods Radiomic ML publications were reviewed for the presence of performance-inflating methodologic flaws. Common flaws were subsequently reproduced with randomly generated features interpolated from publicly available radiomic data sets to demonstrate the precarious nature of reported findings. Results In an assessment of radiomic ML publications, the authors uncovered two general categories of data analysis errors: inconsistent partitioning and unproductive feature associations. In simulations, the authors demonstrated that inconsistent partitioning augments radiomic ML accuracy by 1.4 times from unbiased performance and that correcting for flawed methodologic results in areas under the receiver operating characteristic curve approaching a value of 0.5 (random chance). With use of randomly generated features, the authors illustrated that unproductive associations between radiomic features and gene sets can imply false causality for biologic phenomenon. Conclusion Radiomic machine learning studies may contain methodologic flaws that undermine their validity. This study provides a review template to avoid such flaws. © RSNA, 2022 . See also the editorial by Jacobs in this issue.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068883PMC
http://dx.doi.org/10.1148/radiol.220715DOI Listing

Publication Analysis

Top Keywords

inconsistent partitioning
12
radiomic
9
partitioning unproductive
8
unproductive feature
8
feature associations
8
radiomic machine
8
machine learning
8
data sets
8
radiomic publications
8
methodologic flaws
8

Similar Publications

Aims: Clustering algorithms have been widely applied to tumor DNA methylation datasets to define methylation-based cancer subtypes. This study aimed to evaluate the agreement between subtypes obtained from common clustering strategies.

Materials & Methods: We used tumor DNA methylation data from 409 women with breast cancer from the Melbourne Collaborative Cohort Study (MCCS) and 781 breast tumors from The Cancer Genome Atlas (TCGA).

View Article and Find Full Text PDF

The middle ear transforms sound from low-impedance external air to high-impedance cochlear fluid. However, the human stapes - the input to the cochlea - has been reported to have minimal or no motion above ~ 4 kHz. For decades, this lack of observed high-frequency stapes motion has been puzzling, as it is inconsistent with our ability to hear up to 20 kHz.

View Article and Find Full Text PDF

Establishment of a Novel Risk Stratification System Integrating Clinical and Pathological Parameters for Prognostication and Clinical Decision-Making in Early-Stage Cervical Cancer.

Cancer Med

November 2024

Department of Radiation Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China.

Article Synopsis
  • A study was conducted on 2,071 early-stage cervical cancer patients to create a risk stratification model for better prognosis by analyzing clinical and pathological factors.
  • The model, developed through recursive partitioning analysis (RPA), successfully categorized patients into four distinct risk groups, showing significant differences in their 5-year overall survival rates.
  • The RPA model outperformed existing staging systems and revealed that treatment efficacy varies based on risk groups, particularly benefiting patients in higher-risk categories who received adjuvant chemoradiotherapy.
View Article and Find Full Text PDF
Article Synopsis
  • Compositional data analysis (CoDA) helps researchers look at how different daily activities like sleeping, sitting, and exercising are connected and affect health, which has become more popular over the last ten years.
  • A review of studies using CoDA found that while many did a good job explaining their methods, they were inconsistent in sharing other important details.
  • Out of 102 studies examined, things like how they measured activities and reported finding were done differently, showing a need for better and clearer reporting practices.
View Article and Find Full Text PDF

Accurate and reliable thermochemistry by data analysis of complex thermochemical networks using Active Thermochemical Tables: the case of glycine thermochemistry.

Faraday Discuss

September 2024

Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois, USA.

Active Thermochemical Tables (ATcT) were successfully used to resolve the existing inconsistencies related to the thermochemistry of glycine, based on statistically analyzing and solving a thermochemical network that includes >3350 chemical species interconnected by nearly 35 000 thermochemically-relevant determinations from experiment and high-level theory. The current ATcT results for the 298.15 K enthalpies of formation are -394.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!