AI Article Synopsis

  • - This study aimed to evaluate the effectiveness of deep learning models using imbalanced imaging data from osteoarthritis research, specifically analyzing knee MRIs and their corresponding MRI Osteoarthritis Knee Score readings.
  • - The research utilized a dataset of 2,996 knee MRIs to compare various performance metrics (like ROC and PR curves) across different data levels and class ratios related to the detection of bone marrow lesions (BMLs).
  • - Results indicated that the ROC curve alone is not effective for imbalanced data, leading to recommendations that PR-AUC should be used for moderate imbalances, while severe imbalances may render deep learning models impractical regardless of adjustments for imbalanced data.

Article Abstract

Purpose: To compare the evaluation metrics for deep learning methods that were developed using imbalanced imaging data in osteoarthritis studies.

Materials And Methods: This retrospective study utilized 2996 sagittal intermediate-weighted fat-suppressed knee MRIs with MRI Osteoarthritis Knee Score readings from 2467 participants in the Osteoarthritis Initiative study. We obtained probabilities of the presence of bone marrow lesions (BMLs) from MRIs in the testing dataset at the sub-region (15 sub-regions), compartment, and whole-knee levels based on the trained deep learning models. We compared different evaluation metrics (e.g., receiver operating characteristic (ROC) and precision-recall (PR) curves) in the testing dataset with various class ratios (presence of BMLs vs. absence of BMLs) at these three data levels to assess the model's performance.

Results: In a subregion with an extremely high imbalance ratio, the model achieved a ROC-AUC of 0.84, a PR-AUC of 0.10, a sensitivity of 0, and a specificity of 1.

Conclusion: The commonly used ROC curve is not sufficiently informative, especially in the case of imbalanced data. We provide the following practical suggestions based on our data analysis: 1) ROC-AUC is recommended for balanced data, 2) PR-AUC should be used for moderately imbalanced data (i.e., when the proportion of the minor class is above 5% and less than 50%), and 3) for severely imbalanced data (i.e., when the proportion of the minor class is below 5%), it is not practical to apply a deep learning model, even with the application of techniques addressing imbalanced data issues.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10524686PMC
http://dx.doi.org/10.1016/j.joca.2023.05.006DOI Listing

Publication Analysis

Top Keywords

deep learning
16
imbalanced data
16
evaluation metrics
12
data
9
metrics deep
8
imbalanced imaging
8
imaging data
8
data osteoarthritis
8
testing dataset
8
data proportion
8

Similar Publications

Background: Digital biomarkers are increasingly used in clinical decision support for various health conditions. Speech features as digital biomarkers can offer insights into underlying physiological processes due to the complexity of speech production. This process involves respiration, phonation, articulation, and resonance, all of which rely on specific motor systems for the preparation and execution of speech.

View Article and Find Full Text PDF

PHIStruct: Improving phage-host interaction prediction at low sequence similarity settings using structure-aware protein embeddings.

Bioinformatics

January 2025

Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, 1004, Philippines.

Motivation: Recent computational approaches for predicting phage-host interaction have explored the use of sequence-only protein language models to produce embeddings of phage proteins without manual feature engineering. However, these embeddings do not directly capture protein structure information and structure-informed signals related to host specificity.

Results: We present PHIStruct, a multilayer perceptron that takes in structure-aware embeddings of receptor-binding proteins, generated via the structure-aware protein language model SaProt, and then predicts the host from among the ESKAPEE genera.

View Article and Find Full Text PDF

Motivation: Predicting RNA-binding proteins (RBPs) is central to understanding post-transcriptional regulatory mechanisms. Here, we introduce EnrichRBP, an automated and interpretable computational platform specifically designed for the comprehensive analysis of RBP interactions with RNA.

Results: EnrichRBP is a web service that enables researchers to develop original deep learning and machine learning architectures to explore the complex dynamics of RNA-binding proteins.

View Article and Find Full Text PDF

Objective: To develop an automatic segmentation model to delineate the adnexal masses and construct a machine learning model to differentiate between low malignant risk and intermediate-high malignant risk of adnexal masses based on ovarian-adnexal reporting and data system (O-RADS).

Methods: A total of 663 ultrasound images of adnexal mass were collected and divided into two sets according to experienced radiologists: a low malignant risk set (n = 446) and an intermediate-high malignant risk set (n = 217). Deep learning segmentation models were trained and selected to automatically segment adnexal masses.

View Article and Find Full Text PDF

Cerebral microbleeds (CMBs) are small, hypointense hemosiderin deposits in the brain measuring 2-10 mm in diameter. As one of the important biomarkers of small vessel disease, they have been associated with various neurodegenerative and cerebrovascular diseases. Hence, automated detection, and subsequent extraction of clinically useful metrics (e.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!