ENRICHing medical imaging training sets enables more efficient machine learning.

J Am Med Inform Assoc

Department of Medicine, Division of Cardiology, Department of Radiology, Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, California, USA.

Published: May 2023

AI Article Synopsis

  • Deep learning (DL) is increasingly used in biomedical imaging, but the scarcity of labeled data from expert annotators makes training models difficult, leading to a need for instance selection to improve model performance.
  • The authors introduce ENRICH, a method designed to prioritize medical images based on the diversity they contribute to the training dataset, addressing the unique challenges posed by medical data.
  • ENRICH significantly enhances classification and segmentation results by using fewer images compared to random selection, while also helping to identify errors in medical imaging datasets efficiently.

Article Abstract

Objective: Deep learning (DL) has been applied in proofs of concept across biomedical imaging, including across modalities and medical specialties. Labeled data are critical to training and testing DL models, but human expert labelers are limited. In addition, DL traditionally requires copious training data, which is computationally expensive to process and iterate over. Consequently, it is useful to prioritize using those images that are most likely to improve a model's performance, a practice known as instance selection. The challenge is determining how best to prioritize. It is natural to prefer straightforward, robust, quantitative metrics as the basis for prioritization for instance selection. However, in current practice, such metrics are not tailored to, and almost never used for, image datasets.

Materials And Methods: To address this problem, we introduce ENRICH-Eliminate Noise and Redundancy for Imaging Challenges-a customizable method that prioritizes images based on how much diversity each image adds to the training set.

Results: First, we show that medical datasets are special in that in general each image adds less diversity than in nonmedical datasets. Next, we demonstrate that ENRICH achieves nearly maximal performance on classification and segmentation tasks on several medical image datasets using only a fraction of the available images and without up-front data labeling. ENRICH outperforms random image selection, the negative control. Finally, we show that ENRICH can also be used to identify errors and outliers in imaging datasets.

Conclusions: ENRICH is a simple, computationally efficient method for prioritizing images for expert labeling and use in DL.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198519PMC
http://dx.doi.org/10.1093/jamia/ocad055DOI Listing

Publication Analysis

Top Keywords

instance selection
8
image adds
8
image
5
enriching medical
4
imaging
4
medical imaging
4
training
4
imaging training
4
training sets
4
sets enables
4

Similar Publications

Background: Complicated wrist amputation caused by severe trauma poses a real challenge for orthopedic and hand surgeons. This study aimed to evaluate a procedure of ulnoradial-metacarpal reconstruction as a rescue option in this challenging situation.

Methods: In total, 12 patients with complicated wrist amputation induced by serious injury were selected from 2015 to 2020 and followed up for 1∼6 years at a level 1 trauma center.

View Article and Find Full Text PDF

Background: The role of beta-blockers in severe, traumatic brain injury (TBI) management is debated. Severe TBI may elicit a surge of catecholamines, which has been associated with increased morbidity and mortality. We hypothesize administering propranolol, a non-selective beta-blocker, within 48 h of TBI will reduce patient mortality within 30 days of injury.

View Article and Find Full Text PDF

Leaflet: Operative Steps for Interventional Studies in Neuroscience.

Neurol Int

December 2024

Santa Lucia Foundation, IRCCS Fondazione Santa Lucia, 00179 Rome, Italy.

Background/objectives: Drug development involves multiple stages, spanning from initial discovery to clinical trials. This intricate process entails understanding disease mechanisms, identifying potential drug targets, and evaluating the efficacy and safety of candidate drugs. Clinical trials are designed to assess the effects of drugs on humans, focusing on determining safety profiles, appropriate modes of administration, and comparative efficacy against placebos.

View Article and Find Full Text PDF

Introdution: Although oral and dental diseases may occur in unexpected or even emergency conditions, but some of the diagnosis and treatments can be algorithmically done following some guidelines. The development and implementation of a system that provides users with a record of history and a proposal of required actions can be not only efficiently practical, but also virtually simple.

Materials And Method: A system made up of web and mobile apps is proposed and evaluated for screening and self-care of oral and dental problems and for providing advice on dental emergencies and therapeutic measures.

View Article and Find Full Text PDF

An external control arm based on health registry data can serve as an alternative comparator in single-arm drug development studies that lack a benchmark for comparison to the experimental treatment. However, accessing such observational healthcare data involves a lengthy and intricate application process, delaying drug approval studies and access to novel treatments. Clinical trials typically comprise only a few hundred patients usually with high-cardinality features, which makes individual data instances more exposed to re-identification attacks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!