AI Article Synopsis

  • * The study conducts experiments on medical imaging, particularly using X-rays affected by COVID-19, to assess three methods for detecting data drift, revealing that monitoring performance alone is insufficient.
  • * The findings emphasize the need for effective data drift detection techniques, which vary based on sample size and patient features, while highlighting gaps in the application of current methods in real-world scenarios.

Article Abstract

While it is common to monitor deployed clinical artificial intelligence (AI) models for performance degradation, it is less common for the input data to be monitored for data drift - systemic changes to input distributions. However, when real-time evaluation may not be practical (eg., labeling costs) or when gold-labels are automatically generated, we argue that tracking data drift becomes a vital addition for AI deployments. In this work, we perform empirical experiments on real-world medical imaging to evaluate three data drift detection methods' ability to detect data drift caused (a) naturally (emergence of COVID-19 in X-rays) and (b) synthetically. We find that monitoring performance alone is not a good proxy for detecting data drift and that drift-detection heavily depends on sample size and patient features. Our work discusses the need and utility of data drift detection in various scenarios and highlights gaps in knowledge for the practical application of existing methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10904813PMC
http://dx.doi.org/10.1038/s41467-024-46142-wDOI Listing

Publication Analysis

Top Keywords

data drift
28
drift detection
12
experiments real-world
8
real-world medical
8
medical imaging
8
data
8
drift
7
empirical data
4
detection experiments
4
imaging data
4

Similar Publications

Medical Imaging Data Strategies for Catalyzing AI Medical Device Innovation.

J Imaging Inform Med

January 2025

Office of Science and Engineering Laboratories, Center for Devices and Radiological Health, U.S. Food and Drug Administration, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA.

Continuous and consistent access to quality medical imaging data stimulates innovations in artificial intelligence (AI) technologies for patient care. Breakthrough innovations in data-driven AI technologies are founded on seamless communication between data providers, data managers, data users and regulators or other evaluators to determine the standards for quality data. However, the complexity in imaging data quality and heterogeneous nature of AI-enabled medical devices and their intended uses presents several challenges limiting the clinical translation of novel AI technologies.

View Article and Find Full Text PDF

Online ensemble model compression for nonstationary data stream learning.

Neural Netw

January 2025

School of Computer Science, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Learning from data streams that emerge from nonstationary environments has many real-world applications and poses various challenges. A key characteristic of such a task is the varying nature of the underlying data distributions over time (concept drifts). However, the most common type of data stream learning approach are ensemble approaches, which involve the training of multiple base learners.

View Article and Find Full Text PDF

Background: To use unmanned aerial vehicles (UAVs) to deliver pesticides, new data are needed to allow regulators to conduct risk assessments. A field trial was conducted to obtain spray drift data relating to ground deposits and airborne spray resulting from a spray application delivered by a small UAV.

Results: A 12 m width area was sprayed with four passes of the UAV and spray deposits were collected within the sprayed area and up to 50 m downwind.

View Article and Find Full Text PDF

The expression of genomically-encoded information is not error-free. Transcript-error rates are dramatically higher than DNA-level mutation rates, and despite their transient nature, the steady-state load of such errors must impose some burden on cellular performance. However, a broad perspective on the degree to which transcript-error rates are constrained by natural selection and diverge among lineages remains to be developed.

View Article and Find Full Text PDF

Multiplexed Immunofluorescence (MxIF) enables detailed immune cell phenotyping, providing critical insights into cell behavior within the tumor immune microenvironment (TIME). However, signal integrity can be compromised due to the complex cyclic staining processes inherent to MxIF. Hematoxylin and Eosin (H&E) staining, on the other hand, offers complementary information through its depiction of cell morphology and texture patterns and is often visually cross-referenced with MxIF in clinical settings.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!