AI Article Synopsis

  • This study investigates potential biases in both human readings of medical images and AI tools trained on human data, specifically focusing on knee osteoarthritis grading.
  • Researchers used a dataset of 50 patients for external validation and a larger cohort of 8,273 to analyze the performance of an FDA-approved AI tool.
  • Findings indicated that the AI tool displayed non-uniformity in disease grading, showing discrepancies of 20-22% and 13.6% in different patient datasets, but its overall accuracy was comparable to experienced radiologists without evidence of age or sex bias.

Article Abstract

Humans have been shown to have biases when reading medical images, raising questions about whether humans are uniform in their disease gradings. Artificial intelligence (AI) tools trained on human-labeled data may have inherent human non-uniformity. In this study, we used a radiographic knee osteoarthritis external validation dataset of 50 patients and a six-year retrospective consecutive clinical cohort of 8,273 patients. An FDA-approved and CE-marked AI tool was tested for potential non-uniformity in Kellgren-Lawrence grades between the right and left sides of the images. We flipped the images horizontally so that a left knee looked like a right knee and vice versa. According to human review, the AI tool showed non-uniformity with 20-22% disagreements on the external validation dataset and 13.6% on the cohort. However, we found no evidence of a significant difference in the accuracy compared to senior radiologists on the external validation dataset, or age bias or sex bias on the cohort. AI non-uniformity can boost the evaluated performance against humans, but image areas with inferior performance should be investigated.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11538298PMC
http://dx.doi.org/10.1038/s41598-024-75752-zDOI Listing

Publication Analysis

Top Keywords

external validation
12
validation dataset
12
artificial intelligence
8
intelligence tools
8
tools trained
8
trained human-labeled
8
human-labeled data
8
knee osteoarthritis
8
data reflect
4
reflect human
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!