The METRIC-framework for assessing data quality for trustworthy AI in medicine: a systematic review.

NPJ Digit Med

Division Medical Physics and Metrological Information Technology, Physikalisch-Technische Bundesanstalt, Berlin, Germany.

Published: August 2024

The adoption of machine learning (ML) and, more specifically, deep learning (DL) applications into all major areas of our lives is underway. The development of trustworthy AI is especially important in medicine due to the large implications for patients' lives. While trustworthiness concerns various aspects including ethical, transparency and safety requirements, we focus on the importance of data quality (training/test) in DL. Since data quality dictates the behaviour of ML products, evaluating data quality will play a key part in the regulatory approval of medical ML products. We perform a systematic review following PRISMA guidelines using the databases Web of Science, PubMed and ACM Digital Library. We identify 5408 studies, out of which 120 records fulfil our eligibility criteria. From this literature, we synthesise the existing knowledge on data quality frameworks and combine it with the perspective of ML applications in medicine. As a result, we propose the METRIC-framework, a specialised data quality framework for medical training data comprising 15 awareness dimensions, along which developers of medical ML applications should investigate the content of a dataset. This knowledge helps to reduce biases as a major source of unfairness, increase robustness, facilitate interpretability and thus lays the foundation for trustworthy AI in medicine. The METRIC-framework may serve as a base for systematically assessing training datasets, establishing reference datasets, and designing test datasets which has the potential to accelerate the approval of medical ML products.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11297942PMC
http://dx.doi.org/10.1038/s41746-024-01196-4DOI Listing

Publication Analysis

Top Keywords

data quality
24
trustworthy medicine
12
systematic review
8
approval medical
8
medical products
8
data
7
quality
6
metric-framework assessing
4
assessing data
4
quality trustworthy
4

Similar Publications

Objective: This study evaluates the utility of word embeddings, generated by large language models (LLMs), for medical diagnosis by comparing the semantic proximity of symptoms to their eponymic disease embedding ("eponymic condition") and the mean of all symptom embeddings associated with a disease ("ensemble mean").

Materials And Methods: Symptom data for 5 diagnostically challenging pediatric diseases-CHARGE syndrome, Cowden disease, POEMS syndrome, Rheumatic fever, and Tuberous sclerosis-were collected from PubMed. Using the Ada-002 embedding model, disease names and symptoms were translated into vector representations in a high-dimensional space.

View Article and Find Full Text PDF

Background: Egg allergy is among the most common food allergies in children, significantly affecting the dietary habits and quality of life of both the affected children and their families. This study aims to assess the clinical role of the Basophil Activation Test (BAT) in children with egg allergy and to evaluate its diagnostic accuracy in comparison to other tests.

Methods: The study included 46 children with egg allergy.

View Article and Find Full Text PDF

Motivation: Recent advancements in parallel sequencing methods have precipitated a surge in publicly available short-read sequence data. This has encouraged the development of novel computational tools for the de novo assembly of transcriptomes from RNA-seq data. Despite the availability of these tools, performing an end-to-end transcriptome assembly remains a programmatically involved task necessitating familiarity with best practices.

View Article and Find Full Text PDF

Evaluating for Health Equity in a Safety Net Hospital: Socioeconomic Status, Adherence, and Outcomes in Cardiac Rehabilitation.

J Cardiopulm Rehabil Prev

January 2025

Author Affiliations: Department of Medicine, Cardiology Section, Boston Medical Center, Boston University School of Medicine, Boston, Massachusetts (Drs Washington-Plaskett and Gilman, Ms Zombeck, and Dr Balady), Biostatistics and Epidemiology Data Analytics Center, Boston University School of Public Health, Boston, Massachusetts (Ms Quinn).

Purpose: Uncovering the racial/ethnic health disparities that exist within cardiovascular medicine offers potential to mitigate treatment gaps that might affect outcomes. Socioeconomic status (SES) may be a more appropriate underlying factor to assess these disparities. We aimed to evaluate whether adherence, attendance, and outcomes in cardiac rehabilitation are associated with SES in a safety net hospital.

View Article and Find Full Text PDF

Objective: This exploratory prospective cohort study aimed to investigate the trajectory of psychological distress and posttraumatic growth (PTG) in rectal cancer patients from diagnosis to follow-up and to explore factors that could predict PTG and psychological distress at follow-up.

Method: We assessed psychological distress (anxiety and depression), PTG, physical symptoms, quality of life, cancer-related coping, state and trait affectivity, resilience, and alexithymia in 43 rectal cancer patients, ) age: 61.6 (12.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!