The METRIC-framework for assessing data quality for trustworthy AI in medicine: a systematic review.

Daniel Schwabe Katinka Becker Martin Seyferth Andreas Klaß Tobias Schaeffter

NPJ Digit Med

Division Medical Physics and Metrological Information Technology, Physikalisch-Technische Bundesanstalt, Berlin, Germany.

Published: August 2024

The adoption of machine learning (ML) and, more specifically, deep learning (DL) applications into all major areas of our lives is underway. The development of trustworthy AI is especially important in medicine due to the large implications for patients' lives. While trustworthiness concerns various aspects including ethical, transparency and safety requirements, we focus on the importance of data quality (training/test) in DL. Since data quality dictates the behaviour of ML products, evaluating data quality will play a key part in the regulatory approval of medical ML products. We perform a systematic review following PRISMA guidelines using the databases Web of Science, PubMed and ACM Digital Library. We identify 5408 studies, out of which 120 records fulfil our eligibility criteria. From this literature, we synthesise the existing knowledge on data quality frameworks and combine it with the perspective of ML applications in medicine. As a result, we propose the METRIC-framework, a specialised data quality framework for medical training data comprising 15 awareness dimensions, along which developers of medical ML applications should investigate the content of a dataset. This knowledge helps to reduce biases as a major source of unfairness, increase robustness, facilitate interpretability and thus lays the foundation for trustworthy AI in medicine. The METRIC-framework may serve as a base for systematically assessing training datasets, establishing reference datasets, and designing test datasets which has the potential to accelerate the approval of medical ML products.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11297942	PMC
http://dx.doi.org/10.1038/s41746-024-01196-4	DOI Listing

Publication Analysis

Top Keywords

data quality

trustworthy medicine

systematic review

approval medical

medical products

data

quality

metric-framework assessing

assessing data

quality trustworthy

Similar Publications

Utility of word embeddings from large language models in medical diagnosis.

J Am Med Inform Assoc

January 2025

Kennewick, WA 99338, United States.

Shahram Yazdani Ronald Claude Henry Avery Byrne Isaac Claude Henry

Objective: This study evaluates the utility of word embeddings, generated by large language models (LLMs), for medical diagnosis by comparing the semantic proximity of symptoms to their eponymic disease embedding ("eponymic condition") and the mean of all symptom embeddings associated with a disease ("ensemble mean").

Materials And Methods: Symptom data for 5 diagnostically challenging pediatric diseases-CHARGE syndrome, Cowden disease, POEMS syndrome, Rheumatic fever, and Tuberous sclerosis-were collected from PubMed. Using the Ada-002 embedding model, disease names and symptoms were translated into vector representations in a high-dimensional space.

View Article and Find Full Text PDF

Similar Publications

The Role of Basophil Activation Test in the Diagnosis of Pediatric Egg Allergy in Turkey: A Comparison of Clinical and Laboratory Findings with Real-Life Data.

Allergol Immunopathol (Madr)

January 2025

Faculty of Medicine, Department of Pediatric Allergy and Immunology, Ondokuz Mayıs University, Samsun, Turkey.

Şefika İlknur Kökcü Karadağ Fadıl Öztürk Recep Sancak Alişan Yıldıran

Background: Egg allergy is among the most common food allergies in children, significantly affecting the dietary habits and quality of life of both the affected children and their families. This study aims to assess the clinical role of the Basophil Activation Test (BAT) in children with egg allergy and to evaluate its diagnostic accuracy in comparison to other tests.

Methods: The study included 46 children with egg allergy.

View Article and Find Full Text PDF

Similar Publications

Semblans: Automated assembly and processing of RNA-Seq data.

Bioinformatics

January 2025

Department of Biological Sciences, University of Illinois at Chicago, Illinois 60607, United States.

Miles D Woodcock-Girard Eric C Bretz Holly M Robertson Karolis Ramanauskas Jarrad T Hampton-Marcell

Motivation: Recent advancements in parallel sequencing methods have precipitated a surge in publicly available short-read sequence data. This has encouraged the development of novel computational tools for the de novo assembly of transcriptomes from RNA-seq data. Despite the availability of these tools, performing an end-to-end transcriptome assembly remains a programmatically involved task necessitating familiarity with best practices.

View Article and Find Full Text PDF

Similar Publications

Evaluating for Health Equity in a Safety Net Hospital: Socioeconomic Status, Adherence, and Outcomes in Cardiac Rehabilitation.

J Cardiopulm Rehabil Prev

January 2025

Author Affiliations: Department of Medicine, Cardiology Section, Boston Medical Center, Boston University School of Medicine, Boston, Massachusetts (Drs Washington-Plaskett and Gilman, Ms Zombeck, and Dr Balady), Biostatistics and Epidemiology Data Analytics Center, Boston University School of Public Health, Boston, Massachusetts (Ms Quinn).

Tulani Washington-Plaskett Joshua P Gilman Emily Quinn Stephanie Zombeck Gary Balady

Purpose: Uncovering the racial/ethnic health disparities that exist within cardiovascular medicine offers potential to mitigate treatment gaps that might affect outcomes. Socioeconomic status (SES) may be a more appropriate underlying factor to assess these disparities. We aimed to evaluate whether adherence, attendance, and outcomes in cardiac rehabilitation are associated with SES in a safety net hospital.

View Article and Find Full Text PDF

Similar Publications

Trajectory and predictors of psychological distress and posttraumatic growth among rectal cancer patients undergoing combined modality treatment: An exploratory prospective study.

Psychol Trauma

January 2025

Department of Psychology, University of Turin.

Agata Benfante Valentina Tesio Pierfrancesco Franco Annunziata Romeo Francesca Arcadipane

Objective: This exploratory prospective cohort study aimed to investigate the trajectory of psychological distress and posttraumatic growth (PTG) in rectal cancer patients from diagnosis to follow-up and to explore factors that could predict PTG and psychological distress at follow-up.

Method: We assessed psychological distress (anxiety and depression), PTG, physical symptoms, quality of life, cancer-related coping, state and trait affectivity, resilience, and alexithymia in 43 rectal cancer patients, ) age: 61.6 (12.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!