The adoption of machine learning (ML) and, more specifically, deep learning (DL) applications into all major areas of our lives is underway. The development of trustworthy AI is especially important in medicine due to the large implications for patients' lives. While trustworthiness concerns various aspects including ethical, transparency and safety requirements, we focus on the importance of data quality (training/test) in DL. Since data quality dictates the behaviour of ML products, evaluating data quality will play a key part in the regulatory approval of medical ML products. We perform a systematic review following PRISMA guidelines using the databases Web of Science, PubMed and ACM Digital Library. We identify 5408 studies, out of which 120 records fulfil our eligibility criteria. From this literature, we synthesise the existing knowledge on data quality frameworks and combine it with the perspective of ML applications in medicine. As a result, we propose the METRIC-framework, a specialised data quality framework for medical training data comprising 15 awareness dimensions, along which developers of medical ML applications should investigate the content of a dataset. This knowledge helps to reduce biases as a major source of unfairness, increase robustness, facilitate interpretability and thus lays the foundation for trustworthy AI in medicine. The METRIC-framework may serve as a base for systematically assessing training datasets, establishing reference datasets, and designing test datasets which has the potential to accelerate the approval of medical ML products.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11297942 | PMC |
http://dx.doi.org/10.1038/s41746-024-01196-4 | DOI Listing |
J Am Med Inform Assoc
January 2025
Kennewick, WA 99338, United States.
Objective: This study evaluates the utility of word embeddings, generated by large language models (LLMs), for medical diagnosis by comparing the semantic proximity of symptoms to their eponymic disease embedding ("eponymic condition") and the mean of all symptom embeddings associated with a disease ("ensemble mean").
Materials And Methods: Symptom data for 5 diagnostically challenging pediatric diseases-CHARGE syndrome, Cowden disease, POEMS syndrome, Rheumatic fever, and Tuberous sclerosis-were collected from PubMed. Using the Ada-002 embedding model, disease names and symptoms were translated into vector representations in a high-dimensional space.
Allergol Immunopathol (Madr)
January 2025
Faculty of Medicine, Department of Pediatric Allergy and Immunology, Ondokuz Mayıs University, Samsun, Turkey.
Background: Egg allergy is among the most common food allergies in children, significantly affecting the dietary habits and quality of life of both the affected children and their families. This study aims to assess the clinical role of the Basophil Activation Test (BAT) in children with egg allergy and to evaluate its diagnostic accuracy in comparison to other tests.
Methods: The study included 46 children with egg allergy.
Bioinformatics
January 2025
Department of Biological Sciences, University of Illinois at Chicago, Illinois 60607, United States.
Motivation: Recent advancements in parallel sequencing methods have precipitated a surge in publicly available short-read sequence data. This has encouraged the development of novel computational tools for the de novo assembly of transcriptomes from RNA-seq data. Despite the availability of these tools, performing an end-to-end transcriptome assembly remains a programmatically involved task necessitating familiarity with best practices.
View Article and Find Full Text PDFJ Cardiopulm Rehabil Prev
January 2025
Author Affiliations: Department of Medicine, Cardiology Section, Boston Medical Center, Boston University School of Medicine, Boston, Massachusetts (Drs Washington-Plaskett and Gilman, Ms Zombeck, and Dr Balady), Biostatistics and Epidemiology Data Analytics Center, Boston University School of Public Health, Boston, Massachusetts (Ms Quinn).
Purpose: Uncovering the racial/ethnic health disparities that exist within cardiovascular medicine offers potential to mitigate treatment gaps that might affect outcomes. Socioeconomic status (SES) may be a more appropriate underlying factor to assess these disparities. We aimed to evaluate whether adherence, attendance, and outcomes in cardiac rehabilitation are associated with SES in a safety net hospital.
View Article and Find Full Text PDFPsychol Trauma
January 2025
Department of Psychology, University of Turin.
Objective: This exploratory prospective cohort study aimed to investigate the trajectory of psychological distress and posttraumatic growth (PTG) in rectal cancer patients from diagnosis to follow-up and to explore factors that could predict PTG and psychological distress at follow-up.
Method: We assessed psychological distress (anxiety and depression), PTG, physical symptoms, quality of life, cancer-related coping, state and trait affectivity, resilience, and alexithymia in 43 rectal cancer patients, ) age: 61.6 (12.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!