Background: Error analysis plays a crucial role in clinical concept extraction, a fundamental subtask within clinical natural language processing (NLP). The process typically involves a manual review of error types, such as contextual and linguistic factors contributing to their occurrence, and the identification of underlying causes to refine the NLP model and improve its performance. Conducting error analysis can be complex, requiring a combination of NLP expertise and domain-specific knowledge. Due to the high heterogeneity of electronic health record (EHR) settings across different institutions, challenges may arise when attempting to standardize and reproduce the error analysis process.
Objectives: This study aims to facilitate a collaborative effort to establish common definitions and taxonomies for capturing diverse error types, fostering community consensus on error analysis for clinical concept extraction tasks.
Materials And Methods: We iteratively developed and evaluated an error taxonomy based on existing literature, standards, real-world data, multisite case evaluations, and community feedback. The finalized taxonomy was released in both .dtd and .owl formats at the Open Health Natural Language Processing Consortium. The taxonomy is compatible with several different open-source annotation tools, including MAE, Brat, and MedTator.
Results: The resulting error taxonomy comprises 43 distinct error classes, organized into 6 error dimensions and 4 properties, including model type (symbolic and statistical machine learning), evaluation subject (model and human), evaluation level (patient, document, sentence, and concept), and annotation examples. Internal and external evaluations revealed strong variations in error types across methodological approaches, tasks, and EHR settings. Key points emerged from community feedback, including the need to enhancing clarity, generalizability, and usability of the taxonomy, along with dissemination strategies.
Conclusion: The proposed taxonomy can facilitate the acceleration and standardization of the error analysis process in multi-site settings, thus improving the provenance, interpretability, and portability of NLP models. Future researchers could explore the potential direction of developing automated or semi-automated methods to assist in the classification and standardization of error analysis.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11187420 | PMC |
http://dx.doi.org/10.1093/jamia/ocae101 | DOI Listing |
Pain Ther
January 2025
Department of Anaesthesia, Tawam Hospital, PO Box 15258, Al Ain, United Arab Emirates.
Introduction: This review aimed to investigate the inadvertent administration of antibiotics via epidural and intrathecal routes. The secondary objective was to identify the contributing human and systemic factors.
Methods: PubMed, Scopus and Google Scholar databases were searched for the last five decades (1973-2023).
Anal Bioanal Chem
January 2025
Institute of Chemistry, Analytical Chemistry, University of Graz, Graz, Austria.
This work provides a statistical analysis of four different approaches suggested in the literature for the estimation of an unknown concentration based on data collected using the standard addition method. These approaches are the conventional extrapolation approach, the interpolation approach, inverse regression, and the normalization approach. These methods are compared under the assumption that the measurement errors are normally distributed and homoscedastic.
View Article and Find Full Text PDFLangenbecks Arch Surg
January 2025
Department of Surgery, Division of HBP Surgery & Liver Transplantation, Korea University College of Medicine, Seoul, Korea.
Purpose: Pancreatectomy patients often experience challenging fluctuations in blood glucose levels; therefore, they require a reliable monitoring system. This study aimed to determine the accuracy and acceptability of a continuous glucose monitoring (CGM) system compared with the intermittent capillary glucose test in patients who have undergone pancreatectomy.
Methods: Thirty non-diabetic pancreatectomy patients participated.
Vet Med Sci
January 2025
Department of Statistics, Faculty of Science, University of Muğla Sıtkı Koçman, Muğla, Turkey.
Background: There is a lack of data on the validation and diagnostic performance of the Freestyle Optium Neo-H (Freestyle) and Centrivet GK (Centrivet) devices for the diagnosis of hypoglycaemia, hyperglycaemia and hyperketonaemia in goats.
Objectives: The aim of the present study was to validate the Freestyle and Centrivet for the analysis of whole blood beta-hydroxybutyric acid (BHBA) and to validate the Freestyle for the analysis of whole blood glucose concentrations using the reference method (RM) in goat blood collected from the jugular and ear veins.
Methods: Venous blood samples were utilised to assess glucose and BHBA concentrations using the Freestyle, Centrivet and RM.
Neuropsychopharmacol Rep
March 2025
National Center of Neurology and Psychiatry, National Institute of Mental Health, Kodaira, Tokyo, Japan.
Aim: The Internet Gaming Disorder Scale is a 9-item screening instrument developed based on the diagnostic criteria for Internet Gaming Disorder (IGD) in the DSM-5. This study aimed to examine the reliability and validity of the Internet Gaming Disorder Scale for children (IGDS-C) in Japanese clinical and nonclinical populations.
Methods: The study included clinical outpatients aged 9-29 with problematic game use and nonclinical adolescents aged 12-18 who played online games at least once a week.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!