AI Article Synopsis

  • - The study aims to create and validate machine learning (ML) models that use natural language processing (NLP) to confirm diagnoses of monoclonal gammopathy of undetermined significance (MGUS) and multiple myeloma (MM) from electronic health records (EHRs) in the Veterans Health Administration.
  • - Researchers analyzed 36,044 EHR documents through various ML classifiers and found that the support vector machine (SVM) model performed best, achieving high scores in diagnosing MGUS and MM, with recall rates of 98.8% and 100%, respectively.
  • - The NLP-assisted model not only accurately confirmed diagnoses but also closely matched the dates of diagnosis with a high degree of accuracy and has the

Article Abstract

Purpose: To develop and validate natural language processing (NLP)-assisted machine learning (ML)-based classification models to confirm diagnoses of monoclonal gammopathy of undetermined significance (MGUS) and multiple myeloma (MM) from electronic health records (EHRs) in the Veterans Health Administration (VHA).

Materials And Methods: We developed precompiled lexicons and classification rules as features for the following ML classifiers: logistic regression, random forest, and support vector machines (SVMs). These features were trained on 36,044 EHR documents from a random sample of 400 patients with at least one International Classification of Disease code for MGUS diagnosis from 1999 to 2021. The best-performing feature combination was calibrated in the validation set (17,826 documents/200 patients) and evaluated in the testing set (9,250 documents/100 patients). Model performance in diagnosis confirmation was compared with manual chart review results (gold standard) using recall, precision, accuracy, and F1 score. For patients correctly labeled as disease-positive, the difference between model-identified diagnosis dates and the gold standard was also computed.

Results: In the testing set, the NLP-assisted classification model using SVMs achieved best performance in both MGUS and MM confirmation with recall/precision/accuracy/F1 of 98.8%/93.3%/93.0%/96.0% for MGUS and 100.0%/92.3%/99.0%/96.0% for MM. Dates of diagnoses matched (±45 days) with those of gold standard in 73.0% of model-confirmed MGUS and 84.6% of model-confirmed MM.

Conclusion: An NLP-assisted classification model can reliably confirm MGUS and MM diagnoses and dates and extract laboratory results using automated interpretation of EHR data. This algorithm has the potential to be adapted to other disease areas in VHA EHR system.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10703129PMC
http://dx.doi.org/10.1200/CCI.23.00081DOI Listing

Publication Analysis

Top Keywords

gold standard
12
natural language
8
classification models
8
models confirm
8
monoclonal gammopathy
8
gammopathy undetermined
8
undetermined significance
8
electronic health
8
health records
8
testing set
8

Similar Publications

Background: The Montreal classification has been widely used in Crohn's disease since 2005 to categorize patients by the age of onset (A), disease location (L), behavior (B), and upper gastrointestinal tract and perianal involvement. With evolving management paradigms in Crohn's disease, we aimed to assess the performance of gastroenterologists in applying the Montreal classification.

Methods: An online survey was conducted among participants at an international educational conference on inflammatory bowel diseases.

View Article and Find Full Text PDF

Objective: To evaluate Chicago Sky Blue (CSB) stain, Calcofluor white (CW) stain, and Potassium Hydroxide (KOH) mount for rapid diagnosis of dermatomycosis, using fungal culture as the gold standard.

Study Design: Cross-sectional analytical study. Place and Duration of the Study: This study was conducted in the Department of Microbiology, Armed Forces Institute of Pathology / National University of Medical Sciences, Rawalpindi, Pakistan, from July 2023 to February 2024.

View Article and Find Full Text PDF

Thyroid tissue is sensitive to the effects of endocrine disrupting substances, and this represents a significant health concern. Histopathological analysis of tissue sections of the rat thyroid gland remains the gold standard for the evaluation for agrochemical effects on the thyroid. However, there is a high degree of variability in the appearance of the rat thyroid gland, and toxicologic pathologists often struggle to decide on and consistently apply a threshold for recording low-grade thyroid follicular hypertrophy.

View Article and Find Full Text PDF

Background: The prognosis of a plasma cell neoplasm (PCN) varies depending on the presence of genetic abnormalities. However, detecting sensitive genetic mutations poses challenges due to the heterogeneous nature of the cell population in bone marrow aspiration. The established gold standard for cell sorting is fluorescence-activated cell sorting (FACS), which is associated with lengthy processing times, substantial cell quantities, and expensive equipment.

View Article and Find Full Text PDF

The accurate assessment of body composition in cirrhosis is challenging as fluid accumulation affects most techniques. The whole-body counter is a state-of-the-art method that measures total body potassium (TBK) unbiased by fluid, from which body cell mass (BCM) is derived. This pilot study in 20 patients with cirrhosis evaluated bedside tools including the liver frailty index (LFI), bioimpedance analysis-based phase angle, calf circumference (CC), and BMI (body mass index)/edema-adjusted CC, and explored their association with TBK and BCM.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!