Using an artificial intelligence tool incorporating natural language processing to identify patients with a diagnosis of ANCA-associated vasculitis in electronic health records.

Comput Biol Med

Center of Expertise for Lupus-, Vasculitis- and Complement-mediated Systemic diseases (LuVaCs), Department of Internal Medicine - Nephrology Section, Leiden University Medical Center, Leiden, the Netherlands. Electronic address:

Published: January 2024

Background: Because anti-neutrophil cytoplasmatic antibody (ANCA)-associated vasculitis (AAV) is a rare, life-threatening, auto-immune disease, conducting research is difficult but essential. A long-lasting challenge is to identify rare AAV patients within the electronic-health-record (EHR)-system to facilitate real-world research. Artificial intelligence (AI)-search tools using natural language processing (NLP) for text-mining are increasingly postulated as a solution.

Methods: We employed an AI-tool that combined text-mining with NLP-based exclusion, to accurately identify rare AAV patients within large EHR-systems (>2.000.000 records). We developed an identification method in an academic center with an established AAV-training set (n = 203) and validated the method in a non-academic center with an AAV-validation set (n = 84). To assess accuracy anonymized patient records were manually reviewed.

Results: Based on an iterative process, a text-mining search was developed on disease description, laboratory measurements, medication and specialisms. In the training center, 608 patients were identified with a sensitivity of 97.0 % (95%CI [93.7, 98.9]) and positive predictive value (PPV) of 56.9 % (95%CI [52.9, 60.1]). NLP-based exclusion resulted in 444 patients increasing PPV to 77.9 % (95%CI [73.7, 81.7]) while sensitivity remained 96.3 % (95%CI [93.8, 98.0]). In the validation center, text-mining identified 333 patients (sensitivity 97.6 % (95%CI [91.6, 99.7]), PPV 58.2 % (95%CI [52.8, 63.6])) and NLP-based exclusion resulted in 223 patients, increasing PPV to 86.1 % (95%CI [80.9, 90.4]) with 98.0 % (95%CI [94.9, 99.4]) sensitivity. Our identification method outperformed ICD-10-coding predominantly in identifying MPO+ and organ-limited AAV patients.

Conclusions: Our study highlights the advantages of implementing AI, notably NLP, to accurately identify rare AAV patients within large EHR-systems and demonstrates the applicability and transportability. Therefore, this method can reduce efforts to identify AAV patients and accelerate real-world research, while avoiding bias by ICD-10-coding.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2023.107757DOI Listing

Publication Analysis

Top Keywords

aav patients
16
identify rare
12
rare aav
12
nlp-based exclusion
12
patients
9
artificial intelligence
8
natural language
8
language processing
8
anca-associated vasculitis
8
accurately identify
8

Similar Publications

Background: Patients with arrhythmogenic cardiomyopathy (ACM) due to pathogenic variants in , the gene for the desmosomal protein plakophilin-2, are being enrolled in gene therapy trials designed to replace the defective allele via adeno-associated viral (AAV) transduction of cardiac myocytes. Evidence from experimental systems and patients indicates that ventricular myocytes in ACM have greatly reduced electrical coupling at gap junctions and reduced Na current density. In previous AAV gene therapy trials, <50% of ventricular myocytes have generally been transduced.

View Article and Find Full Text PDF

Mutations in the gene cause the most common form of human hereditary hearing loss, known as DFNB1. is expressed in two cell groups of the cochlea-epithelial cells of the organ of Corti and fibrocytes of the inner sulcus and lateral wall-but not by sensory hair cells or neurons. Attempts to treat mouse models of DFNB1 with AAV vectors mediating nonspecific expression have not substantially restored function, perhaps because inappropriate expression in hair cells and neurons could compromise their electrical activity.

View Article and Find Full Text PDF

Background: Infectious episodes contribute to morbidity and mortality in patients with anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV). Renal involvement, also known as ANCA-associated glomerulonephritis (AGN), is frequently observed in AAV. Little is known about whether co-infection at initial diagnosis is associated with renal outcome and prognosis in children with AGN.

View Article and Find Full Text PDF

Background: Cardiovascular disease (CVD) is a leading cause of death in ANCA-associated vasculitis (AAV). Screening and primary cardiovascular prevention may improve outcomes.

Methods: We identified patients in the 2002-2019 Mass General Brigham AAV cohort with thoracic CT scans obtained for other clinical purposes.

View Article and Find Full Text PDF

Comprehensive characteristics of pulmonary antineutrophil cytoplasmic antibody-associated vasculitis and the development of a predictive nomogram for mortality.

Int Immunopharmacol

January 2025

Department of Pulmonary and Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, People's Republic of China; State Key Laboratory of Respiratory Health and Multimorbidity, West China Hospital, Sichuan University, Chengdu 610041, People's Republic of China. Electronic address:

Objectives: Antineutrophil cytoplasmic antibody (ANCA) associated vasculitis (AAV) is a rare and potential devastating disease with high mortality, frequently with pulmonary involvement. Our study aimed to explore the pulmonary features of AAV and identify predictors of long-term survival.

Methods: We retrospectively analyzed 538 AAV patients diagnosed between January 2013 and July 2019, with follow-up data extending to August 2020.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!