Identification of patients with carotid stenosis using natural language processing.

Eur Radiol

Department of Radiology and Biomedical Imaging, Yale University School of Medicine, Box 208042, Tompkins East 2, 333 Cedar St, New Haven, CT, 06520-8042, USA.

Published: July 2020

Purpose: The highly structured nature of medical reports makes them feasible for automated large-scale patient identification. This study aimed to develop a natural language processing (NLP) model to retrospectively retrieve patients with presence and history of carotid stenosis (CS) using their ultrasound reports.

Methods: Ultrasound reports from our institution between January 2016 and December 2017 were selected. To process the texts, we developed a parser to divide the raw text into fields. For baseline method, we used bag-of-n-grams and term frequency inverse document frequency as the features and used linear classifiers. Logistic regression was performed as the baseline model. Convolution and recurrent neural networks (CNN; RNN) with attention mechanism were applied to the dataset to improve the classification accuracy.

Results: We had 1220 ultrasound reports for training and 307 for testing, totaling to 1527 reports. For predicting history of CS, both CNN and RNN-attention models had a significantly higher specificity than logistic regression. In addition, RNN-attention also had a significantly higher F1 score and accuracy. For predicting presence of carotid stenosis, all models achieved above 93% accuracy. RNN-attention achieved a 95.4% accuracy, although the difference with logistic regression was not statistically significant. RNN-attention had a statistically significant higher specificity than logistic regression.

Conclusions: We developed linear, CNN, and RNN models to predict history and presence of CS from ultrasound reports. We have demonstrated NLP to be an efficient, accurate approach for large-scale retrospective patient identification, with applications in long-term follow-up of patients and clinical research studies.

Key Points: • Natural language processing models using both linear classifiers and neural networks can achieve a good performance, with an overall accuracy above 90% in predicting history and presence of carotid stenosis. • Convolution and recurrent neural networks, especially with additional features including field awareness and attention mechanism, have superior performance than traditional linear classifiers. • NLP is shown to be an efficient approach for large-scale retrospective patient identification, with applications in long-term follow-up of patients and further clinical research studies.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00330-020-06721-zDOI Listing

Publication Analysis

Top Keywords

carotid stenosis
16
natural language
12
language processing
12
patient identification
12
ultrasound reports
12
linear classifiers
12
logistic regression
12
neural networks
12
convolution recurrent
8
recurrent neural
8

Similar Publications

Background: Cerebral hyperperfusion syndrome (CHS) is a rare but serious complication after carotid artery revascularization. This study aims to determine the impact of carotid artery stenting (CAS) modality on the incidence, severity and overall outcomes of CHS after carotid revascularization.

Methods: Data from patients who underwent CAS with either distal embolic protection (CAS+DEP) or transcarotid artery revascularization (TCAR) were obtained from the Vascular Quality Initiative (VQI) database 2016-2023.

View Article and Find Full Text PDF

Background And Objectives: Although previous trials have established the efficacy and safety of endovascular thrombectomy (EVT) in large ischemic core strokes, most of them excluded patients with extracranial internal carotid artery (e-ICA) occlusion. We aimed to compare outcomes in patients with e-ICA occlusion and large ischemic core infarcts treated with EVT vs medical management (MM).

Methods: This was a secondary analysis of the SELECT2 trial, a randomized controlled trial conducted at 31 international sites.

View Article and Find Full Text PDF

Copy Number Variant Does Not Influence Stroke Severity in 2 C57BL/6J Mouse Models nor in Humans: An Exploratory Study.

Stroke

January 2025

Department of Experimental Neurology, Center for Stroke Research Berlin (CSB), Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Germany (M.F., S.B., S.M., K.W., M.E., A.M., U.D., C.S.).

Background: Contrary to the common belief, the most commonly used laboratory C57BL/6J mouse inbred strain presents a distinctive genetic and phenotypic variability, and for several traits, the genotype-phenotype link remains still unknown. Recently, we characterized the most important stroke survival factor such as brain collateral plasticity in 2 brain ischemia C57BL/6J mouse models (bilateral common carotid artery stenosis and middle cerebral artery occlusion) and observed a Mendelian-like fashion of inheritance of the posterior communicating artery (PcomA) patency. Interestingly, a copy number variant (CNV) spanning locus was reported to segregate in an analogous Mendelian-like pattern in the C57BL/6J colonies of the Jackson Laboratory.

View Article and Find Full Text PDF

Symptomatic Cervical Carotid Artery Stenosis: Evolving Paradigms in Risk Stratification and Intervention.

Ann Indian Acad Neurol

January 2025

Departments of Clinical Neurosciences and Community Health Sciences, The Hotchkiss Brain Institute, The Mathison Centre for Mental Health Research and Education, and The O'Brien Institute for Public Health, University of Calgary Cumming School of Medicine, Calgary, Alberta, Canada.

Symptomatic carotid disease, characterized by atherosclerotic or non-atherosclerotic internal carotid artery disease with ipsilateral stroke symptoms, represents a critical condition in stroke neurology. This "hot carotid" state carries a high risk of stroke recurrence, with almost one-fourth of the patients experiencing recurrent ischemic events within 2 weeks of initial presentation. The global prevalence of significant carotid stenosis (conventionally defined as ≥50% narrowing) is estimated at around 1.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!