Background: Multicenter electronic health records can support quality improvement and comparative effectiveness research in stroke. However, limitations of electronic health record-based research include challenges in abstracting key clinical variables, including stroke severity, along with missing data. We developed a natural language processing model that reads electronic health record notes to directly extract the National Institutes of Health Stroke Scale score when documented and predict the score from clinical documentation when missing.

Methods And Results: The study included notes from patients with acute stroke (aged ≥18 years) admitted to Massachusetts General Hospital (2015-2022). The Massachusetts General Hospital data were divided into training/holdout test (70%/30%) sets. We developed a 2-stage model to predict the admission National Institutes of Health Stroke Scale, obtained from the GWTG (Get With The Guidelines) stroke registry. We trained a model with the least absolute shrinkage and selection operator. For test notes with documented National Institutes of Health Stroke Scale, scores were extracted using regular expressions (stage 1); when not documented, least absolute shrinkage and selection operator was used for prediction (stage 2). The 2-stage model was tested on the holdout test set and validated in the Medical Information Mart for Intensive Care (2001-2012) version 1.4, using root mean squared error and Spearman correlation. We included 4163 patients (Massachusetts General Hospital, 3876; Medical Information Mart for Intensive Care, 287); average age, 69 (SD, 15) years; 53% men, and 72% White individuals. The model achieved a root mean squared error of 2.89 (95% CI, 2.62-3.19) and Spearman correlation of 0.92 (95% CI, 0.91-0.93) in the Massachusetts General Hospital test set, and 2.20 (95% CI, 1.69-2.66) and 0.96 (95% CI, 0.94-0.97) in the MIMIC validation set, respectively.

Conclusions: The automatic natural language processing-based model can enable large-scale stroke severity phenotyping from the electronic health record and support real-world quality improvement and comparative effectiveness studies in stroke.

Download full-text PDF

Source
http://dx.doi.org/10.1161/JAHA.124.036386DOI Listing

Publication Analysis

Top Keywords

electronic health
20
massachusetts general
16
general hospital
16
stroke severity
12
natural language
12
national institutes
12
institutes health
12
health stroke
12
stroke scale
12
stroke
10

Similar Publications

The Crimean Congo virus has been reported to be a part of the spherical RNA-enveloped viruses from the Bunyaviridae family. Crimean Congo fever (CCHF) is a fatal disease with having fatality rate of up to 40%. It is declared endemic by the World Health Organization.

View Article and Find Full Text PDF

Ventilation and features of the lung environment dynamically alter modeled intrapulmonary aerosol exposure from inhaled electronic cigarettes.

Sci Rep

December 2024

Division of Pulmonary and Critical Care, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095-1690, USA.

Electronic cigarettes (e-cigs) fundamentally differ from tobacco cigarettes in their generation of liquid-based aerosols. Investigating how e-cig aerosols behave when inhaled into the dynamic environment of the lung is important for understanding vaping-related exposure and toxicity. A ventilated artificial lung model was developed to replicate the ventilatory and environmental features of the human lung and study their impact on the characteristics of inhaled e-cig aerosols from simulated vaping scenarios.

View Article and Find Full Text PDF

Natural honey is enriched with essential and beneficial nutrients. This study aimed to investigate the melliferous flora microscopic techniques and assess the biochemical properties of honey. Flavonoid and phenolic contents in honey samples were analyzed via colorimetric and Folin-Ciocalteu methods and the alpha-amylase, reducing power, and minerals using Pull's and spectroscopy methods.

View Article and Find Full Text PDF

Prognostic role of aetiological agent vs. clinical pattern in candidates to lead extraction for cardiac implantable electronic device infections.

Sci Rep

December 2024

Department of Medical and Surgical Sciences, Institute of Cardiology, University of Bologna, Policlinico S.Orsola-Malpighi, via Massarenti 9, Bologna, 40138, Italy.

Cardiac implantable electronic devices infections (CIEDI) are associated with poor survival despite the improvement in transvenous lead extraction (TLE). Aetiology and systemic involvement are driving factors of clinical outcomes. The aim of this study was to explore their contribute on overall mortality.

View Article and Find Full Text PDF

E-cigarette/vaping-associated lung injury (EVALI) is strongly associated with vitamin E acetate and often occurs with concomitant tetrahydrocannabinol (THC) use. To uncover pathways associated with EVALI, we examined cytokines, transcriptomic signatures, and lipidomic profiles in bronchoalveolar lavage fluid (BALF) from THC-EVALI patients. At a single center, we prospectively enrolled mechanically ventilated patients with EVALI from THC-containing products (N = 4) and patients with non-vaping acute lung injury and airway controls (N = 5).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!