Natural language processing improves identification of colorectal cancer testing in the electronic medical record.

Med Decis Making

Division of General Internal Medicine and Public Health, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee (JCD, NNC, JFP, NBP)

Published: June 2012

Background: Difficulty identifying patients in need of colorectal cancer (CRC) screening contributes to low screening rates.

Objective: To use Electronic Health Record (EHR) data to identify patients with prior CRC testing.

Design: A clinical natural language processing (NLP) system was modified to identify 4 CRC tests (colonoscopy, flexible sigmoidoscopy, fecal occult blood testing, and double contrast barium enema) within electronic clinical documentation. Text phrases in clinical notes referencing CRC tests were interpreted by the system to determine whether testing was planned or completed and to estimate the date of completed tests.

Setting: Large academic medical center.

Patients: 200 patients ≥ 50 years old who had completed ≥ 2 non-acute primary care visits within a 1-year period.

Measures: Recall and precision of the NLP system, billing records, and human chart review were compared to a reference standard of human review of all available information sources.

Results: For identification of all CRC tests, recall and precision were as follows: NLP system (recall 93%, precision 94%), chart review (74%, 98%), and billing records review (44%, 83%). Recall and precision for identification of patients in need of screening were: NLP system (recall 95%, precision 88%), chart review (99%, 82%), and billing records (99%, 67%).

Limitations: Small sample size and requirement for a robust EHR.

Conclusions: Applying NLP to EHR records detected more CRC tests than either manual chart review or billing records review alone. NLP had better precision but marginally lower recall to identify patients who were due for CRC screening than billing record review.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9616628PMC
http://dx.doi.org/10.1177/0272989X11400418DOI Listing

Publication Analysis

Top Keywords

nlp system
16
crc tests
16
billing records
16
chart review
16
recall precision
12
natural language
8
language processing
8
colorectal cancer
8
crc screening
8
identify patients
8

Similar Publications

Centrifugal compressors are widely used in the oil and natural gas industry for gas compression, reinjection, and transportation. Fault diagnosis and identification of centrifugal compressors are crucial. To promptly monitor abnormal changes in compressor data and trace the causes leading to these data anomalies, this paper proposes a security monitoring and root cause tracing method for compressor data anomalies.

View Article and Find Full Text PDF

Carrot callus grown on a medium with increased nitrogen have reduced carotenoid accumulation, changed gene expression, high amount of vesicular plastids and altered cell wall composition. Carotenoid biosynthesis is vital for plant development and quality, yet its regulation under varying nutrient conditions remains unclear. To explore the effects of nitrogen (N) availability, we used carrot (Daucus carota L.

View Article and Find Full Text PDF

Reevaluating Anti-Inflammatory Therapy: Targeting Senescence to Balance Anti-Cancer Efficacy and Vascular Disease.

Arterioscler Thromb Vasc Biol

January 2025

Department of Cardiology, The University of Texas MD Anderson Cancer Center, Houston. (B.C.-C., N.A.V.G., N.L.P., L.P.E., V.S.K.S., A.M.O., J.L., G.M., O.H., A.D., S.W.Y., C.A.I., K.C.O.M., S. Kotla, J.-i.A.).

Modulating immune function is a critical strategy in cancer and atherosclerosis treatments. For cancer, boosting or maintaining the immune system is crucial to prevent tumor growth. However, in vascular disease, mitigating immune responses can decrease inflammation and slow atherosclerosis progression.

View Article and Find Full Text PDF

This study aimed to develop an advanced ensemble approach for automated classification of mental health disorders in social media posts. The research question was: can an ensemble of fine-tuned transformer models (XLNet, RoBERTa, and ELECTRA) with Bayesian hyperparameter optimization improve the accuracy of mental health disorder classification in social media text. Three transformer models (XLNet, RoBERTa, and ELECTRA) were fine-tuned on a dataset of social media posts labelled with 15 distinct mental health disorders.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!