Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods.

BMC Med Inform Decis Mak

Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA.

Published: September 2022

Background: Diabetic retinopathy (DR) is a leading cause of blindness in American adults. If detected, DR can be treated to prevent further damage causing blindness. There is increasing interest in developing artificial intelligence (AI) technologies to help detect DR using electronic health records. The lesion-related information documented in fundus image reports is a valuable resource that could help diagnose DR in clinical decision support systems. However, most studies of AI-based DR diagnosis rely on medical images; few studies have explored the lesion-related information captured in free-text image reports.

Methods: In this study, we examined two state-of-the-art transformer-based natural language processing (NLP) models, BERT and RoBERTa, and compared them with a recurrent neural network implemented using long short-term memory (LSTM) to extract DR-related concepts from clinical narratives. We identified four categories of DR-related clinical concepts (lesions, eye parts, laterality, and severity), developed annotation guidelines, annotated a DR corpus of 536 image reports, and developed transformer-based NLP models for clinical concept extraction and relation extraction. We also examined relation extraction under two settings: a 'gold-standard' setting, in which gold-standard concepts were used, and an end-to-end setting.
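The difference between the two relation extraction settings can be illustrated with a minimal sketch. It assumes that relations link a lesion to its attributes (eye part, laterality, severity) and that candidate pairs are formed within a single report; the abstract does not spell out either detail. The `Concept` type, the `candidate_pairs` helper, and the example concepts are hypothetical and are not taken from the annotated corpus.

```python
from itertools import product
from typing import List, NamedTuple, Tuple


class Concept(NamedTuple):
    text: str
    category: str  # "lesion", "eye_part", "laterality", or "severity"


# Assumption: the three non-lesion categories act as attributes of a lesion.
ATTRIBUTE_CATEGORIES = {"eye_part", "laterality", "severity"}


def candidate_pairs(concepts: List[Concept]) -> List[Tuple[Concept, Concept]]:
    """Pair every lesion with every attribute concept found in one report."""
    lesions = [c for c in concepts if c.category == "lesion"]
    attributes = [c for c in concepts if c.category in ATTRIBUTE_CATEGORIES]
    return list(product(lesions, attributes))


# Invented example. Gold-standard setting: candidates come from annotated concepts.
gold_concepts = [
    Concept("microaneurysms", "lesion"),
    Concept("macula", "eye_part"),
    Concept("left eye", "laterality"),
]
# End-to-end setting: candidates come from a model's predicted concepts,
# so a missed concept ("macula" here) removes a candidate relation.
predicted_concepts = [
    Concept("microaneurysms", "lesion"),
    Concept("left eye", "laterality"),
]

gold_setting = candidate_pairs(gold_concepts)
e2e_setting = candidate_pairs(predicted_concepts)
```

In this sketch a relation classifier would then label each candidate pair; concept extraction errors propagate only in the end-to-end setting, which is consistent with the lower end-to-end scores reported in the Results.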

Results: For concept extraction, the BERT model pretrained with the MIMIC III dataset achieved the best performance (0.9503 and 0.9645 for strict and lenient evaluation, respectively). For relation extraction, the BERT model pretrained using general English text achieved the best strict/lenient F1-score of 0.9316. The end-to-end system, BERT_general_e2e, achieved the best strict/lenient F1-scores of 0.8578 and 0.8881, respectively. Another end-to-end system based on the RoBERTa architecture, RoBERTa_general_e2e, matched BERT_general_e2e on the strict score.
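The abstract does not define strict and lenient evaluation. The sketch below assumes the convention common in clinical NLP shared tasks: a strict match requires identical span boundaries and concept type, while a lenient match requires only overlapping spans of the same type. The `Span` type, the function names, and the example offsets are hypothetical, and this is not the authors' evaluation script.

```python
from typing import Callable, List, NamedTuple


class Span(NamedTuple):
    start: int  # inclusive character offset
    end: int    # exclusive character offset
    label: str  # concept category, e.g. "lesion"


def strict_match(a: Span, b: Span) -> bool:
    """Same boundaries and same label."""
    return a == b


def lenient_match(a: Span, b: Span) -> bool:
    """Same label and overlapping character ranges."""
    return a.label == b.label and a.start < b.end and b.start < a.end


def f1_score(gold: List[Span], pred: List[Span],
             match: Callable[[Span, Span], bool]) -> float:
    """F1 over spans under the given matching criterion."""
    matched_pred = sum(any(match(g, p) for g in gold) for p in pred)
    matched_gold = sum(any(match(g, p) for p in pred) for g in gold)
    precision = matched_pred / len(pred) if pred else 0.0
    recall = matched_gold / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Invented example: the first prediction has the right label but a shorter span.
gold = [Span(0, 15, "lesion"), Span(23, 31, "laterality")]
pred = [Span(0, 9, "lesion"), Span(23, 31, "laterality")]

strict_f1 = f1_score(gold, pred, strict_match)
lenient_f1 = f1_score(gold, pred, lenient_match)
```

Under this convention the lenient score can never be lower than the strict score, which matches the pattern of the paired figures reported above.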

Conclusions: This study demonstrated the efficiency of transformer-based NLP models for clinical concept extraction and relation extraction. Our results show that it is necessary to pretrain transformer models using clinical text to optimize performance for clinical concept extraction. For relation extraction, however, transformers pretrained using general English text perform better.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9513862
DOI: http://dx.doi.org/10.1186/s12911-022-01996-2