Improving biomedical Named Entity Recognition with additional external contexts.

J Biomed Inform

University of Engineering and Technology, Vietnam National University, Hanoi, Viet Nam. Electronic address:

Published: August 2024

AI Article Synopsis

  • Biomedical Named Entity Recognition (bio NER) is advanced by a new model that incorporates external contexts for better identification of entities in biomedical texts.
  • The model retrieves relevant sentences from PubMed, ranks them, and combines them with original input sequences to improve feature representation using PubMedBERT and a BiLSTM layer.
  • Experiments show that this approach outperforms existing methods, highlighting that context improves entity recognition, with PubMed being more effective than Google for relevant information.

Article Abstract

Objective: Biomedical Named Entity Recognition (bio NER) is the task of recognizing named entities in biomedical texts. This paper introduces a new model that addresses bio NER by considering additional external contexts. Different from prior methods that mainly use original input sequences for sequence labeling, the model takes into account additional contexts to enhance the representation of entities in the original sequences, since additional contexts can provide enhanced information for the concept explanation of biomedical entities.

Methods: To exploit an additional context, given an original input sequence, the model first retrieves the relevant sentences from PubMed and then ranks the retrieved sentences to form the contexts. It next combines the context with the original input sequence to form a new enhanced sequence. The original and new enhanced sequences are fed into PubMedBERT for learning feature representation. To obtain more fine-grained features, the model stacks a BiLSTM layer on top of PubMedBERT. The final named entity label prediction is done by using a CRF layer. The model is jointly trained in an end-to-end manner to take advantage of the additional context for NER of the original sequence.

Results: Experimental results on six biomedical datasets show that the proposed model achieves promising performance compared to strong baselines and confirms the contribution of additional contexts for bio NER.

Conclusion: The promising results confirm three important points. First, the additional context from PubMed helps to improve the quality of the recognition of biomedical entities. Second, PubMed is more appropriate than the Google search engine for providing relevant information of bio NER. Finally, more relevant sentences from the context are more beneficial than irrelevant ones to provide enhanced information for the original input sequences. The model is flexible to integrate any additional context types for the NER task.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jbi.2024.104674DOI Listing

Publication Analysis

Top Keywords

original input
16
additional context
16
named entity
12
bio ner
12
additional contexts
12
additional
9
biomedical named
8
entity recognition
8
additional external
8
external contexts
8

Similar Publications

Individual movements of bats are triggered by their life requirements, limited by their recognition of the environment and risks of moving, and mediated by habitat selection. Mining adds fragmentation and heterogeneity to landscapes, with poorly understood consequences to the life activities of the bats. Cave dwelling bats spend most of their life cycles within caves, and as they constantly forage in external landscapes, their contribution in the input of organic matter to the caves is of paramount importance to the subterranean biodiversity.

View Article and Find Full Text PDF

Objective: Brief hospital course (BHC) summaries are clinical documents that summarize a patient's hospital stay. While large language models (LLMs) depict remarkable capabilities in automating real-world tasks, their capabilities for healthcare applications such as synthesizing BHCs from clinical notes have not been shown. We introduce a novel preprocessed dataset, the MIMIC-IV-BHC, encapsulating clinical note and BHC pairs to adapt LLMs for BHC synthesis.

View Article and Find Full Text PDF

Biomarkers.

Alzheimers Dement

December 2024

School of Psychological Sciences and Turner Institute for Brain and Mental Health, Monash University, Monash, VIC, Australia.

Background: Diagnostic and prognostic decisions about Alzheimer's disease (AD) are more accurate when based on large data sets. We developed and validated a machine learning (ML) data harmonization tool for aggregation of prospective data from neuropsychological tests applied to study AD. The online ML-combine application (OML-combine app) allows researchers to utilize the ML-harmonization method for harmonization of their own data with that from other large available data bases (e.

View Article and Find Full Text PDF

Developing Topics.

Alzheimers Dement

December 2024

Institute of Cognitive Neurology and Dementia Research (IKND), Otto-von-Guericke University, Magdeburg, Germany.

Background: Alzheimer's Disease (AD), a progressively worsening neurodegenerative disorder, impacts millions globally. Understanding its progression is crucial for developing effective interventions and management strategies. However, high variability in disease progression amongst individuals and the complexity of neuroimaging data pose significant challenges.

View Article and Find Full Text PDF

Background: The National Institute on Aging (NIA) Imbedded Pragmatic Alzheimer's Disease and Alzheimer's Related Dementia (AD/ADRD) Clinical Trials (IMPACT) Collaboratory, in partnership with the Alzheimer's Association, convened a Lived Experience Panel (LEP), a group of 9-12 individuals, including people living with cognitive symptoms, proxies representing people with an advanced cognitive disorder or who are deceased, and care partners of a person living with dementia. The aim was for the LEP members to share their experiences with research, inform the development of research priorities, and provide input on conducting embedded pragmatic clinical trials (ePCTs) of dementia care interventions. Given the importance of providing a space for people with lived experiences to share their thoughts and recommendations, we continue to report on the final stage of LEP in its original design.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!