BioBBC: a multi-feature model that enhances the detection of biomedical entities.

Sci Rep

Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.

Published: April 2024

AI Article Synopsis

  • The paper addresses the need for effective systems in Biomedical Named Entity Recognition (BioNER) due to the rising volume of biomedical publications that often contain complex names and abbreviations.
  • It introduces BioBBC, a deep learning model built on a combination of BERT, Bi-LSTM, and CRF, designed to process biomedical text and identify entities by using multiple embedding techniques.
  • Experimental results show that BioBBC significantly outperforms existing state-of-the-art models across six BioNER benchmark datasets, demonstrating its effectiveness in recognizing various biomedical entities.

Article Abstract

The rapid increase in biomedical publications necessitates efficient systems to automatically handle Biomedical Named Entity Recognition (BioNER) tasks in unstructured text. However, accurately detecting biomedical entities is quite challenging due to the complexity of their names and the frequent use of abbreviations. In this paper, we propose BioBBC, a deep learning (DL) model that utilizes multi-feature embeddings and is constructed based on the BERT-BiLSTM-CRF to address the BioNER task. BioBBC consists of three main layers; an embedding layer, a Long Short-Term Memory (Bi-LSTM) layer, and a Conditional Random Fields (CRF) layer. BioBBC takes sentences from the biomedical domain as input and identifies the biomedical entities mentioned within the text. The embedding layer generates enriched contextual representation vectors of the input by learning the text through four types of embeddings: part-of-speech tags (POS tags) embedding, char-level embedding, BERT embedding, and data-specific embedding. The BiLSTM layer produces additional syntactic and semantic feature representations. Finally, the CRF layer identifies the best possible tag sequence for the input sentence. Our model is well-constructed and well-optimized for detecting different types of biomedical entities. Based on experimental results, our model outperformed state-of-the-art (SOTA) models with significant improvements based on six benchmark BioNER datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10987643PMC
http://dx.doi.org/10.1038/s41598-024-58334-xDOI Listing

Publication Analysis

Top Keywords

biomedical entities
16
embedding layer
8
crf layer
8
biomedical
7
embedding
6
layer
6
biobbc
4
biobbc multi-feature
4
model
4
multi-feature model
4

Similar Publications

Gold-silver synergism has been well documented in many scientific works dealing with luminescent nanostructures that are exploitable in biomedical and environmental application. Frequently, the ratio of Au : Ag in synthetic mixtures was varied to influence the extent of Au-Ag synergism of the resulting luminescent gold-silver nanoclusters (GSNCs). However, in our approach, a new step, maturing under differing conditions using the same Au : Ag ratio (5 : 1), has been investigated systematically for the very first time.

View Article and Find Full Text PDF

Bipolar disorder is a leading contributor to the global burden of disease. Despite high heritability (60-80%), the majority of the underlying genetic determinants remain unknown. We analysed data from participants of European, East Asian, African American and Latino ancestries (n = 158,036 cases with bipolar disorder, 2.

View Article and Find Full Text PDF

The hypothalamic-pituitary-gonadal axis is regulated by the gonadotropin-releasing hormone pulse generator in the hypothalamus. This is comprised of neurons that secrete kisspeptin in a pulsatile manner to stimulate the release of GnRH, and, in turn, downstream gonadotropins from the pituitary gland, and subsequently sex steroids and gametogenesis from the gonads. Many reproductive disorders in both males and females are characterized by hypothalamic dysfunction, including functional disorders (such as age-related hypogonadism, obesity-related secondary hypogonadism, hyperprolactinemia, functional hypothalamic amenorrhea and polycystic ovary syndrome), structural pathologies (such as craniopharyngiomas or radiation or surgery-related hypothalamic dysfunction), and pubertal disorders (constitutional delay of growth and puberty and congenital hypogonadotropic hypogonadism).

View Article and Find Full Text PDF

Evaluating the effectiveness of mandibular advancement devices in treating very severe obstructive sleep apnea: a retrospective cohort study.

Sleep Breath

January 2025

Department of Oral Medicine, Sedation and Imaging, Hadassah Medical Center, Faculty of Dental Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.

Background: The repeated airway obstructions in the common disorder Obstructive Sleep Apnea (OSA) cause health risks. Continuous Positive Airway Pressure (CPAP), the standard treatment, faces adherence challenges. Mandibular Advancement Devices (MADs) have been used successfully for mild to moderate OSA, as a good alternative for these patients.

View Article and Find Full Text PDF

Analysis of longitudinal social media for monitoring symptoms during a pandemic.

J Biomed Inform

January 2025

School of Public Health, Zhejiang University School of Medicine, Hangzhou 310058 China; Department of Medicine, Harvard Medical School, Boston, MA 02115, USA. Electronic address:

Objective: Current studies leveraging social media data for disease monitoring face challenges like noisy colloquial language and insufficient tracking of user disease progression in longitudinal data settings. This study aims to develop a pipeline for collecting, cleaning, and analyzing large-scale longitudinal social media data for disease monitoring, with a focus on COVID-19 pandemic.

Materials And Methods: This pipeline initiates by screening COVID-19 cases from tweets spanning February 1, 2020, to April 30, 2022.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!