For any molecule, network, or process of interest, keeping up with new publications is becoming increasingly difficult. For many cellular processes, the number of molecules and interactions that need to be considered can be very large. Automated mining of publications can support large-scale molecular interaction maps and database curation. Text mining and Natural-Language-Processing (NLP)-based techniques are finding applications in mining the biological literature, handling problems such as Named Entity Recognition (NER) and Relationship Extraction (RE). Both rule-based and Machine-Learning (ML)-based NLP approaches have been popular in this context, with multiple research and review articles examining the scope of such models in Biological Literature Mining (BLM). In this review article, we explore self-attention-based models, a special type of Neural-Network (NN)-based architecture that has recently revitalized the field of NLP, as applied to biological texts. We cover self-attention models published from 2019 onwards that operate at either the sentence level or the abstract level in the context of molecular interaction extraction. We conduct a comparative study of the models in terms of their architecture. Moreover, we discuss some limitations in the field of BLM that identify opportunities for the extraction of molecular interactions from biological text.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8615611
DOI: http://dx.doi.org/10.3390/biom11111591

Similar Publications

LOGOWheat: deep learning-based prediction of regulatory effects for noncoding variants in wheats.

Brief Bioinform

November 2024

Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, No. 97 Buxin Road, Dapeng New District, Shenzhen 518124, China.

Identifying the regulatory effects of noncoding variants presents a significant challenge. Recently, the accumulation of epigenomic profiling data in wheat has provided an opportunity to model the functional impacts of these variants. In this study, we introduce Language of Genome for Wheat (LOGOWheat), a deep learning-based tool designed to predict the regulatory effects of noncoding variants in wheat.

Purpose: Breast cancer is a significant threat to women's health. Precise prognosis prediction for breast cancer can help doctors implement more rational treatment strategies. Artificial intelligence can assist doctors in decision-making and enhance prediction accuracy.

Simultaneous structured light imaging of multiple objects has become more demanding and more widely used in many scenarios involving robot operations in intelligent manufacturing. However, it is challenged by pattern aliasing caused by mutual reflection between highly reflective objects. To this end, we propose to learn clear fringe patterns from aliased mutual-reflective observations with diffusion models, achieving high-fidelity multi-body reconstruction in line with typical phase-shift algorithms.

Prototype-based methods in deep learning offer interpretable explanations for decisions by comparing inputs to typical representatives in the data. This study explores the adaptation of SESM, a self-attention-based prototype method successful in electrocardiogram (ECG) tasks, for electroencephalogram (EEG) signals. The architecture is evaluated on sleep stage classification, exploring its efficacy in predicting stages with single-channel EEG.

Article Synopsis
  • The annotation of cell types in single-cell RNA sequencing data is essential for understanding biological processes, but current methods require labor-intensive manual efforts for accuracy.
  • The authors introduced scSwinTNet, a pre-trained model that utilizes self-attention techniques to automatically annotate cell types from large scRNA-seq datasets.
  • Tested on a dataset of over 399,000 cells from human and mouse tissues, scSwinTNet shows improved annotation accuracy without needing prior knowledge or manual intervention.