Text Mining for Building Biomedical Networks Using Cancer as a Case Study.

Biomolecules

LASIGE, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal.

Published: September 2021

In the assembly of biological networks it is important to provide reliable interactions in an effort to have the most possible accurate representation of real-life systems. Commonly, the data used to build a network comes from diverse high-throughput essays, however most of the interaction data is available through scientific literature. This has become a challenge with the notable increase in scientific literature being published, as it is hard for human curators to track all recent discoveries without using efficient tools to help them identify these interactions in an automatic way. This can be surpassed by using text mining approaches which are capable of extracting knowledge from scientific documents. One of the most important tasks in text mining for biological network building is relation extraction, which identifies relations between the entities of interest. Many interaction databases already use text mining systems, and the development of these tools will lead to more reliable networks, as well as the possibility to personalize the networks by selecting the desired relations. This review will focus on different approaches of automatic information extraction from biomedical text that can be used to enhance existing networks or create new ones, such as deep learning state-of-the-art approaches, focusing on cancer disease as a case-study.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8533101PMC
http://dx.doi.org/10.3390/biom11101430DOI Listing

Publication Analysis

Top Keywords

text mining
16
scientific literature
8
text
5
networks
5
mining building
4
building biomedical
4
biomedical networks
4
networks cancer
4
cancer case
4
case study
4

Similar Publications

Talaromeroterpenoids A-G (-), seven new 3,5-dimethylorsellinic-acid-derived meroterpenoids, and two known analogues ( and ) were isolated from the mangrove endophytic fungus sp. JNQQJ-4 by genome analysis and a molecular networking strategy. Their structures and absolute configurations were established by nuclear magnetic resonance data, high-resolution electrospray ionization mass spectrometry, and X-ray diffraction.

View Article and Find Full Text PDF

Background: Stroke is one of the leading causes of death and long-term adult disability worldwide. Stroke causes neurodegeneration and impairs synaptic function. Understanding the role of synaptic proteins and associated signalling pathways in stroke pathology could offer insights into therapeutic approaches as well as improving rehabilitation-related treatment regimes.

View Article and Find Full Text PDF

Preparation and evaluation of T4SS recombinant proteins in serodiagnosis of human brucellosis based on TMT-based proteomics technology.

Front Cell Infect Microbiol

January 2025

Jiangsu Engineering Research Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, Jiangsu, China.

Introduction: Brucellosis, a significant zoonotic infectious disease, poses a global health threat. Accurate and efficient diagnosis is crucial for prevention, control, and treatment of brucellosis. VirB proteins, components of the Type IV secretion system (T4SS) in , play a pivotal role in bacterial virulence and pathogenesis but have been understudied for their diagnostic potential.

View Article and Find Full Text PDF

Classification performance and reproducibility of GPT-4 omni for information extraction from veterinary electronic health records.

Front Vet Sci

January 2025

Department of Pathology, Microbiology and Immunology, School of Veterinary Medicine, University of California, Davis, Davis, CA, United States.

Large language models (LLMs) can extract information from veterinary electronic health records (EHRs), but performance differences between models, the effect of hyperparameter settings, and the influence of text ambiguity have not been previously evaluated. This study addresses these gaps by comparing the performance of GPT-4 omni (GPT-4o) and GPT-3.5 Turbo under different conditions and by investigating the relationship between human interobserver agreement and LLM errors.

View Article and Find Full Text PDF

Background: Biomedical text mining is a technique that extracts essential information from scientific articles using named entity recognition (NER). Traditional NER methods rely on dictionaries, rules, or curated corpora, which may not always be accessible. To overcome these challenges, deep learning (DL) methods have emerged.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!