In the assembly of biological networks it is important to provide reliable interactions in an effort to have the most possible accurate representation of real-life systems. Commonly, the data used to build a network comes from diverse high-throughput essays, however most of the interaction data is available through scientific literature. This has become a challenge with the notable increase in scientific literature being published, as it is hard for human curators to track all recent discoveries without using efficient tools to help them identify these interactions in an automatic way. This can be surpassed by using text mining approaches which are capable of extracting knowledge from scientific documents. One of the most important tasks in text mining for biological network building is relation extraction, which identifies relations between the entities of interest. Many interaction databases already use text mining systems, and the development of these tools will lead to more reliable networks, as well as the possibility to personalize the networks by selecting the desired relations. This review will focus on different approaches of automatic information extraction from biomedical text that can be used to enhance existing networks or create new ones, such as deep learning state-of-the-art approaches, focusing on cancer disease as a case-study.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8533101 | PMC |
http://dx.doi.org/10.3390/biom11101430 | DOI Listing |
Org Lett
January 2025
School of Pharmacy, Anhui Medical University, Hefei 230032, China.
Talaromeroterpenoids A-G (-), seven new 3,5-dimethylorsellinic-acid-derived meroterpenoids, and two known analogues ( and ) were isolated from the mangrove endophytic fungus sp. JNQQJ-4 by genome analysis and a molecular networking strategy. Their structures and absolute configurations were established by nuclear magnetic resonance data, high-resolution electrospray ionization mass spectrometry, and X-ray diffraction.
View Article and Find Full Text PDFAnn Neurosci
January 2025
Department of Biotechnology, School of Bioengineering, SRM Institute of Science and Technology, Kattankulathur, Tamil Nadu, India.
Background: Stroke is one of the leading causes of death and long-term adult disability worldwide. Stroke causes neurodegeneration and impairs synaptic function. Understanding the role of synaptic proteins and associated signalling pathways in stroke pathology could offer insights into therapeutic approaches as well as improving rehabilitation-related treatment regimes.
View Article and Find Full Text PDFFront Cell Infect Microbiol
January 2025
Jiangsu Engineering Research Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, Jiangsu, China.
Introduction: Brucellosis, a significant zoonotic infectious disease, poses a global health threat. Accurate and efficient diagnosis is crucial for prevention, control, and treatment of brucellosis. VirB proteins, components of the Type IV secretion system (T4SS) in , play a pivotal role in bacterial virulence and pathogenesis but have been understudied for their diagnostic potential.
View Article and Find Full Text PDFFront Vet Sci
January 2025
Department of Pathology, Microbiology and Immunology, School of Veterinary Medicine, University of California, Davis, Davis, CA, United States.
Large language models (LLMs) can extract information from veterinary electronic health records (EHRs), but performance differences between models, the effect of hyperparameter settings, and the influence of text ambiguity have not been previously evaluated. This study addresses these gaps by comparing the performance of GPT-4 omni (GPT-4o) and GPT-3.5 Turbo under different conditions and by investigating the relationship between human interobserver agreement and LLM errors.
View Article and Find Full Text PDFBMC Bioinformatics
January 2025
Department of Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India.
Background: Biomedical text mining is a technique that extracts essential information from scientific articles using named entity recognition (NER). Traditional NER methods rely on dictionaries, rules, or curated corpora, which may not always be accessible. To overcome these challenges, deep learning (DL) methods have emerged.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!