Background: Text mining tools have gained popularity to process the vast amount of available research articles in the biomedical literature. It is crucial that such tools extract information with a sufficient level of detail to be applicable in real life scenarios. Studies of mining non-causal molecular relations attribute to this goal by formally identifying the relations between genes, promoters, complexes and various other molecular entities found in text. More importantly, these studies help to enhance integration of text mining results with database facts.

Results: We describe, compare and evaluate two frameworks developed for the prediction of non-causal or 'entity' relations (REL) between gene symbols and domain terms. For the corresponding REL challenge of the BioNLP Shared Task of 2011, these systems ranked first (57.7% F-score) and second (41.6% F-score). In this paper, we investigate the performance discrepancy of 16 percentage points by benchmarking on a related and more extensive dataset, analysing the contribution of both the term detection and relation extraction modules. We further construct a hybrid system combining the two frameworks and experiment with intersection and union combinations, achieving respectively high-precision and high-recall results. Finally, we highlight extremely high-performance results (F-score > 90%) obtained for the specific subclass of embedded entity relations that are essential for integrating text mining predictions with database facts.

Conclusions: The results from this study will enable us in the near future to annotate semantic relations between molecular entities in the entire scientific literature available through PubMed. The recent release of the EVEX dataset, containing biomolecular event predictions for millions of PubMed articles, is an interesting and exciting opportunity to overlay these entity relations with event predictions on a literature-wide scale.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3384255PMC
http://dx.doi.org/10.1186/1471-2105-13-S11-S6DOI Listing

Publication Analysis

Top Keywords

molecular entities
12
text mining
12
entity relations
8
event predictions
8
relations
6
semantically linking
4
molecular
4
linking molecular
4
entities literature
4
literature entity
4

Similar Publications

A CRISPR-Cas and Argonaute-Driven Two-Factor Authentication Strategy for Information Security.

ACS Nano

January 2025

Department of Clinical Laboratory of Sir Run Run Shaw Hospital, College of Biosystems Engineering and Food Science, Zhejiang University School of Medicine, Hangzhou 310058, People's Republic of China.

The escalating growth in computing power and the advent of quantum computing present a critical threat to the security of modern cryptography. Two-factor authentication strategies can effectively resist brute-force attacks to improve the security of access control. Herein, we proposed a two-factor and two-authentication entity strategy based on the trans-cleavage activity of CRISPR-Cas and the "dual-step" sequence-specific cleavage of Argonaute.

View Article and Find Full Text PDF

Trichogerminoma (TG) is a rare adnexal tumor with hair follicle differentiation with less than 50 cases reported in the literature. In 2022, by investigating the genetic profile of 21 cases, our group identified recurrent rearrangements of the GRHL1/2/3 genes in this tumor entity, suggesting such alteration as the main oncogenic driver in TG. Up to now, only one TG case of malignant transformation has been reported.

View Article and Find Full Text PDF

Comprehensive genetic analysis of tumors with exome or whole genome sequencing has enabled the identification of the genes that are recurrently mutated in cancer. This has stimulated a series of exciting advances over the past 15 years, guiding us to new molecular biomarkers and therapeutic targets among the common mature B-cell neoplasms. In particular, diffuse large B-cell lymphoma (DLBCL), follicular lymphoma (FL) and Burkitt lymphoma (BL) have each been the subject of considerable attention in this field.

View Article and Find Full Text PDF

This research is purposed to synthesize the existing evidence on implant survival rates in patients with oral potentially malignant disorders (OPMD) and assess potential risk factors associated with peri-implant disease. A comprehensive search was performed across PubMed MEDLINE, Cochrane Library, and Web of Science. This review was conducted according to the PRISMA guidelines, including studies published between 2012 and 2023.

View Article and Find Full Text PDF

Recent advances in molecular genetics, particularly in identifying and characterizing genetic abnormalities within mesenchymal neoplasms, have led to a more comprehensive and evolving classification system. Modern technological developments in cytogenetics and next-generation sequencing have enabled the analysis of small clinical samples, expanded our understanding of tumor biology, and improved the diagnostic, prognostic, and predictive precision by identifying targeted genetic alterations, confirming the presence of fusion transcripts, and/or revealing the overexpression of specific genes and their targets. In this review, we focus specifically on the -rearranged enteric tumor, a recent clinicopathological entity that has emerged within the expanding classification of mesenchymal tumors.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!