DEXTER: Disease-Expression Relation Extraction from Text.

Database (Oxford)

Department of Computer and Information Sciences, University of Delaware, 18 Amstel Avenue, Newark, DE 19716, USA.

Published: January 2018

Gene expression levels affect biological processes and play a key role in many diseases. Characterizing expression profiles is useful for clinical research and for the diagnosis and prognosis of diseases. Several high-quality databases currently capture gene expression information in the context of disease, obtained mostly from large-scale studies using technologies such as microarrays and next-generation sequencing. The scientific literature is another rich source of information on gene expression-disease relationships, which have been captured not only in large-scale studies but also in thousands of small-scale studies. Expression information obtained from the literature through manual curation can extend expression databases. While many existing databases include information from the literature, they are limited by the time-consuming nature of manual curation and have difficulty keeping up with the explosion of publications in the biomedical field. In this work, we describe an automated text-mining tool, Disease-Expression Relation Extraction from Text (DEXTER), which extracts information from the literature on gene and microRNA expression in the context of disease. One motivation for developing DEXTER was to extend the BioXpress database, a cancer-focused gene expression database that includes data derived from large-scale experiments and from manual curation of publications. The literature-based portion of BioXpress lags significantly behind the expression information obtained from large-scale studies and can benefit from our text-mined results. We conducted two evaluations to measure the accuracy of our text-mining tool and achieved average F-scores of 88.51% and 81.81%, respectively.
To demonstrate the ability to extract rich expression information in different disease-related scenarios, we also used DEXTER to extract differential expression information for 2024 genes in lung cancer, 115 glycosyltransferases in 62 cancers and 826 microRNAs in 171 cancers. All extractions made with DEXTER are integrated into the literature-based portion of BioXpress.

Database URL: http://biotm.cis.udel.edu/DEXTER

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007211
DOI: http://dx.doi.org/10.1093/database/bay045
