Regulatory annotation of genomic intervals based on tissue-specific expression QTLs.

Bioinformatics

Department of Biostatics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA.

Published: February 2020

Motivation: Annotating a given genomic locus or a set of genomic loci is an important yet challenging task. This is especially true for the non-coding part of the genome which is enormous yet poorly understood. Since gene set enrichment analyses have demonstrated to be effective approach to annotate a set of genes, the same idea can be extended to explore the enrichment of functional elements or features in a set of genomic intervals to reveal potential functional connections.

Results: In this study, we describe a novel computational strategy named loci2path that takes advantage of the newly emerged, genome-wide and tissue-specific expression quantitative trait loci (eQTL) information to help annotate a set of genomic intervals in terms of transcription regulation. By checking the presence or the absence of millions of eQTLs in a set of input genomic intervals, combined with grouping eQTLs by the pathways or gene sets that their target genes belong to, loci2path build a bridge connecting genomic intervals to functional pathways and pre-defined biological-meaningful gene sets, revealing potential for regulatory connection. Our method enjoys two key advantages over existing methods: first, we no longer rely on proximity to link a locus to a gene which has shown to be unreliable; second, eQTL allows us to provide the regulatory annotation under the context of specific tissue types. To demonstrate its utilities, we apply loci2path on sets of genomic intervals harboring disease-associated variants as query. Using 1 702 612 eQTLs discovered by the Genotype-Tissue Expression (GTEx) project across 44 tissues and 6320 pathways or gene sets cataloged in MSigDB as annotation resource, our method successfully identifies highly relevant biological pathways and revealed disease mechanisms for psoriasis and other immune-related diseases. Tissue specificity analysis of associated eQTLs provide additional evidence of the distinct roles of different tissues played in the disease mechanisms.

Availability And Implementation: loci2path is published as an open source Bioconductor package, and it is available at http://bioconductor.org/packages/release/bioc/html/loci2path.html.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8215915PMC
http://dx.doi.org/10.1093/bioinformatics/btz669DOI Listing

Publication Analysis

Top Keywords

genomic intervals
24
set genomic
12
gene sets
12
regulatory annotation
8
genomic
8
tissue-specific expression
8
annotate set
8
pathways gene
8
intervals
6
set
6

Similar Publications

Aims: This study investigated the association between maternal age and early and late gestational diabetes mellitus (GDM).

Methods: In total, 72,270 pregnant women were included in this prospective birth cohort study. Associations between maternal age and early GDM (diagnosed at <24 gestational weeks) and late GDM (diagnosed at ≥24 gestational weeks) were evaluated using a multinomial logistic regression model with possible confounding factors.

View Article and Find Full Text PDF

SUGT1 is a prognostic biomarker and is associated with immune infiltrates in ovarian cancer.

Eur J Med Res

January 2025

Department of Obstetrics and Gynecology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, NO. 3 Qingchun East Road, Hangzhou, 310016, China.

Background: Ovarian cancer (OC) is a prevalent gynecological malignancy with a relatively dismal prognosis. The SGT1 homolog (SUGT1) protein, which interacts with heat shock protein 90 and is essential for the G1/S and G2/M transitions, was formerly thought to be a cancer promoter, but its precise role in OC remains unknown.

Methods: We conducted a comprehensive bioinformatics analysis of SUGT1 expression in patients with OC compared with their normal controls, including the data from the cancer genome atlas (TCGA), genotype-tissue expression (GTEx) databases, gene ontology (GO) analysis, Kyoto Encylopedia of Genes and Genomes (KEGG) analysis, gene set enrichment analysis (GSEA), single sample gene set enrichment analysis (ssGSEA).

View Article and Find Full Text PDF

Background: Epidemiological and genetic studies have elucidated associations between antihypertensive medication and Alzheimer's disease (AD), with the directionality of these associations varying upon the specific class of antihypertensive agents.

Methods: Genetic instruments for the expression of antihypertensive drug target genes were identified using expression quantitative trait loci (eQTL) in blood, which are associated with systolic blood pressure (SBP). Exposure was derived from existing eQTL data in blood from the eQTLGen consortium and in the brain from the PsychENCODE and subsequently replicated in GTEx V8 and BrainMeta V2.

View Article and Find Full Text PDF

Purpose: To develop a deep learning (DL) model based on primary tumor tissue to predict the lymph node metastasis (LNM) status of muscle invasive bladder cancer (MIBC), while validating the prognostic value of the predicted aiN score in MIBC patients.

Methods: A total of 323 patients from The Cancer Genome Atlas (TCGA) were used as the training and internal validation set, with image features extracted using a visual encoder called UNI. We investigated the ability to predict LNM status while assessing the prognostic value of aiN score.

View Article and Find Full Text PDF

Background: Papillary Thyroid Carcinoma (PTC) is the most common thyroid cancer, with an etiology and progression that are not fully understood. Research suggests a link between cathepsins and PTC, but the causal nature of this link is unclear. This study uses Mendelian Randomization (MR) to investigate if cathepsins causally influence PTC risk.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!