Transcription factor binding site (TFBS) identification plays an important role in deciphering gene regulatory codes. With comprehensive knowledge of TFBSs, one can understand molecular mechanisms of gene regulation. In the recent decades, various computational approaches have been proposed to predict TFBSs in the genome. The TFBS dataset of a TF generated by each algorithm is a ranked list of predicted TFBSs of that TF, where top ranked TFBSs are statistically significant ones. However, whether these statistically significant TFBSs are functional (i.e. biologically relevant) is still unknown. Here we develop a post-processor, called the functional propensity calculator (FPC), to assign a functional propensity to each TFBS in the existing computationally predicted TFBS datasets. It is known that functional TFBSs reveal strong positional preference towards the transcriptional start site (TSS). This motivates us to take TFBS position relative to the TSS as the key idea in building our FPC. Based on our calculated functional propensities, the TFBSs of a TF in the original TFBS dataset could be reordered, where top ranked TFBSs are now the ones with high functional propensities. To validate the biological significance of our results, we perform three published statistical tests to assess the enrichment of Gene Ontology (GO) terms, the enrichment of physical protein-protein interactions, and the tendency of being co-expressed. The top ranked TFBSs in our reordered TFBS dataset outperform the top ranked TFBSs in the original TFBS dataset, justifying the effectiveness of our post-processor in extracting functional TFBSs from the original TFBS dataset. More importantly, assigning functional propensities to putative TFBSs enables biologists to easily identify which TFBSs in the promoter of interest are likely to be biologically relevant and are good candidates to do further detailed experimental investigation. The FPC is implemented as a web tool at http://santiago.ee.ncku.edu.tw/FPC/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3873331PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0083791PLOS

Publication Analysis

Top Keywords

tfbs dataset
20
top ranked
16
ranked tfbss
16
tfbss
13
functional propensities
12
tfbss original
12
original tfbs
12
tfbs
9
transcription factor
8
factor binding
8

Similar Publications

Background: Long terminal repeats (LTRs) represent important parts of LTR retrotransposons and retroviruses found in high copy numbers in a majority of eukaryotic genomes. LTRs contain regulatory sequences essential for the life cycle of the retrotransposon. Previous experimental and sequence studies have provided only limited information about LTR structure and composition, mostly from model systems.

View Article and Find Full Text PDF

Background: The precise prediction of transcription factor binding sites (TFBSs) is pivotal for unraveling the gene regulatory networks underlying biological processes. While numerous tools have emerged for in silico TFBS prediction in recent years, the evolving landscape of computational biology necessitates thorough assessments of tool performance to ensure accuracy and reliability. Only a limited number of studies have been conducted to evaluate the performance of TFBS prediction tools comprehensively.

View Article and Find Full Text PDF
Article Synopsis
  • - The study investigates the relationship between the ribosomal biogenesis factor NSA2 and mitochondrial dysfunction in chronic kidney disease (CKD), linking cytosolic and mitochondrial processes involved in its pathology.
  • - Research methods included culturing human renal tubular cells and measuring mRNA levels, protein content, and cellular respiration while analyzing transcriptomic data from existing CKD datasets.
  • - Results indicate that both NSA2 and GFM2 express together in a dose-dependent manner influenced by zinc and hyperglycemia, with increased expression observed in CKD patients, highlighting their potential role in mitochondrial function and kidney disease progression.
View Article and Find Full Text PDF
Article Synopsis
  • The study explores how promoter-bound transcription factors interact with alternative splicing in human cells, revealing a largely uncharted regulatory relationship.* -
  • Utilizing computational methods, the researchers analyze a comprehensive dataset to identify connections between transcription factor binding sites and splicing patterns, with machine learning models showing promise in predicting splicing events.* -
  • The results indicate that changes in transcription factor binding can affect splicing efficiency, particularly emphasizing the role of the CTCFL factor, which was validated through experimental data in cell lines.*
View Article and Find Full Text PDF

Controlling the gene expression is the most important development in a living organism, which makes it easier to find different kinds of diseases and their causes. It's very difficult to know what factors control the gene expression. Transcription Factor (TF) is a protein that plays an important role in gene expression.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!