Motivation: The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily.

Results: Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting.

Availability And Implementation: The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo.

Contact: : martin.nettling@informatik.uni-halle.de.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5447242PMC
http://dx.doi.org/10.1093/bioinformatics/btx033DOI Listing

Publication Analysis

Top Keywords

phylogenetic footprinting
24
substitution probabilities
24
classification performance
16
phylogenetic trees
12
performance phylogenetic
12
phylogenetic
9
binding sites
8
trees substitution
8
real data
8
find classification
8

Similar Publications

Transcriptional regulation allows cells to execute developmental programs, maintain homeostasis, and respond to intra- and extracellular signals. Central to these processes are promoters, which in eukaryotes are sequences upstream of genes that bind transcription factors (TFs) and which recruit RNA polymerase to initiate mRNA synthesis. Valuable tools for studying promoters include reporter genes, which can be used to indicate when and where genes are activated.

View Article and Find Full Text PDF

RS24090, a TetR family transcriptional repressor, negatively affects the rimocidin biosynthesis in Streptomyces rimosus M527.

Int J Biol Macromol

January 2025

Zhejiang Provincial Key Laboratory of Biometrology and Inspection & Quarantine, College of Life Sciences, China Jiliang University, Hangzhou, Zhejiang Province 310018, China. Electronic address:

The TetR family of regulators (TFRs), commonly reported as repressors, plays a role in regulating secondary metabolite production in Streptomyces. In this study, we sought to elucidate the relationship between TFRs and rimocidin production of Streptomyces rimosus M527. Through transcriptomic analysis, we identified the protein RS24090, which exhibited significant differential expression.

View Article and Find Full Text PDF

The two-component system response regulator BvrR binds to three DNA regulatory boxes in the upstream region of .

Front Microbiol

September 2023

Programa de Investigación en Enfermedades Tropicales, Escuela de Medicina Veterinaria, Universidad Nacional de Costa Rica, Heredia, Costa Rica.

is a facultative extracellular-intracellular bacterial zoonotic pathogen worldwide. It is also a major cause of abortion in bovines, generating economic losses. The two-component regulatory system BvrR/BvrS modulates the expression of genes required to transition from extracellular to intracellular lifestyles.

View Article and Find Full Text PDF

Hnf1b renal expression directed by a distal enhancer responsive to Pax8.

Sci Rep

November 2022

Laboratoire de Biologie du Développement, CNRS, Institut de Biologie Paris Seine, IBPS, UMR7622, Sorbonne Université, 75005, Paris, France.

Xenopus provides a simple and efficient model system to study nephrogenesis and explore the mechanisms causing renal developmental defects in human. Hnf1b (hepatocyte nuclear factor 1 homeobox b), a gene whose mutations are the most commonly identified genetic cause of developmental kidney disease, is required for the acquisition of a proximo-intermediate nephron segment in Xenopus as well as in mouse. Genetic networks involved in Hnf1b expression during kidney development remain poorly understood.

View Article and Find Full Text PDF

Toxicity Analysis of Pentachlorophenol Data with a Bioinformatics Tool Set.

Methods Mol Biol

April 2022

Sony Computer Science Laboratories Inc., Tokyo, Japan.

Rapid progress in technologies opened the new era of computer-leaded analytics, leaving humans more space for experimental design and decision making. Here we demonstrate the machine learning analysis workflow represented by spectral clustering, elucidation of evolutionary conserved transcription regulation, and network analysis using reverse engineering. Analysis of genes induced by the Pentachlorophenol toxic chemical revealed two subnetworks, one orchestrated by Interferon and another by Nuclear receptor factor 2 (NRF2) gene.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!