One of the main goals of analysing DNA sequences is to understand the temporal and positional information that specifies gene expression. An important step in this process is the recognition of gene expression regulatory elements. Experimental procedures for this are slow and costly. In this paper we present a computational non-supervised algorithm that facilitates the process by statistically identifying the most likely regions within a putative regulatory sequence. A probabilistic technique is presented, based on the approximation of regulatory DNA with a Markov chain, for the location of putative transcription factor binding sites in a single stretch of DNA. Hereto we developed a procedure to approximate the order of Markov model for a given DNA sequence that circumvents some of the prohibitive assumptions underlying Markov modeling. Application of the algorithm to data from 55 genes in five species shows the high sensitivity of this Markov search algorithm. Our algorithm does not require any prior knowledge in the form of description or cross-genomic comparison; it is context sensitive and takes DNA heterogeneity into account.

Download full-text PDF

Source
http://dx.doi.org/10.1142/s0219720006001813DOI Listing

Publication Analysis

Top Keywords

gene expression
8
markov
5
dna
5
transcription binding
4
binding site
4
site prediction
4
prediction markov
4
markov models
4
models main
4
main goals
4

Similar Publications

Pathway analysis plays a critical role in bioinformatics, enabling researchers to identify biological pathways associated with various conditions by analyzing gene expression data. However, the rise of large, multi-center datasets has highlighted limitations in traditional methods like Over-Representation Analysis (ORA) and Functional Class Scoring (FCS), which struggle with low signal-to-noise ratios (SNR) and large sample sizes. To tackle these challenges, we use a deep learning-based classification method, Gene PointNet, and a novel $P$-value computation approach leveraging the confusion matrix to address pathway analysis tasks.

View Article and Find Full Text PDF

Purpose: After failing primary and secondary hormonal therapy, castration-resistant and neuroendocrine prostate cancer metastatic to the bone is invariably lethal, although treatment with docetaxel and carboplatin can modestly improve survival. Therefore, agents targeting biologically relevant pathways in PCa and potentially synergizing with docetaxel and carboplatin in inhibiting bone metastasis growth are urgently needed.

Experimental Design: Phosphorylated (activated) AXL expression in human prostate cancer bone metastases was assessed by immunohistochemical staining.

View Article and Find Full Text PDF

Spatial transcriptomics enhances our understanding of cellular organization by mapping gene expression data to precise tissue locations. Here, we present a protocol for using weighted ensemble method for spatial transcriptomics (WEST), which uses ensemble techniques to boost the robustness and accuracy of existing algorithms. We describe steps for preprocessing data, obtaining embeddings from individual algorithms, and ensemble integrating all embeddings as a similarity matrix.

View Article and Find Full Text PDF
Article Synopsis
  • Primary ciliary dyskinesia (PCD) is a rare genetic disorder linked to chronic respiratory issues, infertility, and problems with body asymmetry, primarily caused by mutations in the CCDC39 and CCDC40 genes.
  • Researchers used advanced techniques to investigate how these genetic variants impact cellular functions beyond just causing cilia to stop moving.
  • They discovered that the absence of CCDC39/CCDC40 creates a significant loss of over 90 ciliary structural proteins, leading to cilia dysfunction and other cellular issues, suggesting that gene therapy could potentially offer a new treatment strategy for PCD.
View Article and Find Full Text PDF

Primary mitochondrial disorders are most often caused by deleterious mutations in the mitochondrial DNA (mtDNA). Here, we used a mitochondrial DddA-derived cytosine base editor (DdCBE) to introduce a compensatory edit in a mouse model that carries the pathological mutation in the mitochondrial transfer RNA (tRNA) alanine (mt-tRNA) gene. Because the original m.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!