Perm-seq: Mapping Protein-DNA Interactions in Segmental Duplication and Highly Repetitive Regions of Genomes with Prior-Enhanced Read Mapping.

PLoS Comput Biol

Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America; Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin, United States of America.

Published: October 2015

Segmental duplications and other highly repetitive regions of genomes contribute significantly to cells' regulatory programs. Advancements in next generation sequencing enabled genome-wide profiling of protein-DNA interactions by chromatin immunoprecipitation followed by high throughput sequencing (ChIP-seq). However, interactions in highly repetitive regions of genomes have proven difficult to map since short reads of 50-100 base pairs (bps) from these regions map to multiple locations in reference genomes. Standard analytical methods discard such multi-mapping reads and the few that can accommodate them are prone to large false positive and negative rates. We developed Perm-seq, a prior-enhanced read allocation method for ChIP-seq experiments, that can allocate multi-mapping reads in highly repetitive regions of the genomes with high accuracy. We comprehensively evaluated Perm-seq, and found that our prior-enhanced approach significantly improves multi-read allocation accuracy over approaches that do not utilize additional data types. The statistical formalism underlying our approach facilitates supervising of multi-read allocation with a variety of data sources including histone ChIP-seq. We applied Perm-seq to 64 ENCODE ChIP-seq datasets from GM12878 and K562 cells and identified many novel protein-DNA interactions in segmental duplication regions. Our analysis reveals that although the protein-DNA interactions sites are evolutionarily less conserved in repetitive regions, they share the overall sequence characteristics of the protein-DNA interactions in non-repetitive regions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4618727PMC
http://dx.doi.org/10.1371/journal.pcbi.1004491DOI Listing

Publication Analysis

Top Keywords

protein-dna interactions
20
repetitive regions
20
highly repetitive
16
regions genomes
16
interactions segmental
8
segmental duplication
8
regions
8
prior-enhanced read
8
multi-mapping reads
8
perm-seq prior-enhanced
8

Similar Publications

Piperazine-based compounds have garnered significant attention due to their notable biological and pharmacological activities, making them essential in fine chemical and pharmaceutical applications. In this study, we managed to synthesize a novel hybrid bis-cyanoacrylamide bearing the piperazine core via phenoxymethyl linker and incorporating sulphamethoxazole moiety. The novel compound was fully characterized using different spectral data including 1H-NMR, C-NMR, and FTIR spectroscopy.

View Article and Find Full Text PDF

Protective immunity induced by a novel P1 adhesin C-terminal anchored mRNA vaccine against infection in BALB/c mice.

Microbiol Spectr

January 2025

Hunan Province Cooperative Innovation Center for Molecular Target New Drug Study, Hengyang Medical College, Institute of Pathogenic Biology, University of South China, Hengyang, China.

(Mp), a unique pathogen devoid of a cell wall, is naturally impervious to penicillin antibiotics. This bacterium is the causative agent of pneumonia, an acute pulmonary affliction marked by interstitial lung damage. Non-macrolide medications may have potential adverse effects on the developmental trajectory of children, thereby establishing macrolides as the preferred treatment for in pediatric patients.

View Article and Find Full Text PDF

Accurate modeling of the structures of protein-protein complexes and other biomolecular interactions represents a longstanding and important challenge for computational biology. The Critical Assessment of PRedicted Interactions (CAPRI) experiment has served for over two decades as a key means to assess and compare current approaches and methods through blind predictive scenarios, highlighting useful strategies, and new developments. Here we describe the performance of our laboratory's team in recent CAPRI rounds, which included submissions for 10 modeling targets.

View Article and Find Full Text PDF

The replicative polymerase delta is inefficient copying repetitive DNA sequences. Error-prone translesion polymerases have been shown to switch with high-fidelity replicative polymerases to help navigate repetitive DNA. We and others have demonstrated the importance of one such translesion polymerase, polymerase Eta (pol eta), in facilitating replication at genomic regions called common fragile sites (CFS), which are difficult-to-replicate genomic regions that are hypersensitive to replication stress.

View Article and Find Full Text PDF

DNA methylation patterns are inherited from the parental germline to the embryo. In mature sperm, the sites of unmethylated DNA are tightly coupled to sites of histone retention at gene regulatory elements that are implicated in paternal epigenetic inheritance. The timing and mechanism of site-specific DNA demethylation in the male germline currently remains unknown.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!