In higher eukaryotes, many genes are regulated by enhancers that are 10-10 base pairs (bp) away from the promoter. Enhancers contain transcription-factor-binding sites (which are typically around 7-22 bp), and physical contact between the promoters and enhancers is thought to be required to modulate gene expression. Although chromatin architecture has been mapped extensively at resolutions of 1 kilobase and above; it has not been possible to define physical contacts at the scale of the proteins that determine gene expression.
View Article and Find Full Text PDFIn the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding.
View Article and Find Full Text PDFBackground: Mammalian transcriptomes contain thousands of long noncoding RNAs (lncRNAs). Some lncRNAs originate from intragenic enhancers which, when active, behave as alternative promoters producing transcripts that are processed using the canonical signals of their host gene. We have followed up this observation by analyzing intergenic lncRNAs to determine the extent to which they might also originate from intergenic enhancers.
View Article and Find Full Text PDF