Previous studies have shown that Mixed Lineage Leukemia 1 (MLL1 or MLL) binds a group of CpG-rich motifs known as morphemes. To examine whether occurrences of MLL1 morphemes in genomic DNA may influence codon utilization, we analyzed the frequency of various 9-mers in human cDNAs and in total human genomic DNA. We uncovered preferential utilization of GGC for Gly, GCG for Ala, CCG for Pro, and TCG for Ser, in coding sequences (CDSs) that included MLL1 morphemes. We also examined weighted occurrences of CDS 9-mers in a 30-base window that moved along each human chromosome. In plots, we observed peaks with fluctuating intensities. High intensity peaks appeared within promoter and exons localized in CpG islands, encompassing sequences that included MLL1 morphemes. High intensity peaks included CCG/GGC repeats, whose expansion may cause neurological disorders and congenital malformations. Such repeats are generated from overlap of a morpheme (CGCCG/CGGCG), which depending on reading frame and orientation would produce runs of Ala, Gly, or Pro in proteins. Overall, our results point to a role for morpheme occurrences on synonymous codon utilization in human genomic DNA and indicate that regulatory instructions are dispersed not only in promoters but also in exons of human genes.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/bip.22681 | DOI Listing |
BMC Genomics
June 2022
Department of Chemistry, Purdue University, West Lafayette, IN, 47907, USA.
Background: In mammals, Imprinting Control Regions (ICRs) regulate a subset of genes in a parent-of-origin-specific manner. In both human and mouse, previous studies identified a set of CpG-rich motifs occurring as clusters in ICRs and germline Differentially Methylated Regions (gDMRs). These motifs consist of the ZFP57 binding site (ZFBS) overlapping a subset of MLL binding units known as MLL morphemes.
View Article and Find Full Text PDFData Brief
August 2017
Information Technology at Purdue University, West Lafayette, IN 47907, USA.
While MLL1 activates gene expression in most tissues, ZFP57 represses transcription. MLL1 selectively interacts with a group of nonmethylated DNA sequences known as the MLL1 morphemes. ZFP57 associates with a methylated hexamer (ZFBS), dispersed in the genomic DNA segments known as Imprinted Control Regions (ICRs) and germline Differentially Methylated Regions (gDMRs), to maintain allele-specific gene repression.
View Article and Find Full Text PDFGenomics
July 2017
Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA. Electronic address:
Mammalian genomes include DNA segments that are imprinted (CpG-methylated) only on one of the two parental chromosomes, leading to parent-of-origin-specific gene expression. The process is regulated by Imprinting Control Regions (ICRs) and germline Differentially Methylated Regions (gDMRs). Previously, ZFP57 was shown to recognize a methylated hexanucleotide in ICRs to maintain allele-specific gene repression.
View Article and Find Full Text PDFBiopolymers
September 2015
Department of Chemistry, Purdue University, West Lafayette, IN, 47907.
Previous studies have shown that Mixed Lineage Leukemia 1 (MLL1 or MLL) binds a group of CpG-rich motifs known as morphemes. To examine whether occurrences of MLL1 morphemes in genomic DNA may influence codon utilization, we analyzed the frequency of various 9-mers in human cDNAs and in total human genomic DNA. We uncovered preferential utilization of GGC for Gly, GCG for Ala, CCG for Pro, and TCG for Ser, in coding sequences (CDSs) that included MLL1 morphemes.
View Article and Find Full Text PDFBMC Genomics
December 2013
Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA.
Background: Mixed Lineage Leukemia 1 (MLL1) is a mammalian ortholog of the Drosophila Trithorax. In Drosophila, Trithorax complexes transmit the memory of active genes to daughter cells through interactions with Trithorax Response Elements (TREs). However, despite their functional importance, nothing is known about sequence features that may act as TREs in mammalian genomic DNA.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!