Impact of the MLL1 morphemes on codon utilization and preservation in CpG islands.

Biopolymers

Department of Chemistry, Purdue University, West Lafayette, IN, 47907.

Published: September 2015

Previous studies have shown that Mixed Lineage Leukemia 1 (MLL1 or MLL) binds a group of CpG-rich motifs known as morphemes. To examine whether occurrences of MLL1 morphemes in genomic DNA may influence codon utilization, we analyzed the frequency of various 9-mers in human cDNAs and in total human genomic DNA. We uncovered preferential utilization of GGC for Gly, GCG for Ala, CCG for Pro, and TCG for Ser, in coding sequences (CDSs) that included MLL1 morphemes. We also examined weighted occurrences of CDS 9-mers in a 30-base window that moved along each human chromosome. In plots, we observed peaks with fluctuating intensities. High intensity peaks appeared within promoter and exons localized in CpG islands, encompassing sequences that included MLL1 morphemes. High intensity peaks included CCG/GGC repeats, whose expansion may cause neurological disorders and congenital malformations. Such repeats are generated from overlap of a morpheme (CGCCG/CGGCG), which depending on reading frame and orientation would produce runs of Ala, Gly, or Pro in proteins. Overall, our results point to a role for morpheme occurrences on synonymous codon utilization in human genomic DNA and indicate that regulatory instructions are dispersed not only in promoters but also in exons of human genes.

Download full-text PDF

Source
http://dx.doi.org/10.1002/bip.22681DOI Listing

Publication Analysis

Top Keywords

mll1 morphemes
16
codon utilization
12
genomic dna
12
cpg islands
8
human genomic
8
included mll1
8
high intensity
8
intensity peaks
8
morphemes
5
human
5

Similar Publications

Along the Bos taurus genome, uncover candidate imprinting control regions.

BMC Genomics

June 2022

Department of Chemistry, Purdue University, West Lafayette, IN, 47907, USA.

Background: In mammals, Imprinting Control Regions (ICRs) regulate a subset of genes in a parent-of-origin-specific manner. In both human and mouse, previous studies identified a set of CpG-rich motifs occurring as clusters in ICRs and germline Differentially Methylated Regions (gDMRs). These motifs consist of the ZFP57 binding site (ZFBS) overlapping a subset of MLL binding units known as MLL morphemes.

View Article and Find Full Text PDF

While MLL1 activates gene expression in most tissues, ZFP57 represses transcription. MLL1 selectively interacts with a group of nonmethylated DNA sequences known as the MLL1 morphemes. ZFP57 associates with a methylated hexamer (ZFBS), dispersed in the genomic DNA segments known as Imprinted Control Regions (ICRs) and germline Differentially Methylated Regions (gDMRs), to maintain allele-specific gene repression.

View Article and Find Full Text PDF

Imprinted control regions include composite DNA elements consisting of the ZFP57 binding site overlapping MLL1 morphemes.

Genomics

July 2017

Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA. Electronic address:

Mammalian genomes include DNA segments that are imprinted (CpG-methylated) only on one of the two parental chromosomes, leading to parent-of-origin-specific gene expression. The process is regulated by Imprinting Control Regions (ICRs) and germline Differentially Methylated Regions (gDMRs). Previously, ZFP57 was shown to recognize a methylated hexanucleotide in ICRs to maintain allele-specific gene repression.

View Article and Find Full Text PDF

Impact of the MLL1 morphemes on codon utilization and preservation in CpG islands.

Biopolymers

September 2015

Department of Chemistry, Purdue University, West Lafayette, IN, 47907.

Previous studies have shown that Mixed Lineage Leukemia 1 (MLL1 or MLL) binds a group of CpG-rich motifs known as morphemes. To examine whether occurrences of MLL1 morphemes in genomic DNA may influence codon utilization, we analyzed the frequency of various 9-mers in human cDNAs and in total human genomic DNA. We uncovered preferential utilization of GGC for Gly, GCG for Ala, CCG for Pro, and TCG for Ser, in coding sequences (CDSs) that included MLL1 morphemes.

View Article and Find Full Text PDF

Background: Mixed Lineage Leukemia 1 (MLL1) is a mammalian ortholog of the Drosophila Trithorax. In Drosophila, Trithorax complexes transmit the memory of active genes to daughter cells through interactions with Trithorax Response Elements (TREs). However, despite their functional importance, nothing is known about sequence features that may act as TREs in mammalian genomic DNA.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!