Analysis of the occurrence of simple amino acid repeats in a large ensemble of prokaryotic and eukaryotic sequences reveals that nearly all amino acids found in the repeats are those whose codon repertoires include aggressively expanding triplets, which belong to the three known pathologically expanding classes: GCU (GCU, CUG, UGC, AGC, GCA, CAG), GCC (GCC, CCG, CGC, GGC, GCG, CGG), and AAG (AAG, AGA, GAA, CTT, TTC, TCT). This is observed especially clearly in the first exons of proteins of higher eukaryotes. The data are interpreted as a manifestation of everlasting triplet expansions, which presumably started at the very origin of the triplet code. The spontaneous expansions continued to occur throughout evolution, leaving their footprints in the protein-coding sequences as still-visible simple amino acid repeats, as preferred triplets encoding the repeats, and as preferred codons in the codon usage tables.
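The six members listed for each class appear to be the three cyclic shifts of the named triplet followed by the three cyclic shifts of its reverse complement. The sketch below illustrates that reading. It is not taken from the paper: the function names are hypothetical, and everything is written in the RNA alphabet, so the last three AAG members come out as CUU, UUC, UCU where the abstract writes CTT, TTC, TCT.

```python
# Illustrative sketch only: function names are hypothetical, not from the paper.
# Assumption: a triplet class = cyclic shifts of the triplet
#             + cyclic shifts of its reverse complement (RNA alphabet).

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(triplet):
    """Return the reverse complement of an RNA triplet."""
    return "".join(COMPLEMENT[base] for base in reversed(triplet))


def cyclic_shifts(triplet):
    """Return all cyclic rotations of a triplet, starting with the triplet itself."""
    return [triplet[i:] + triplet[:i] for i in range(len(triplet))]


def triplet_class(triplet):
    """Return the six class members for a triplet (DNA input is converted to RNA)."""
    rna = triplet.upper().replace("T", "U")
    return cyclic_shifts(rna) + cyclic_shifts(reverse_complement(rna))


if __name__ == "__main__":
    for name in ("GCU", "GCC", "AAG"):
        print(name, triplet_class(name))
```

Under this assumption, `triplet_class("GCU")` and `triplet_class("GCC")` reproduce the member lists given in the abstract in the same order.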
DOI: http://dx.doi.org/10.1007/s00239-010-9425-0
Gene
October 2013
Department of Software Engineering, ORT Braude College, Karmiel, Israel.
We have shown, in a previous paper, that tandemly repeating sequences, especially triplet repeats, play a very important role in gene evolution. This result led to the following hypothesis: most genomic sequences evolved through everlasting acts of tandem repeat expansion with subsequent accumulation of changes. In order to estimate how much of the observed sequence has a repeat origin, we describe the adaptation of a text segmentation algorithm, based on dynamic programming, to the mapping of ancient expansion events.
J Mol Evol
February 2011
Genome Diversity Center, Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel.