Low-complexity sequences are extremely abundant in eukaryotic proteins for reasons that remain unclear. One hypothesis is that they contribute to the formation of novel coding sequences, facilitating the generation of novel protein functions. Here, we test this hypothesis by examining the content of low-complexity sequences in proteins of different age. We show that recently emerged proteins contain more low-complexity sequences than older proteins and that these sequences often form functional domains. These data are consistent with the idea that low-complexity sequences may play a key role in the emergence of novel genes.

Source
http://dx.doi.org/10.1093/molbev/msr263
