Low-complexity sequences are extremely abundant in eukaryotic proteins for reasons that remain unclear. One hypothesis is that they contribute to the formation of novel coding sequences, facilitating the generation of novel protein functions. Here, we test this hypothesis by examining the content of low-complexity sequences in proteins of different ages. We show that recently emerged proteins contain more low-complexity sequences than older proteins and that these sequences often form functional domains. These data are consistent with the idea that low-complexity sequences may play a key role in the emergence of novel genes.
DOI: http://dx.doi.org/10.1093/molbev/msr263