Protein domains of low sequence complexity-dark matter of the proteome.

Genes Dev

Department of Biochemistry, UT Southwestern Medical Center, Dallas, Texas 75390-9152, USA

Published: April 2024

This perspective begins with a speculative consideration of the properties of the earliest proteins to appear during evolution. What did these primitive proteins look like, and how were they of benefit to early forms of life? I proceed to hypothesize that primitive proteins have been preserved through evolution and now serve diverse functions important to the dynamics of cell morphology and biological regulation. The primitive nature of these modern proteins is easy to spot. They are composed of a limited subset of the 20 amino acids used by traditionally evolved proteins and thus are of low sequence complexity. This chemical simplicity limits protein domains of low sequence complexity to forming only a crude and labile type of protein structure currently hidden from the computational powers of machine learning. I conclude by hypothesizing that this structural weakness represents the underlying virtue of proteins that, at least for the moment, constitute the dark matter of the proteome.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11065162PMC
http://dx.doi.org/10.1101/gad.351465.123DOI Listing

Publication Analysis

Top Keywords

low sequence
12
protein domains
8
domains low
8
matter proteome
8
primitive proteins
8
sequence complexity
8
proteins
6
sequence complexity-dark
4
complexity-dark matter
4
proteome perspective
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!