ECOD domain classification of 48 whole proteomes from AlphaFold Structure Database using DPAM2.

R Dustin Schaeffer Jing Zhang Kirill E Medvedev Lisa N Kinch Qian Cong Nick V Grishin

PLoS Comput Biol

Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America.

Published: February 2024

Protein structure prediction has now been deployed widely across several different large protein sets. Large-scale domain annotation of these predictions can aid in the development of biological insights. Using our Evolutionary Classification of Protein Domains (ECOD) from experimental structures as a basis for classification, we describe the detection and cataloging of domains from 48 whole proteomes deposited in the AlphaFold Database. On average, we can provide positive classification (either of domains or other identifiable non-domain regions) for 90% of residues in all proteomes. We classified 746,349 domains from 536,808 proteins comprised of over 226,424,000 amino acid residues. We examine the varying populations of homologous groups in both eukaryotes and bacteria. In addition to containing a higher fraction of disordered regions and unassigned domains, eukaryotes show a higher proportion of repeated proteins, both globular and small repeats. We enumerate those highly populated domains that are shared in both eukaryotes and bacteria, such as the Rossmann domains, TIM barrels, and P-loop domains. Additionally, we compare the sampling of homologous groups from this whole proteome set against our stable ECOD reference and discuss groups that have been enriched by structure predictions. Finally, we discuss the implication of these results for protein target selection for future classification strategies for very large protein sets.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10927120	PMC
http://dx.doi.org/10.1371/journal.pcbi.1011586	DOI Listing

Publication Analysis

Top Keywords

large protein

protein sets

domains

homologous groups

eukaryotes bacteria

classification

protein

ecod domain

domain classification

classification proteomes

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!

A PHP Error was encountered