A gold standard set of mechanistically diverse enzyme superfamilies.

Genome Biol

Department of Biopharmaceutical Sciences, University of California, 1700 4th Street, San Francisco, CA 94143-2550, USA.

Published: August 2006

Superfamily and family analyses provide an effective tool for the functional classification of proteins, but must be automated for use on large datasets. We describe a 'gold standard' set of enzyme superfamilies, clustered according to specific sequence, structure, and functional criteria, for use in the validation of family and superfamily clustering methods. The gold standard set represents four fold classes and differing clustering difficulties, and includes five superfamilies, 91 families, 4,887 sequences and 282 structures.
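
As an illustration of how a gold standard set of this kind might be used for validation, the sketch below scores a predicted clustering against reference family labels using pairwise precision and recall. The abstract does not say which evaluation metric the authors use, so this metric is an assumption; the function name `pairwise_scores`, the sequence IDs, and the family labels are hypothetical toy values rather than data from the set.

```python
from itertools import combinations


def pairwise_scores(gold, predicted):
    """Return (precision, recall) computed over pairs of sequences.

    gold, predicted: dicts mapping sequence ID -> cluster label.
    Only IDs present in both mappings are scored.
    """
    ids = sorted(set(gold) & set(predicted))
    tp = fp = fn = 0
    for a, b in combinations(ids, 2):
        same_gold = gold[a] == gold[b]
        same_pred = predicted[a] == predicted[b]
        if same_pred and same_gold:
            tp += 1  # pair correctly placed together
        elif same_pred:
            fp += 1  # pair merged although the gold standard separates it
        elif same_gold:
            fn += 1  # pair split although the gold standard groups it
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy data: two gold families, one over-merged predicted cluster.
gold = {"s1": "famA", "s2": "famA", "s3": "famB", "s4": "famB"}
predicted = {"s1": 0, "s2": 0, "s3": 0, "s4": 1}
precision, recall = pairwise_scores(gold, predicted)
```

A method that merges distinct families loses precision, while one that splits a family loses recall, so reporting both separates the two failure modes. The pair loop is quadratic in the number of sequences, which should remain manageable for a set of a few thousand sequences.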

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1431709
DOI: http://dx.doi.org/10.1186/gb-2006-7-1-r8
