Puzzling over orphan enzymes.

Cell Mol Life Sci

Institut de Génétique et Microbiologie, CNRS UMR 8621, Université Paris Sud, Bâtiment 400, 91405, Orsay Cedex, France.

Published: March 2006

Despite the current availability of several hundred thousand amino acid sequences, more than 39% of the well-defined enzyme activities (EC numbers) are not associated with any sequence in major public databases. This wide gap between knowledge of biochemical function and sequence information is found in nearly all classes of enzymes. Thus, there is an urgent need to explore the 1525 orphan enzymes (EC numbers without associated sequences) in order to progressively bridge this gap. Improving genome annotation could reveal sequences for a significant proportion of these sequenceless enzymes. Peptide mass mapping and further genome mining would help identify the proper sequences for enzymes found in species for which genetic tools are missing. Finally, the whole community must help major public databases begin to address the problem of missing or incomplete information.
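
The definition of an orphan enzyme used in the abstract (an EC number with no associated sequence) amounts to a set difference. The sketch below is only an illustration of that idea, not the authors' procedure: the function names, the placeholder identifiers (`EC_A`, `seq1`, and so on) and the toy data are all hypothetical and are not taken from any database.

```python
def find_orphan_ecs(defined_ecs, sequence_annotations):
    """Return the EC identifiers that no sequence is annotated with.

    defined_ecs: iterable of EC identifiers considered well defined.
    sequence_annotations: dict mapping a sequence ID to the list of
        EC identifiers assigned to that sequence.
    """
    # Collect every EC identifier that appears on at least one sequence.
    annotated = {ec for ecs in sequence_annotations.values() for ec in ecs}
    # Orphans are the defined activities missing from that collection.
    return sorted(set(defined_ecs) - annotated)


def orphan_fraction(defined_ecs, sequence_annotations):
    """Return the fraction of defined EC identifiers that are orphans."""
    defined = set(defined_ecs)
    if not defined:
        raise ValueError("defined_ecs must not be empty")
    return len(find_orphan_ecs(defined, sequence_annotations)) / len(defined)


# Hypothetical toy data: placeholder identifiers, not real EC numbers.
defined = ["EC_A", "EC_B", "EC_C", "EC_D", "EC_E"]
annotations = {
    "seq1": ["EC_A"],
    "seq2": ["EC_C", "EC_A"],
    "seq3": ["EC_E"],
}

orphans = find_orphan_ecs(defined, annotations)
fraction = orphan_fraction(defined, annotations)
print(orphans, fraction)
```

Applied to a real list of EC numbers and real database annotations, the same comparison would give the kind of count and percentage quoted in the abstract, assuming the annotations themselves are complete and correct.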


Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11136189 (PMC)
http://dx.doi.org/10.1007/s00018-005-5520-6 (DOI)

