Confronting the catalytic dark matter encoded by sequenced genomes.

Nucleic Acids Res

Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg.

Published: November 2017

The post-genomic era has provided researchers with a deluge of protein sequences. However, a significant fraction of the proteins encoded by sequenced genomes remains without an identified function. Here, we aim at determining how many enzymes of uncertain or unknown function are still present in the Saccharomyces cerevisiae and human proteomes. Using information available in the Swiss-Prot, BRENDA and KEGG databases in combination with a Hidden Markov Model-based method, we estimate that >600 yeast and 2000 human proteins (>30% of their proteins of unknown function) are enzymes whose precise function(s) remain(s) to be determined. This illustrates the impressive scale of the 'unknown enzyme problem'. We extensively review classical biochemical as well as more recent systematic experimental and computational approaches that can be used to support enzyme function discovery research. Finally, we discuss the possible roles of the elusive catalysts in light of recent developments in the fields of enzymology and metabolism as well as the significance of the unknown enzyme problem in the context of metabolic modeling, metabolic engineering and rare disease research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5714238PMC
http://dx.doi.org/10.1093/nar/gkx937DOI Listing

Publication Analysis

Top Keywords

encoded sequenced
8
sequenced genomes
8
unknown function
8
confronting catalytic
4
catalytic dark
4
dark matter
4
matter encoded
4
genomes post-genomic
4
post-genomic era
4
era provided
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!