Woods: A fast and accurate functional annotator and classifier of genomic and metagenomic sequences.

Genomics

MetaInformatics Laboratory, Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research, Bhopal, Madhya Pradesh, India. Electronic address:

Published: July 2015

Functional annotation of the gigantic metagenomic data is one of the major time-consuming and computationally demanding tasks, which is currently a bottleneck for the efficient analysis. The commonly used homology-based methods to functionally annotate and classify proteins are extremely slow. Therefore, to achieve faster and accurate functional annotation, we have developed an orthology-based functional classifier 'Woods' by using a combination of machine learning and similarity-based approaches. Woods displayed a precision of 98.79% on independent genomic dataset, 96.66% on simulated metagenomic dataset and >97% on two real metagenomic datasets. In addition, it performed >87 times faster than BLAST on the two real metagenomic datasets. Woods can be used as a highly efficient and accurate classifier with high-throughput capability which facilitates its usability on large metagenomic datasets.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ygeno.2015.04.001DOI Listing

Publication Analysis

Top Keywords

metagenomic datasets
12
accurate functional
8
functional annotation
8
real metagenomic
8
metagenomic
6
woods fast
4
fast accurate
4
functional
4
functional annotator
4
annotator classifier
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!