Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes.

Javier Tamames, Marta Cobo-Simón, Fernando Puente-Sánchez

BMC Genomics

Systems Biology Department, Centro Nacional de Biotecnología, CSIC, C/Darwin 3, 28049, Madrid, Spain.

Published: December 2019

Background: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is how taxonomic and functional assignment is performed: either by using assembly algorithms, or by directly analysing raw sequence reads through homology searching, k-mer analysis, or detection of marker genes. Many instances of each approach can be found in the literature, but to the best of our knowledge no evaluation of their relative performance has been carried out, and we question whether their results are comparable.

Results: We have analysed several real and mock metagenomes using different methodologies and tools, and compared the resulting taxonomic and functional profiles. Our results show that database completeness (the representation of diverse organisms and taxa in it) is the main factor determining the performance of methods that rely on direct read assignment, whether by homology, k-mer composition or similarity to marker genes, while methods that rely on assembly and assignment of predicted genes are most influenced by metagenome size, which in turn determines the completeness of the assembly (the percentage of reads that were assembled).

Conclusions: Although differences exist, taxonomic profiles are rather similar between raw read assignment and assembly assignment methods, while they diverge more for methods based on k-mers and marker genes. Regarding functional annotation, analysis of raw reads retrieves more functions, but it also makes a substantial number of over-predictions. Assembly methods become more advantageous as the size of the metagenome grows.
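
As an illustration only, the two quantities the abstract relies on can be sketched in a few lines of Python: assembly completeness, defined above as the percentage of reads that were assembled, and a numerical comparison of two taxonomic profiles. The function names, the example counts, and the choice of Bray-Curtis dissimilarity are assumptions made for this sketch; the abstract does not state which similarity measure the authors used.

```python
def assembly_completeness(assembled_reads: int, total_reads: int) -> float:
    """Percentage of reads that were assembled (hypothetical helper)."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return 100.0 * assembled_reads / total_reads


def bray_curtis(profile_a: dict, profile_b: dict) -> float:
    """Bray-Curtis dissimilarity between two taxon -> count profiles.

    Each profile is first normalised to relative abundances, so the
    result is 0.0 for identical profiles and 1.0 for profiles that
    share no taxa. This measure is an assumption of the sketch, not
    necessarily the one used in the paper.
    """
    total_a = sum(profile_a.values())
    total_b = sum(profile_b.values())
    if total_a <= 0 or total_b <= 0:
        raise ValueError("profiles must contain positive counts")
    taxa = set(profile_a) | set(profile_b)
    # With relative abundances, Bray-Curtis reduces to 1 - sum of minima.
    shared = sum(
        min(profile_a.get(t, 0) / total_a, profile_b.get(t, 0) / total_b)
        for t in taxa
    )
    return 1.0 - shared


# Made-up counts, for illustration only.
raw_read_profile = {"Proteobacteria": 60, "Firmicutes": 40}
assembly_profile = {"Proteobacteria": 30, "Firmicutes": 10, "Bacteroidetes": 10}

completeness = assembly_completeness(assembled_reads=750, total_reads=1000)
dissimilarity = bray_curtis(raw_read_profile, assembly_profile)
```

A low dissimilarity between the raw-read and assembly profiles would correspond to the "rather similar" outcome described in the conclusions, while a larger value would correspond to the divergence reported for k-mer and marker-gene methods.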

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902526
DOI: http://dx.doi.org/10.1186/s12864-019-6289-6
