MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.

Bioinformatics

HKU-BGI Bioinformatics Algorithms Research Laboratory & Department of Computer Science, University of Hong Kong, Hong Kong, L3 Bioinformatics Limited, Hong Kong and National Institute of Informatics, Chiyoda-ku, Tokyo, Japan.

Published: May 2015

MEGAHIT is an NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset of 252 Gbp (giga base pairs) in 44.1 h and 99.6 h on a single computing node with and without a graphics processing unit, respectively. MEGAHIT assembles the data as a whole, i.e. no pre-processing such as partitioning or normalization is needed. When compared with previous methods on assembling the soil data, MEGAHIT generated a threefold larger assembly, with a longer contig N50 and average contig length; furthermore, 55.8% of the reads aligned to the assembly, a fourfold improvement.
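The abstract compares assemblies by contig N50 and average contig length. As a minimal sketch of how those two statistics are conventionally computed (this is not code from MEGAHIT, and the function names and the example contig lengths are hypothetical), N50 is taken here as the length L such that contigs of length L or longer together cover at least half of the total assembly size:

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Conventional definition (assumed here): sort contigs from longest to
    shortest and return the length of the contig at which the running
    total first reaches half of the summed assembly length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point rounding on total / 2.
        if 2 * running >= total:
            return length


def mean_length(lengths):
    """Return the average contig length."""
    if not lengths:
        raise ValueError("need at least one contig length")
    return sum(lengths) / len(lengths)


# Hypothetical contig lengths in base pairs, for illustration only.
contigs = [100, 200, 300, 400, 500]
print(n50(contigs), mean_length(contigs))
```

Because N50 is weighted toward long contigs, an assembly can raise its N50 without raising its average contig length, which is why the two figures are reported side by side.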

Source
http://dx.doi.org/10.1093/bioinformatics/btv033
