A spectral algorithm for fast de novo layout of uncorrected long nanopore reads.

Bioinformatics

CNRS & D.I., UMR 8548, École Normale Supérieure, Paris, France.

Published: October 2017

Motivation: New long read sequencers promise to transform sequencing and genome assembly by producing reads tens of kilobases long. However, their high error rate significantly complicates assembly and requires expensive correction steps to layout the reads using standard assembly engines.

Results: We present an original and efficient spectral algorithm to layout the uncorrected nanopore reads, and its seamless integration into a straightforward overlap/layout/consensus (OLC) assembly scheme. The method is shown to assemble Oxford Nanopore reads from several bacterial genomes into good quality (∼99% identity to the reference) genome-sized contigs, while yielding more fragmented assemblies from the eukaryotic microbe Sacharomyces cerevisiae.

Availability And Implementation: https://github.com/antrec/spectrassembler.

Contact: antoine.recanati@inria.fr.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btx370DOI Listing

Publication Analysis

Top Keywords

nanopore reads
12
spectral algorithm
8
layout uncorrected
8
reads
5
algorithm fast
4
fast novo
4
novo layout
4
uncorrected long
4
long nanopore
4
reads motivation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!