Accurate self-correction of errors in long reads using de Bruijn graphs.

Bioinformatics

Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland.

Published: March 2017

Motivation: New long read sequencing technologies, like PacBio SMRT and Oxford NanoPore, can produce sequencing reads up to 50 000 bp long but with an error rate of at least 15%. Reducing the error rate is necessary for subsequent utilization of the reads in, e.g. de novo genome assembly. The error correction problem has been tackled either by aligning the long reads against each other or by a hybrid approach that uses the more accurate short reads produced by second generation sequencing technologies to correct the long reads.

Results: We present an error correction method that uses long reads only. The method consists of two phases: first, we use an iterative alignment-free correction method based on de Bruijn graphs with increasing length of k -mers, and second, the corrected reads are further polished using long-distance dependencies that are found using multiple alignments. According to our experiments, the proposed method is the most accurate one relying on long reads only for read sets with high coverage. Furthermore, when the coverage of the read set is at least 75×, the throughput of the new method is at least 20% higher.

Availability And Implementation: LoRMA is freely available at http://www.cs.helsinki.fi/u/lmsalmel/LoRMA/ .

Contact: leena.salmela@cs.helsinki.fi.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5351550PMC
http://dx.doi.org/10.1093/bioinformatics/btw321DOI Listing

Publication Analysis

Top Keywords

long reads
16
reads
8
bruijn graphs
8
sequencing technologies
8
error rate
8
error correction
8
correction method
8
long
7
method
5
accurate self-correction
4

Similar Publications

Short- and long-range roles of UNC-6/Netrin in dorsal-ventral axon guidance in vivo in Caenorhabditis elegans.

PLoS Genet

January 2025

Department of Molecular Biosciences, Program in Molecular, Cellular, and Developmental Biology, KU Center for Genomics, University of Kansas, Lawrence, Kansas, United States of America.

Recent studies in vertebrates and Caenorhabditis elegans have reshaped models of how the axon guidance cue UNC-6/Netrin functions in dorsal-ventral axon guidance, which was traditionally thought to form a ventral-to-dorsal concentration gradient that was actively sensed by growing axons. In the vertebrate spinal cord, floorplate Netrin1 was shown to be largely dispensable for ventral commissural growth. Rather, short range interactions with Netrin1 on the ventricular zone radial glial stem cells was shown to guide ventral commissural axon growth.

View Article and Find Full Text PDF

The Potential of Poetry in Mental Health Nurse Pre-registration Education.

Int J Ment Health Nurs

February 2025

School of Health Sciences, University of Nottingham, Nottingham, UK.

This paper examines the potential of poetry as a resource within mental health nurse pre-registration education. There has long been a debate as to whether the art or the science of nursing should be foregrounded within pre-registration education, especially in the UK within recent years as the latest Nursing and Midwifery Council's standards of pre-registration education appear to have shifted the focus towards the acquisition of skills, giving less consideration to the holistic transformatory process of education. The paper uses the conceptualisation of education by Beista, who proposes that education can be considered in relation to the three domains of qualification, socialisation and subjectification.

View Article and Find Full Text PDF

Walnut PR10/Bet v1-like proteins interact with chitinase in response to anthracnose stress.

J Evol Biol

January 2025

Laboratory of Walnut Research Center, College of Forestry, Northwest A & F University, Yangling, 712100 Shaanxi, China.

Walnut is a significant woody oil tree species that has been increasingly affected by anthracnose in recent years. Effective anthracnose control is crucial for walnut yield and quality, which requires a comprehensive understanding of the response mechanisms to Colletotrichum gloeosporioides. The PR10/Bet v1-like proteins are involved in defense against various disease, therefore, in this study, 7 JrBet v1s were identified from the walnut transcriptome (named JrBet v1-1~1-7), whose open reading frame (ORF) was 414~483 bp in length with isoelectric point ranging from 4.

View Article and Find Full Text PDF

Objective: Multiple-Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) is widely used to subtype pathogens causing foodborne and waterborne disease outbreaks. The MLVAType shiny application was previously designed to extract MLVA profiles of Vibrio cholerae isolates from whole-genome sequencing (WGS) data, and provide backward compatibility with traditional MLVA typing methods. The previous development and validation work was conducted using short (pair-end 300 and 150 nt long) reads from Illumina MiSeq and Hiseq sequencing.

View Article and Find Full Text PDF

Echocardiography of the right heart in pulmonary arterial hypertension: insights from the ULTRA RIGHT VALUE study.

Eur Heart J Imaging Methods Pract

January 2025

Department of Clinical Internal, Anesthesiological and Cardiovascular Sciences, Sapienza University of Rome, Viale del Policlinico 155, Rome 00161, Italy.

Aims: Outcome in pulmonary arterial hypertension (PAH) is determined by right ventricular (RV) function adaptation to increased afterload. Echocardiography is easily available to assist bedside evaluation of the RV. However, no agreement exists about the feasibility and most relevant measurements.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!