Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.

Genomics

Joint Institute for Food Safety and Applied Nutrition, Center for Food Safety and Security Systems, University of Maryland, College Park, MD 20742, USA; Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA.

Published: May 2021

Oxford Nanopore sequencing has been widely used to obtain complete genomes of bacterial pathogens. However, the error rates of Oxford Nanopore long reads are high. Various polishing algorithms have been developed that use Illumina short reads to correct the errors in Oxford Nanopore long-read assemblies. This study evaluated, using both simulated and real reads, how polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads improves genomic analyses. Ten species (10 strains) were selected for simulated reads, while real reads were tested on 11 species (11 strains). Oxford Nanopore long reads were assembled with Unicycler to produce a draft assembly, followed by three rounds of polishing with Illumina short reads using two polishing tools, Pilon and NextPolish. One round of NextPolish polishing yielded genome completeness and accuracy parameters similar to those of the reference genomes, whereas two or three rounds of Pilon polishing were needed to do so; contiguity remained unchanged after polishing. The polished assemblies of Escherichia coli O157:H7, Salmonella Typhimurium, and Cronobacter sakazakii with simulated reads did not provide accurate plasmid identifications. One round of NextPolish polishing was needed to accurately identify plasmids in Staphylococcus aureus and E. coli O26:H11 with real reads, whereas one and two rounds of Pilon polishing were necessary for these two strains, respectively. Polishing failed to provide an accurate antimicrobial resistance (AMR) genotype for S. aureus with real reads. One round of polishing recovered an accurate AMR genotype for Klebsiella pneumoniae with real reads. For Citrobacter braakii with real reads, the reference genome and the draft assembly differed: the former carried blaCMY-83 and the latter fosA6, while both genes were present after one round of polishing. However, polishing did not improve the assembly of E. coli O26:H11 with real reads enough to yield a number of virulence genes similar to that of the reference genome. The draft and polished assemblies showed a phylogenetic tree topology comparable with that of the reference genomes. For multilocus sequence typing and pan-genome analyses, one round of NextPolish polishing was sufficient to obtain accurate results, while two or three rounds of Pilon polishing were needed. Overall, NextPolish outperformed Pilon for polishing the Oxford Nanopore long-read assemblies of bacterial pathogens, though both polishing strategies improved genomic analyses compared to the draft assemblies.
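
The iterative polishing described in the abstract (a draft assembly refined over three rounds with Illumina short reads) can be sketched as a small command planner. This is a minimal illustration under stated assumptions, not the authors' pipeline: the abstract does not give the commands or parameters used, the file names are hypothetical, short reads are assumed to be paired-end and mapped with BWA-MEM and SAMtools, and Pilon is assumed to be callable through a `pilon` wrapper script (some installations use `java -jar pilon.jar` instead). The sketch only builds the command strings and runs nothing.

```python
from typing import List, Tuple


def pilon_round(draft: str, r1: str, r2: str, round_no: int,
                threads: int = 4) -> Tuple[List[str], str]:
    """Build the shell commands for one Pilon polishing round.

    Returns the commands and the name of the polished FASTA that Pilon
    writes for the given output prefix. All file names are hypothetical.
    """
    prefix = f"round{round_no}"
    bam = f"{prefix}.sorted.bam"
    commands = [
        # Index the assembly that is being polished in this round.
        f"bwa index {draft}",
        # Map the paired-end short reads and sort the alignments.
        f"bwa mem -t {threads} {draft} {r1} {r2} | "
        f"samtools sort -@ {threads} -o {bam} -",
        f"samtools index {bam}",
        # Polish; Pilon writes <prefix>.fasta (and <prefix>.changes).
        f"pilon --genome {draft} --frags {bam} --output {prefix} --changes",
    ]
    return commands, f"{prefix}.fasta"


def plan_polishing(draft: str, r1: str, r2: str,
                   rounds: int = 3) -> Tuple[List[str], str]:
    """Chain several rounds: each round polishes the previous round's output."""
    commands: List[str] = []
    current = draft
    for n in range(1, rounds + 1):
        round_cmds, current = pilon_round(current, r1, r2, n)
        commands.extend(round_cmds)
    return commands, current


if __name__ == "__main__":
    cmds, final_fasta = plan_polishing("draft.fasta", "R1.fastq.gz", "R2.fastq.gz")
    for cmd in cmds:
        print(cmd)
    print("final assembly:", final_fasta)
```

The point of the design is that each round re-maps the short reads against the previous round's output, so the effect of one, two, or three rounds can be compared by evaluating the intermediate FASTA files separately.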

Source
http://dx.doi.org/10.1016/j.ygeno.2021.03.018
