Ragout-a reference-assisted assembly tool for bacterial genomes.

Bioinformatics

St. Petersburg University of the Russian Academy of Sciences, Bioinformatics Institute, St. Petersburg, Russia, UCSC, 1156 High Street, Santa Cruz, CA and Department of Computer Science and Engineering, UCSD, 9500 Gilman Drive, La Jolla, CA, USA.

Published: June 2014

Summary: Bacterial genomes are simpler than mammalian ones, yet assembling them from the data currently generated by high-throughput short-read sequencing machines still results in hundreds of contigs. To improve assembly quality, recent studies have used longer Pacific Biosciences (PacBio) reads or jumping libraries to connect contigs into larger scaffolds or to help assemblers resolve ambiguities in repetitive regions of the genome. However, the use of these technologies in contemporary genomic research is still limited by their high cost and error rates. In this work, we explore the possibility of improving assemblies by using complete genomes from closely related species or strains. We present Ragout, a genome rearrangement approach, to address this problem. In contrast with most reference-guided algorithms, which use only one reference genome, Ragout uses multiple references along with the evolutionary relationships among them to determine the correct order of the contigs. Additionally, Ragout uses the assembly graph and multi-scale synteny blocks to reduce assembly gaps caused by small contigs from the input assembly. Based on results from simulated and real datasets, we believe that for common bacterial species, for which many complete genome sequences from related strains are available, the current high-throughput short-read sequencing paradigm is sufficient to obtain a single high-quality scaffold for each chromosome.

Availability: The Ragout software is freely available at: https://github.com/fenderglass/Ragout.
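
The idea in the summary of ordering contigs with several references, weighted by how closely related each reference is, can be illustrated with a toy sketch. This is not Ragout's algorithm (the summary describes a genome rearrangement approach that also uses the assembly graph and synteny blocks). It is a simplified weighted adjacency vote followed by a greedy join, and every name, weight, and contig order below is a hypothetical assumption made for illustration. Contig orientation is ignored.

```python
from collections import defaultdict


def vote_adjacencies(references, weights):
    """Sum, for each unordered contig pair, the weights of the references
    in which the two contigs are neighbours. Weights are an assumed proxy
    for evolutionary closeness to the target (hypothetical values)."""
    scores = defaultdict(float)
    for name, order in references.items():
        for a, b in zip(order, order[1:]):
            if a != b:
                scores[frozenset((a, b))] += weights[name]
    return scores


def greedy_scaffold(contigs, scores):
    """Join contigs by descending adjacency score, keeping every contig at
    degree <= 2 and refusing joins that would close a cycle, so that each
    resulting component is a linear chain (a scaffold)."""
    degree = {c: 0 for c in contigs}
    parent = {c: c for c in contigs}  # union-find forest for cycle checks
    neighbours = defaultdict(list)

    def find(c):
        while parent[c] != c:
            parent[c] = parent[parent[c]]
            c = parent[c]
        return c

    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], sorted(kv[0])))
    for pair, _score in ranked:
        a, b = sorted(pair)
        if a not in degree or b not in degree:
            continue  # contig not present in the target assembly
        if degree[a] >= 2 or degree[b] >= 2 or find(a) == find(b):
            continue
        parent[find(a)] = find(b)
        degree[a] += 1
        degree[b] += 1
        neighbours[a].append(b)
        neighbours[b].append(a)

    # Walk each chain from one of its endpoints (degree 0 or 1).
    scaffolds, seen = [], set()
    for start in sorted(contigs):
        if start in seen or degree[start] == 2:
            continue
        chain, prev, cur = [start], None, start
        seen.add(start)
        while True:
            nxt = [n for n in neighbours[cur] if n != prev]
            if not nxt:
                break
            prev, cur = cur, nxt[0]
            chain.append(cur)
            seen.add(cur)
        scaffolds.append(chain)
    return scaffolds


# Hypothetical input: contig orders as seen in three reference genomes.
references = {
    "ref_A": ["c1", "c2", "c3", "c4"],
    "ref_B": ["c1", "c3", "c2", "c4"],
    "ref_C": ["c1", "c2", "c3", "c4"],
}
weights = {"ref_A": 1.0, "ref_B": 0.5, "ref_C": 0.5}  # assumed closeness
contigs = ["c1", "c2", "c3", "c4", "c5"]  # c5 has no reference support

scores = vote_adjacencies(references, weights)
scaffolds = greedy_scaffold(contigs, scores)
print(scaffolds)
```

In this toy input the rearranged order in ref_B is outvoted by the two references that agree, and the unsupported contig c5 is left as its own scaffold rather than being placed arbitrarily.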

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058940
DOI: http://dx.doi.org/10.1093/bioinformatics/btu280
