Retrieving Good-Quality Genomes From the GenBank Database Using a Python Tool, SalmoDEST.

Bioinform Biol Insights

Salmonella and Listeria Unit, Laboratory for Food Safety, ANSES, Maisons-Alfort, France.

Published: February 2022

With the advent of next-generation whole-genome sequencing (WGS), the need for good-quality and well-characterised genomes has increased over the past years. Good-quality complete genomes are often required for assembly reference mapping or phylogenetic single nucleotide polymorphism (SNP) analysis. Complete genomes or contigs from specific sources or serovars are also searched for clustering analysis or source attribution studies. Therefore, new bioinformatics tools are needed for the extraction of good-quality and well-characterised genomes from public databases. Here, we developed SalmoDEST, an open-source Python tool capable of extracting genomes with a coverage higher than 50x and genome length over 4Mb from the GenBank database in the form of complete genomes or contigs, with verification of the serovar to which they belong and identification of the corresponding multi locus sequence type (MLST) profile. To validate the ability to SalmoDEST to screen for and retrieve genomes of good quality, we compared our results for S. Typhi complete genome with those available in the literature and extracted genomes from bovine sources strains isolated worldwide. Finally, we provide in this study a list of 239 complete genomes for 123 serovars of of high quality. SalmoDEST is a handy and easy-to-use open-source tool to extract complete genomes or contigs that can be routinely used in public health, food safety and research laboratories. SalmoDEST (SALMOnella Download gEnome Serotype sT) is available at https://github.com/I-Guy/SalmoDEST.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8874161PMC
http://dx.doi.org/10.1177/11779322221080264DOI Listing

Publication Analysis

Top Keywords

complete genomes
20
genomes contigs
12
genomes
11
genbank database
8
python tool
8
good-quality well-characterised
8
well-characterised genomes
8
complete
6
salmodest
5
retrieving good-quality
4

Similar Publications

Background: Paeonia lactiflora Pall., a member of Paeoniaceae family, is a medicinal herb widely used in traditional Chinese medicine. Chloroplasts are multifunctional organelles containing distinct genetic material.

View Article and Find Full Text PDF

Complete genome sequences of the KACC 18744, KACC 18716, and KACC 19094.

Microbiol Resour Announc

January 2025

Agricultural Microbiology Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju-gun, South Korea.

We report the whole genome sequences of KACC 18744, KACC 18716, and KACC 19094, to investigate the genomic diversity of bacterial type strains distributed in Korea.

View Article and Find Full Text PDF

Colistin resistance threatens global health as it compromises the effectiveness of a last-resort antibiotic. We present the complete genome sequence of ST462, which carries the gene, isolated from a pediatric diarrhea case in southern Vietnam. The 5,049,362 bp genome contains 24 resistance genes distributed across 107 contigs.

View Article and Find Full Text PDF

The complete genome sequence of , a goldthread anthracnose pathogen, was sequenced using PacBio Revio and MGI DNBSEQ-T7 PE150. It contains 10 chromosomes, 5 mini chromosomes, a circular mitochondrial chromosome, and 13,129 genes predicted with RNA-Seq data in a 52.13-Mb genome with an of 5.

View Article and Find Full Text PDF

Complete genome sequence of a polyhydroxyalkaonate-accumulating bacterium, HS-12-14, isolated from a Korean hot spring.

Microbiol Resour Announc

January 2025

Department of Bioscience and Research Center for Extremophiles and Marine Microbiology, Silla University, Busan, South Korea.

We present the complete genome sequence of polyhydroxyalkaonate-accumulating moderately thermophilic HS-12-14 strain, isolated from a Korean hot spring. These findings contribute to valuable insights into the biosynthesis of polyhydroxyalkaonates in thermophiles and enhance understanding of strains.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!