Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences.

PLoS One

Department of Chemical Biology and Applied Chemistry, Nihon University, College of Engineering, Koriyama, Fukushima, Japan.

Published: April 2023

AI Article Synopsis

  • Bacterial phylogenetic studies often rely on 16S rRNA gene sequences but face challenges like mosaicism and difficulties in distinguishing closely related species; this study aims to improve phylogenetic analysis using K-mer profiles.
  • The research successfully differentiated Escherichia albertii from E. coli and Shigella by analyzing pentanucleotide frequencies, and established a correlation between phylogenetic trees of Ipomoea species and their morphological similarities.
  • The study introduced a new R application called Phy5, which allows users to create phylogenetic trees based on pentamer profiles, available online and as a command line tool.

Article Abstract

Bacterial phylogenetic analyses are commonly performed to explore the evolutionary relationships among various bacterial species and genera based on their 16S rRNA gene sequences; however, these results are limited by mosaicism, intragenomic heterogeneity, and difficulties in distinguishing between related species. In this study, we aimed to perform genome-wide comparisons of different bacterial species, namely Escherichia coli, Shigella, Yersinia, Klebsiella, and Neisseria spp., based on their K-mer profiles to construct phylogenetic trees. Pentanucleotide frequency analyses (512 patterns of 5 nucleotides each) were performed to distinguish between highly similar species. Moreover, Escherichia albertii strains were clearly distinguished from E. coli and Shigella, despite being closely related to enterohemorrhagic E. coli in the phylogenetic tree. In addition, our phylogenetic tree of Ipomoea species based on pentamer frequency in chloroplast genomes was correlated with previously reported morphological similarities. Furthermore, a support vector machine clearly classified E. coli and Shigella genomes based on their pentanucleotide profiles. These results suggest that phylogenetic analyses based on penta- or hexamer profiles are a useful methodology for microbial phylogenetic studies. In addition, we introduced an R application, Phy5, which generates a phylogenetic tree based on genome-wide comparisons of pentamer profiles. The online version of Phy5 can be accessed at https://phy5.shinyapps.io/Phy5R/ and its command line version Phy5cli can be downloaded at https://github.com/YoshioNakano2021/phy5.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10118083PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0268847PLOS

Publication Analysis

Top Keywords

bacterial species
12
coli shigella
12
phylogenetic tree
12
phylogenetic
8
phylogenetic trees
8
phylogenetic analyses
8
genome-wide comparisons
8
species escherichia
8
based
7
species
6

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!