Representing bacteria with unique genomic signatures.

Front Big Data

Department of Computer Science, University of Memphis, Memphis, TN, United States.

Published: November 2022

Classifying or identifying bacteria in metagenomic samples is an important problem in the analysis of metagenomic data. This task can be computationally expensive since microbial communities usually consist of hundreds to thousands of environmental microbial species. We proposed a new method for representing bacteria in a microbial community using genomic signatures of those bacteria. With respect to the microbial community, the genomic signatures of each bacterium are unique to that bacterium; they do not exist in other bacteria in the community. Further, since the genomic signatures of a bacterium are much smaller than its genome size, the approach allows for a compressed representation of the microbial community. This approach uses a modified Bloom filter to store short k-mers with hash values that are unique to each bacterium. We show that most bacteria in many microbiomes can be represented uniquely using the proposed genomic signatures. This approach paves the way toward new methods for classifying bacteria in metagenomic samples.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9709128PMC
http://dx.doi.org/10.3389/fdata.2022.1018356DOI Listing

Publication Analysis

Top Keywords

genomic signatures
20
microbial community
12
community genomic
12
representing bacteria
8
bacteria metagenomic
8
metagenomic samples
8
signatures bacterium
8
unique bacterium
8
bacteria
6
genomic
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!