DSNetax: a deep learning species annotation method based on a deep-shallow parallel framework.

Brief Bioinform

School of Artificial Intelligence and Computer Science, Jiangnan university, Wuxi, Jiangsu 214122, China.

Published: March 2024

Microbial community analysis is an important field to study the composition and function of microbial communities. Microbial species annotation is crucial to revealing microorganisms' complex ecological functions in environmental, ecological and host interactions. Currently, widely used methods can suffer from issues such as inaccurate species-level annotations and time and memory constraints, and as sequencing technology advances and sequencing costs decline, microbial species annotation methods with higher quality classification effectiveness become critical. Therefore, we processed 16S rRNA gene sequences into k-mers sets and then used a trained DNABERT model to generate word vectors. We also design a parallel network structure consisting of deep and shallow modules to extract the semantic and detailed features of 16S rRNA gene sequences. Our method can accurately and rapidly classify bacterial sequences at the SILVA database's genus and species level. The database is characterized by long sequence length (1500 base pairs), multiple sequences (428,748 reads) and high similarity. The results show that our method has better performance. The technique is nearly 20% more accurate at the species level than the currently popular naive Bayes-dominated QIIME 2 annotation method, and the top-5 results at the species level differ from BLAST methods by <2%. In summary, our approach combines a multi-module deep learning approach that overcomes the limitations of existing methods, providing an efficient and accurate solution for microbial species labeling and more reliable data support for microbiology research and application.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11007113PMC
http://dx.doi.org/10.1093/bib/bbae157DOI Listing

Publication Analysis

Top Keywords

species annotation
12
species level
12
annotation method
8
microbial species
8
16s rrna
8
rrna gene
8
gene sequences
8
species
6
dsnetax deep
4
deep learning
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!