PC-mer: An Ultra-fast memory-efficient tool for metagenomics profiling and classification.

Saeedeh Akbari Rokn Abadi Amirhossein Mohammadi Somayyeh Koohi

PLoS One

Department of Computer Engineering, Sharif University of Technology, Tehran, Iran.

Published: August 2024

Features extraction methods, such as k-mer-based methods, have recently made up a significant role in classifying and analyzing approaches for metagenomics data. But, they are challenged by various bottlenecks, such as performance limitations, high memory consumption, and computational overhead. To deal with these challenges, we developed an innovative features extraction and sequence profiling method for DNA/RNA sequences, called PC-mer, taking advantage of the physicochemical properties of nucleotides. PC-mer in comparison with the k-mer profiling methods provides a considerable memory usage reduction by a factor of 2k while improving the metagenomics classification performance, for both machine learning-based and computational-based methods, at the various levels and also archives speedup more than 1000x for the training phase. Examining ML-based PC-mer on various datasets confirms that it can achieve 100% accuracy in classifying samples at the class, order, and family levels. Despite the k-mer-based classification methods, it also improves genus-level classification accuracy by more than 14% for shotgun dataset (i.e. achieves accuracy of 97.5%) and more than 5% for amplicon dataset (i.e. achieves accuracy of 98.6%). Due to these improvements, we provide two PC-mer-based tools, which can actually replace the popular k-mer-based tools: one for classifying and another for comparing metagenomics data.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11293629	PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0307279	PLOS

Publication Analysis

Top Keywords

features extraction

metagenomics data

dataset achieves

achieves accuracy

methods

pc-mer

pc-mer ultra-fast

ultra-fast memory-efficient

memory-efficient tool

metagenomics

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!