Enhancing metagenomic classification with compression-based features.

Artif Intell Med

IEETA-DETI, LASI, Aveiro University, Aveiro, Portugal.

Published: October 2024

Metagenomics is a rapidly expanding field that uses next-generation sequencing technology to analyze the genetic makeup of environmental samples. However, accurately identifying the organisms in a metagenomic sample can be complex, and traditional reference-based methods may not be effective enough in some instances. In this study, we present a novel approach for metagenomic identification that uses data compressors as a source of features for taxonomic classification. By evaluating a comprehensive set of compressors, both general-purpose and genomic-specific, we demonstrate the effectiveness of this method in accurately identifying organisms in metagenomic samples. The results indicate that using features from multiple compressors can help identify taxonomy. The method achieved an overall accuracy of 95% on an imbalanced dataset that includes classes with limited samples. The study also showed that the correlation between compression and classification is insignificant, highlighting the need for a multi-faceted approach to metagenomic identification. This approach offers a significant advancement in the field of metagenomics, providing a reference-less method for taxonomic identification that is both effective and efficient while revealing insights into the statistical and algorithmic nature of genomic data. The code to validate this study is publicly available at https://github.com/ieeta-pt/xgTaxonomy.
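The idea of using compressors as features can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes only three general-purpose compressors from the Python standard library (zlib, bz2, lzma) and omits the genomic-specific compressors evaluated in the study. The function name `compression_features` and the choice of bits per base as the feature are hypothetical, picked for illustration.

```python
import bz2
import lzma
import zlib
from typing import Callable, Dict

# Assumed compressor set: standard-library, general-purpose compressors only.
# The study also evaluates genomic-specific compressors, which are not shown here.
COMPRESSORS: Dict[str, Callable[[bytes], bytes]] = {
    "zlib": lambda data: zlib.compress(data, 9),
    "bz2": lambda data: bz2.compress(data, 9),
    "lzma": lambda data: lzma.compress(data),
}


def compression_features(sequence: str) -> Dict[str, float]:
    """Return one feature per compressor: compressed bits per input base.

    Hypothetical helper for illustration; lower values mean the compressor
    found more regularity in the sequence.
    """
    raw = sequence.upper().encode("ascii")
    if not raw:
        raise ValueError("sequence must not be empty")
    return {
        name: 8 * len(compress(raw)) / len(raw)
        for name, compress in COMPRESSORS.items()
    }


if __name__ == "__main__":
    # A highly repetitive toy sequence; real inputs would be genome sequences.
    features = compression_features("ACGT" * 500)
    for name, bits_per_base in features.items():
        print(f"{name}: {bits_per_base:.3f} bits/base")
```

Each sequence then yields a small feature vector (one value per compressor) that could be passed, alongside its taxonomic label, to any standard classifier. Which classifier to use, and which additional features to include, is left open in this sketch.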

Source
http://dx.doi.org/10.1016/j.artmed.2024.102948


