COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

Nucleic Acids Res

Laboratory of Retrovirology, CRP-Santé, 84, Val Fleuri, L-1526, Luxembourg.

Published: October 2014

Viral sequence classification has wide applications in clinical, epidemiological, structural and functional categorization studies. Most existing approaches rely on an initial alignment step followed by classification based on phylogenetic or statistical algorithms. Here we present an ultrafast alignment-free subtyping tool for human immunodeficiency virus type one (HIV-1) adapted from Prediction by Partial Matching compression. This tool, named COMET, was compared to the widely used phylogeny-based REGA and SCUEAL tools using synthetic and clinical HIV data sets (1,090,698 and 10,625 sequences, respectively). COMET's sensitivity and specificity were comparable to or higher than the two other subtyping tools on both data sets for known subtypes. COMET also excelled in detecting and identifying new recombinant forms, a frequent feature of the HIV epidemic. Runtime comparisons showed that COMET was almost as fast as USEARCH. This study demonstrates the advantages of alignment-free classification of viral sequences, which feature high rates of variation, recombination and insertions/deletions. COMET is free to use via an online interface.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4191385PMC
http://dx.doi.org/10.1093/nar/gku739DOI Listing

Publication Analysis

Top Keywords

data sets
8
comet
5
comet adaptive
4
adaptive context-based
4
context-based modeling
4
modeling ultrafast
4
ultrafast hiv-1
4
hiv-1 subtype
4
subtype identification
4
identification viral
4

Similar Publications

Background: Evidence regarding the individual and combined impact of dietary flavonoids on the risk of metabolic dysfunction associated with steatotic liver disease (MASLD) remains scarce. Our objective is to evaluate the association between individual and multiple dietary flavonoids with MASLD in adults.

Methods: Data sets were obtained from the National Health and Nutrition Examination Survey (NHANES), 2017-2018.

View Article and Find Full Text PDF

Due to the uncertainty of material properties of plate-like structures, many traditional methods are unable to locate the impact source on their surface in real time. It is important to study the impact source-localization problem for plate structures. In this paper, a data-driven machine learning method is proposed to detect impact sources in plate-like structures and its effectiveness is tested on three plate-like structures with different material properties.

View Article and Find Full Text PDF

Immunoglobulin G4-related disease (IgG4-RD) is an immune-mediated, fibroinflammatory, multiorgan disease with an obscure pathogenesis. Findings indicating excessive platelet activation have been reported in systemic sclerosis, which is another autoimmune, multisystemic fibrotic disorder. The immune-mediated, inflammatory, and fibrosing intersections of IgG4-RD and systemic sclerosis raised a question about platelets' role in IgG4-RD.

View Article and Find Full Text PDF

PLASMA: Partial LeAst Squares for Multiomics Analysis.

Cancers (Basel)

January 2025

Department of Biostatistics, Data Science, and Epidemiology, School of Public Health, Georgia Cancer Center at Augusta University, Augusta, GA 30912, USA.

: Recent growth in the number and applications of high-throughput "omics" technologies has created a need for better methods to integrate multiomics data. Much progress has been made in developing unsupervised methods, but supervised methods have lagged behind. : Here we present the first algorithm, PLASMA, that can learn to predict time-to-event outcomes from multiomics data sets, even when some samples have only been assayed on a subset of the omics data sets.

View Article and Find Full Text PDF

Patterns of Change in Athletic Identity After Anterior Cruciate Ligament Reconstruction.

Int J Environ Res Public Health

January 2025

Department of Psychology, Springfield College, 263 Alden Street, Springfield, MA 01109, USA.

Changes in athletic identity have been documented after injury and other sport transitions in nomothetic investigations. Patterns of change in athletic identity after injury have not been examined systematically at the individual level. In the current study, secondary analyses were performed on two data sets ( = 43 and = 80) in which athletic identity values were available for before and at least six months after anterior cruciate ligament (ACL) reconstruction.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!