TQMD is a tool for high-performance computing clusters which downloads, stores and produces lists of dereplicated prokaryotic genomes. It has been developed to counter the ever-growing number of prokaryotic genomes and their uneven taxonomic distribution. It is based on word-based alignment-free methods (-mers), an iterative single-linkage approach and a divide-and-conquer strategy to remain both efficient and scalable. We studied the performance of TQMD by verifying the influence of its parameters and heuristics on the clustering outcome. We further compared TQMD to two other dereplication tools (dRep and Assembly-Dereplicator). Our results showed that TQMD is primarily optimized to dereplicate at higher taxonomic levels (phylum/class), as opposed to the other dereplication tools, but also works at lower taxonomic levels (species/strain) like the other dereplication tools. TQMD is available from source and as a Singularity container at [https://bitbucket.org/phylogeno/tqmd ].

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8106394PMC
http://dx.doi.org/10.7717/peerj.11348DOI Listing

Publication Analysis

Top Keywords

dereplication tools
12
prokaryotic genomes
8
taxonomic levels
8
tqmd
5
torquemada tool
4
tool retrieving
4
retrieving queried
4
queried eubacteria
4
eubacteria metadata
4
metadata dereplicating
4

Similar Publications

Article Synopsis
  • * A total of 53 compounds were annotated, including 22 newly discovered analogues and 4 new homologous series, suggesting significant diversity in crambescin compounds.
  • * The research highlights the effectiveness of combining manual and computational methods for detailed metabolomic analysis, underscoring its potential for high-throughput identification in similar studies.
View Article and Find Full Text PDF

Mass-tagged aminated probes for rapid discovery of azaphilic natural products in fungal crude extracts.

Chem Commun (Camb)

January 2025

Normandie Univ, CNRS, INSA Rouen, UNIROUEN, COBRA (UMR 6014 & FR 3038), 76000 Rouen, France.

Reactivity-based screening (RBS) was used to screen fungal crude extracts for the presence of new azaphilic natural products. A probe composed of a nucleophilic primary amine and an isotopic mass tag was designed for its reactivity towards azaphilic compounds. Addition of the probe to crude extracts of fungal complexes, together with analytical tools for dereplication such as haloseeker and molecular networking, allowed easy detection of azaphilic compounds.

View Article and Find Full Text PDF

Streptomycetes remain an important bacterial source of natural products (NPs) with significant therapeutic promise, particularly in the fight against antimicrobial resistance. Herein, we present StreptomeDB 4.0, a substantial update of the database that includes expanded content and several new features.

View Article and Find Full Text PDF

An integrated 3-M workflow for accelerated annotation of natural products: Flavonoids in Daemonorops draco as a case study.

Talanta

January 2025

The MOE Key Laboratory of Standardization of Chinese Medicines, Shanghai Key Laboratory of Compound Chinese Medicines, and SATCM Key Laboratory of New Resources and Quality Evaluation of Chinese Medicines, Institute of Chinese Materia Medica, Shanghai University of Traditional Chinese Medicine, Shanghai, 201203, China. Electronic address:

Efficient annotation and dereplication of metabolites, particularly those from resource-endangered plants lacking reference standards, is crucial for natural products development. Advanced techniques like high resolution mass spectrometry (LC-HRMS) have significantly enhanced metabolite characterization. However, challenges such as redundant spectral data, limited reference databases, and inferior dereplication capacity hinder its broad applicability.

View Article and Find Full Text PDF

While numerous computational frameworks and workflows are available for recovering prokaryote and eukaryote genomes from metagenome data, only a limited number of pipelines are designed specifically for viromics analysis. With many viromics tools developed in the last few years alone, it can be challenging for scientists with limited bioinformatics experience to easily recover, evaluate quality, annotate genes, dereplicate, assign taxonomy, and calculate relative abundance and coverage of viral genomes using state-of-the-art methods and standards. Here, we describe Modular Viromics Pipeline (MVP) v.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!