Microbial gene clusters encoding the biosynthesis of primary and secondary metabolites play key roles in shaping microbial ecosystems and driving microbiome-associated phenotypes. Although effective approaches exist to evaluate the metabolic potential of such bacteria through identification of these metabolic gene clusters in their genomes, no automated pipelines exist to profile the abundance and expression levels of such gene clusters in microbiome samples to generate hypotheses about their functional roles, and to find associations with phenotypes of interest. Here, we describe BiG-MAP, a bioinformatic tool to profile abundance and expression levels of gene clusters across metagenomic and metatranscriptomic data and evaluate their differential abundance and expression under different conditions. To illustrate its usefulness, we analyzed 96 metagenomic samples from healthy and caries-associated human oral microbiome samples and identified 252 gene clusters, including unreported ones, that were significantly more abundant in either phenotype. Among them, we found the operon, a gene cluster known to be associated with tooth decay. Additionally, we found a putative reuterin biosynthetic gene cluster from a Streptococcus strain to be enriched but not exclusively found in healthy samples; metabolomic data from the same samples showed masses with fragmentation patterns consistent with (poly)acrolein, which is known to spontaneously form from the products of the reuterin pathway and has been previously shown to inhibit pathogenic Streptococcus mutans strains. Thus, we show how BiG-MAP can be used to generate new hypotheses on potential drivers of microbiome-associated phenotypes and prioritize the experimental characterization of relevant gene clusters that may mediate them. Microbes play an increasingly recognized role in determining host-associated phenotypes by producing small molecules that interact with other microorganisms or host cells. The production of these molecules is often encoded in syntenic genomic regions, also known as gene clusters. With the increasing numbers of (multi)omics data sets that can help in understanding complex ecosystems at a much deeper level, there is a need to create tools that can automate the process of analyzing these gene clusters across omics data sets. This report presents a new software tool called BiG-MAP, which allows assessing gene cluster abundance and expression in microbiome samples using metagenomic and metatranscriptomic data. Here, we describe the tool and its functionalities, as well as its validation using a mock community. Finally, using an oral microbiome data set, we show how it can be used to generate hypotheses regarding the functional roles of gene clusters in mediating host phenotypes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8547482PMC
http://dx.doi.org/10.1128/mSystems.00937-21DOI Listing

Publication Analysis

Top Keywords

gene clusters
36
abundance expression
20
gene cluster
16
gene
13
microbiome samples
12
generate hypotheses
12
clusters
9
metabolic gene
8
cluster abundance
8
microbiome-associated phenotypes
8

Similar Publications

Chromatin Topological Domains Associate With the Rapid Formation of Tandem Duplicates in Plants.

Adv Sci (Weinh)

December 2024

School of Advanced Agriculture Sciences and School of Life Sciences, Academy for Advanced Interdisciplinary Studies, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, 100871, China.

In eukaryotes, chromatin is compacted within nuclei under the principle of compartmentalization. On top of that, condensin II establishes eukaryotic chromosome territories, while cohesin organizes the vertebrate genome by extruding chromatin loops and forming topologically associating domains (TADs). Thus far, the formation and roles of these chromatin structures in plants remain poorly understood.

View Article and Find Full Text PDF

Genomic investigation of plant secondary metabolism: insights from synteny network analysis of oxidosqualene cyclase flanking genes.

New Phytol

December 2024

State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, 712100, China.

The clustered distribution of genes involved in metabolic pathways within the plant genome has garnered significant attention from researchers. By comparing and analyzing changes in the flanking regions of metabolic genes across a diverse array of species, we can enhance our understanding of the formation and distribution of biosynthetic gene clusters (BGCs). In this study, we have designed a workflow that uncovers and assesses conserved positional relationships between genes in various species by using synteny neighborhood networks (SNN).

View Article and Find Full Text PDF

Metastasis in patients with oral squamous cell carcinoma has been associated with a poor prognosis. However, sensitive and reliable tests for monitoring their occurrence are unavailable, with the exception of PET-CT. Circulating tumor cells and cell-free DNA have emerged as promising biomarkers for determining treatment efficacy and as prognostic predictors in solid tumors such as breast cancer and colorectal cancer.

View Article and Find Full Text PDF

Contributions to the () Group in China: Taxonomy, Species Diversity, and Molecular Phylogeny.

J Fungi (Basel)

December 2024

Engineering Research Center of Tropical Medicine Innovation and Transformation of Ministry of Education, International Joint Research Center of Human-Machine Intelligent Collaborative for Tumor Precision Diagnosis and Treatment of Hainan Province, Hainan Provincial Key Laboratory of Research and Development on Tropical Herbs, School of Pharmacy, Hainan Medical University, Haikou 571199, China.

is the largest genus in the family , with approximately 1000 species worldwide. Basic data on the species diversity, geographic distribution, and the infrageneric framework of are still incomplete because of the intricate nature of this genus, which includes numerous unrecognized taxa that exist around the world. A multigene phylogeny of the group, initially designated as the " subgroup", was conducted using the ITS-28S- nucleotide datasets.

View Article and Find Full Text PDF

A pathogen strain responsible for sweet potato stem and foliage scab disease was isolated from sweet potato stems. Through a phylogenetic analysis based on the rDNA internal transcribed spacer (ITS) region, combined with morphological methods, the isolated strain was identified as To comprehensively analyze the pathogenicity of the isolated strain from a genetic perspective, the whole-genome sequencing of HD-1 was performed using both the PacBio and Illumina platforms. The genome of HD-1 is about 26.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!