BiG-FAM: the biosynthetic gene cluster families database.

Nucleic Acids Res

Bioinformatics Group, Wageningen University, 6708PB Wageningen, The Netherlands.

Published: January 2021

Computational analysis of biosynthetic gene clusters (BGCs) has revolutionized natural product discovery by enabling the rapid investigation of secondary metabolic potential within microbial genome sequences. Grouping homologous BGCs into Gene Cluster Families (GCFs) facilitates mapping their architectural and taxonomic diversity and provides insights into the novelty of putative BGCs through dereplication with BGCs of known function. While multiple databases exist for exploring BGCs from publicly available data, no public resources exist that focus on GCF relationships. Here, we present BiG-FAM, a database of 29,955 GCFs capturing the global diversity of 1,225,071 BGCs predicted from 209,206 publicly available microbial genomes and metagenome-assembled genomes (MAGs). The database offers rich functionalities, such as multi-criterion GCF searches, direct links to BGC databases including antiSMASH-DB, and rapid GCF annotation of user-supplied BGCs from antiSMASH results. BiG-FAM can be accessed online at https://bigfam.bioinformatics.nl.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778980
DOI: http://dx.doi.org/10.1093/nar/gkaa812
