Gene clusters are sets of co-localized, often contiguous genes that together perform specific functions, many of which are relevant to biotechnology. There is a need for software tools that can extract candidate gene clusters from vast amounts of available genomic data. Therefore, we developed Opfi: a modular pipeline for identification of arbitrary gene clusters in assembled genomic or metagenomic sequences. Opfi contains functions for annotation, de-deduplication, and visualization of putative gene clusters. It utilizes a customizable rule-based filtering approach for selection of candidate systems that adhere to user-defined criteria. Opfi is implemented in Python, and is available on the Python Package Index and on Bioconda (Grüning et al., 2018).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9017871PMC
http://dx.doi.org/10.21105/joss.03678DOI Listing

Publication Analysis

Top Keywords

gene clusters
20
python package
8
gene
5
clusters
5
opfi
4
opfi python
4
package identifying
4
identifying gene
4
clusters large
4
large genomics
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!