Seven quick tips for gene-focused computational pangenomic analysis.

BioData Min

Dipartimento di Informatica Sistemistica e Comunicazione, Università di Milano-Bicocca, Milan, Italy.

Published: September 2024

Pangenomics is a relatively new scientific field which investigates the union of all the genomes of a clade. The word pan means everything in ancient Greek; the term pangenomics originally regarded genomes of bacteria and was later intended to refer to human genomes as well. Modern bioinformatics offers several tools to analyze pangenomics data, paving the way to an emerging field that we can call computational pangenomics. Current computational power available for the bioinformatics community has made computational pangenomic analyses easy to perform, but this higher accessibility to pangenomics analysis also increases the chances to make mistakes and to produce misleading or inflated results, especially by beginners. To handle this problem, we present here a few quick tips for efficient and correct computational pangenomic analyses with a focus on bacterial pangenomics, by describing common mistakes to avoid and experienced best practices to follow in this field. We believe our recommendations can help the readers perform more robust and sound pangenomic analyses and to generate more reliable results.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11370085PMC
http://dx.doi.org/10.1186/s13040-024-00380-2DOI Listing

Publication Analysis

Top Keywords

computational pangenomic
12
pangenomic analyses
12
quick tips
8
pangenomics
6
computational
5
tips gene-focused
4
gene-focused computational
4
pangenomic
4
pangenomic analysis
4
analysis pangenomics
4

Similar Publications

Fast exact gap-affine partial order alignment with POASTA.

Bioinformatics

January 2025

Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, United States.

Motivation: Partial order alignment is a widely used method for computing multiple sequence alignments, with applications in genome assembly and pangenomics, among many others. Current algorithms to compute the optimal, gap-affine partial order alignment do not scale well to larger graphs and sequences. While heuristic approaches exist, they do not guarantee optimal alignment and sacrifice alignment accuracy.

View Article and Find Full Text PDF

Unlabelled: Thousands of complete genome sequences for strains of a species that are now available enable the advancement of pangenome analytics to a new level of sophistication. We collected 2,377 publicly available complete genomes of for detailed pangenome analysis. The core genome and accessory genomes consisted of 2,398 and 5,182 genes, respectively.

View Article and Find Full Text PDF

Unlabelled: ) is a clinically significant pathogen and a highly genetically diverse species due to its large accessory genome. The functional consequence of this diversity remains unknown mainly because, to date, functional genomic studies in have been primarily performed on reference strains. Given the growing public health threat of infections, understanding the functional genomic differences among clinical isolates can provide more insight into how its genetic diversity influences gene essentiality, clinically relevant phenotypes, and importantly, potential drug targets.

View Article and Find Full Text PDF

Evolutionary events leading to organismal preference for a specific growth temperature, as well as genes whose products are needed for a proper function at that temperature, are poorly understood. Using 64 bacteria from phylum Thermotogota as a model system, we examined how optimal growth temperature changed throughout Thermotogota history. We inferred that Thermotogota's last common ancestor was a thermophile and that some Thermotogota evolved the mesophilic and hyperthermophilic lifestyles secondarily.

View Article and Find Full Text PDF

Cancerous tissue is a largely unexplored microbial niche that provides a unique environment for the colonization and growth of specific bacterial communities, and with it, the opportunity to identify novel bacterial species. Here, we report distinct features of a novel species, sp. nov.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!