Routine and systematic use of bacterial whole-genome sequencing (WGS) is enhancing the accuracy and resolution of epidemiological investigations carried out by public health laboratories and regulatory agencies. Large volumes of publicly available WGS data can be used to study pathogenic populations at scale. Recently, a freely available computational platform called ProkEvo was published to enable reproducible, automated, and scalable hierarchical-based population genomic analyses using bacterial WGS data. This implementation of ProkEvo demonstrated the importance of combining standard genotypic mapping of populations with mining of accessory genomic content for ecological inference. In particular, the work highlighted here used ProkEvo-derived outputs for population-scale hierarchical analyses using the R programming language. The main objective was to provide a practical guide for microbiologists, ecologists, and epidemiologists by showing how to: i) use a phylogeny-guided mapping of hierarchical genotypes; ii) assess frequency distributions of genotypes as a proxy for ecological fitness; iii) determine kinship relationships and genetic diversity using specific genotypic classifications; and iv) map lineage-differentiating accessory loci. To enhance reproducibility and portability, R markdown files were used to demonstrate the entire analytical approach. The example dataset contained genomic data from 2,365 isolates of the zoonotic foodborne pathogen Salmonella Newport. Phylogeny-anchored mapping of hierarchical genotypes (Serovar -> BAPS1 -> ST -> cgMLST) revealed the population genetic structure, highlighting sequence types (STs) as the keystone differentiating genotype. Among the three most dominant lineages, ST5 and ST118 shared a more recent common ancestor with each other than either did with the highly clonal ST45 phylotype. ST-based differences were further highlighted by the distribution of accessory antimicrobial resistance (AMR) loci. Lastly, a phylogeny-anchored visualization was used to combine hierarchical genotypes and AMR content to reveal the kinship structure and lineage-specific genomic signatures. Combined, this analytical approach provides guidelines for conducting heuristic bacterial population genomic analyses using pan-genomic information.
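As a rough illustration of step ii (frequency distributions of genotypes) and of the Serovar -> BAPS1 -> ST -> cgMLST hierarchy, the following minimal sketch tabulates relative frequencies at one level and counts child genotypes within each parent. It is written in Python rather than the R used in the original work, and everything in it is an assumption made for illustration: the isolate records are made-up toy values (not data from the study), and the names `ISOLATES`, `LEVELS`, `relative_frequencies`, and `nested_counts` are hypothetical and not part of ProkEvo.

```python
from collections import Counter, defaultdict

# Toy, made-up isolate records (NOT data from the study).
# Each tuple follows the hierarchy Serovar -> BAPS1 -> ST -> cgMLST.
ISOLATES = [
    ("Newport", "BAPS1_A", "ST45", "cg1"),
    ("Newport", "BAPS1_A", "ST45", "cg1"),
    ("Newport", "BAPS1_A", "ST45", "cg2"),
    ("Newport", "BAPS1_A", "ST45", "cg2"),
    ("Newport", "BAPS1_A", "ST45", "cg2"),
    ("Newport", "BAPS1_B", "ST5", "cg3"),
    ("Newport", "BAPS1_B", "ST5", "cg4"),
    ("Newport", "BAPS1_B", "ST5", "cg4"),
    ("Newport", "BAPS1_B", "ST118", "cg5"),
    ("Newport", "BAPS1_B", "ST118", "cg6"),
]

# Position of each hierarchical level inside a record tuple.
LEVELS = {"serovar": 0, "baps1": 1, "st": 2, "cgmlst": 3}


def relative_frequencies(records, level):
    """Return {genotype: proportion of isolates}, most frequent first."""
    idx = LEVELS[level]
    counts = Counter(rec[idx] for rec in records)
    total = sum(counts.values())
    return {genotype: n / total for genotype, n in counts.most_common()}


def nested_counts(records, parent, child):
    """Count child genotypes within each parent genotype."""
    p, c = LEVELS[parent], LEVELS[child]
    nested = defaultdict(Counter)
    for rec in records:
        nested[rec[p]][rec[c]] += 1
    return {key: dict(value) for key, value in nested.items()}


if __name__ == "__main__":
    # Proportion of isolates per sequence type.
    for st, freq in relative_frequencies(ISOLATES, "st").items():
        print(f"{st}\t{freq:.2f}")
    # cgMLST variants observed within each sequence type.
    print(nested_counts(ISOLATES, "st", "cgmlst"))
```

Relative frequency is only the proxy for ecological fitness that the abstract names; the sketch does not attempt the phylogeny-anchored mapping or the AMR loci analysis.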

Source
http://dx.doi.org/10.3791/63115

Publication Analysis

Top Keywords

hierarchical genotypes: 16
wgs data: 8
population genomic: 8
genomic analyses: 8
mapping hierarchical: 8
analytical approach: 8
hierarchical: 5
genotypes: 5
genomic: 5
heuristic mining: 4

Similar Publications

Drought is a detrimental abiotic stress that severely limits wheat growth and productivity worldwide by altering several physiological processes. Thus, understanding the mechanisms of drought tolerance is essential for the selection of drought-resilient features and drought-tolerant cultivars for wheat breeding programs. This exploratory study evaluated 14 wheat genotypes (13 relatively tolerant, one susceptible) for drought endurance based on flag leaf physiological and biochemical traits during the critical grain-filling stage in the field conditions.

Inherited genetics represents an important contributor to risk of esophageal adenocarcinoma (EAC), and its precursor Barrett's esophagus (BE). Genome-wide association studies have identified ∼30 susceptibility variants for BE/EAC, yet genetic interactions remain unexamined. To address challenges in large-scale G×G scans, we combined knowledge-guided filtering and machine learning approaches, focusing on genes with (A) known/plausible links to BE/EAC pathogenesis (n=493) or (B) prior evidence of biological interactions (n=4,196).

Article Synopsis
  • HPV genotype plays a crucial role in predicting cervical cancer risk, and using genotyping can improve management strategies for HPV-positive patients during cervical screening.
  • The ScreenFire HPV RS assay, combined with the Zebra BioDome technology, facilitates efficient testing by processing up to 96 samples in about an hour while minimizing contamination risks with fewer pipetting steps.
  • Validation studies on the Zebra BioDome showed excellent repeatability and accuracy when compared to the standard assay, suggesting it could streamline HPV testing and improve accessibility for point-of-care diagnostics in low-resource settings.

In plant breeding and genetics, predictive models traditionally rely on compact representations of high-dimensional data, often using methods like Principal Component Analysis (PCA) and, more recently, Autoencoders (AE). However, these methods do not separate genotype-specific and environment-specific features, limiting their ability to accurately predict traits influenced by both genetic and environmental factors. We hypothesize that disentangling these representations into genotype-specific and environment-specific components can enhance predictive models.
