A stepwise guide for pangenome development in crop plants: an alfalfa (Medicago sativa) case study.

BMC Genomics

USDA-ARS, Plant Science Research Unit, St. Paul, MN, 55108, USA.

Published: October 2024

AI Article Synopsis

  • Pangenomics is becoming important in plant genomics, with many major crops having their genomes sequenced, though polyploid species like wheat and cotton have fewer pangenome resources available.
  • This review discusses the methods used in developing crop pangenomes, tackles challenges encountered, and provides a systematic guide, using alfalfa as a case study for polyploid species.
  • Pangenome resources help uncover important genetic information, and accessible online tools for visualizing pangenomes are expanding their use among scientists and breeders.

Article Abstract

Background: The concept of pangenomics and the importance of structural variants is gaining recognition within the plant genomics community. Due to advancements in sequencing and computational technology, it has become feasible to sequence the entire genome of numerous individuals of a single species at a reasonable cost. Pangenomes have been constructed for many major diploid crops, including rice, maize, soybean, sorghum, pearl millet, peas, sunflower, grapes, and mustards. However, pangenomes for polyploid species are relatively scarce and are available in only few crops including wheat, cotton, rapeseed, and potatoes.

Main Body: In this review, we explore the various methods used in crop pangenome development, discussing the challenges and implications of these techniques based on insights from published pangenome studies. We offer a systematic guide and discuss the tools available for constructing a pangenome and conducting downstream analyses. Alfalfa, a highly heterozygous, cross pollinated and autotetraploid forage crop species, is used as an example to discuss the concerns and challenges offered by polyploid crop species. We conducted a comparative analysis using linear and graph-based methods by constructing an alfalfa graph pangenome using three publicly available genome assemblies. To illustrate the intricacies captured by pangenome graphs for a complex crop genome, we used five different gene sequences and aligned them against the three graph-based pangenomes. The comparison of the three graph pangenome methods reveals notable variations in the genomic variation captured by each pipeline.

Conclusion: Pangenome resources are proving invaluable by offering insights into core and dispensable genes, novel gene discovery, and genome-wide patterns of variation. Developing user-friendly online portals for linear pangenome visualization has made these resources accessible to the broader scientific and breeding community. However, challenges remain with graph-based pangenomes including compatibility with other tools, extraction of sequence for regions of interest, and visualization of genetic variation captured in pangenome graphs. These issues necessitate further refinement of tools and pipelines to effectively address the complexities of polyploid, highly heterozygous, and cross-pollinated species.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526573PMC
http://dx.doi.org/10.1186/s12864-024-10931-wDOI Listing

Publication Analysis

Top Keywords

pangenome
10
pangenome development
8
crops including
8
highly heterozygous
8
crop species
8
graph pangenome
8
captured pangenome
8
pangenome graphs
8
graph-based pangenomes
8
variation captured
8

Similar Publications

is a bacterium associated with colorectal cancer (CRC) tumorigenesis, progression, and metastasis. Fap2 is a fusobacteria-specific outer membrane galactose-binding lectin that mediates adherence to and invasion of CRC tumors. Advances in omics analyses provide an opportunity to profile and identify microbial genomic features that correlate with the cancer-associated bacterial virulence factor Fap2.

View Article and Find Full Text PDF

With the increasing availability of high-quality genome assemblies, pangenome graphs emerged as a new paradigm in the genomics field for identifying, encoding, and presenting genomic variation at both population and species levels. However, it remains challenging to truly dissect and interpret pangenome graphs via biologically informative visualization. To facilitate better exploration and understanding of pangenome graphs towards novel biological insights, here we present a web-based interactive Visualization and interpretation framework for linear-Reference-projected Pangenome Graphs (VRPG).

View Article and Find Full Text PDF

Aligning genomes into common coordinates is central to pangenome analysis and construction, but it is also computationally expensive. Multi-sequence maximal unique matches (multi-MUMs) are guideposts for core genome alignments, helping to frame and solve the multiple alignment problem. We introduce Mumemto, a tool that computes multi-MUMs and other match types across large pangenomes.

View Article and Find Full Text PDF

Cherries are one of the economically important fruit crops in the Rosaceae family, genus. As the first fruits of the spring season in the northern hemisphere, their attractive appearance, intensely desirable tastes, high nutrients content, and consumer-friendly size captivate consumers worldwide. In the past 30 years, although cherry geneticists and breeders have greatly progressed in understanding the genetic and molecular basis underlying fruit quality, adaptation to climate change, and biotic and abiotic stress resistance, the utilization of cherry genomic data in genetics and molecular breeding has remained limited to date.

View Article and Find Full Text PDF

serovar Mbandaka, a prevalent foodborne pathogen, poses a threat to public health but remains poorly understood. We have determined the phylogenomic tree, genetic diversity, virulence, and antimicrobial resistance (AMR) profiles on a large genomic scale to elucidate the evolutionary dynamics within the Mbandaka pan-genome. The polyphyletic nature of this serovar is characterized by two distinct phylogenetic groups and inter-serovar recombination boundaries, that potentially arising from recombination events at the H2-antigen loci.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!