A Quantitative Computational Framework for Allopolyploid Single-Cell Data Integration and Core Gene Ranking in Development.

Mol Biol Evol

State Key Laboratory of Genetic Engineering, Department of Biochemistry, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Institute of Plant Biology, Fudan University, Shanghai 200438, China.

Published: September 2024

Polyploidization drives regulatory and phenotypic innovation. How the merger of different genomes contributes to polyploid development is a fundamental issue in evolutionary developmental biology and breeding research. Clarifying this issue is challenging because of genome complexity and the difficulty in tracking stochastic subgenome divergence during development. Recent single-cell sequencing techniques enabled probing subgenome-divergent regulation in the context of cellular differentiation. However, analyzing single-cell data suffers from high error rates due to high dimensionality, noise, and sparsity, and the errors stack up in polyploid analysis due to the increased dimensionality of comparisons between subgenomes of each cell, hindering deeper mechanistic understandings. In this study, we develop a quantitative computational framework, called "pseudo-genome divergence quantification" (pgDQ), for quantifying and tracking subgenome divergence directly at the cellular level. Further comparing with cellular differentiation trajectories derived from single-cell RNA sequencing data allows for an examination of the relationship between subgenome divergence and the progression of development. pgDQ produces robust results and is insensitive to data dropout and noise, avoiding high error rates due to multiple comparisons of genes, cells, and subgenomes. A statistical diagnostic approach is proposed to identify genes that are central to subgenome divergence during development, which facilitates the integration of different data modalities, enabling the identification of factors and pathways that mediate subgenome-divergent activity during development. Case studies have demonstrated that applying pgDQ to single-cell and bulk tissue transcriptomic data promotes a systematic and deeper understanding of how dynamic subgenome divergence contributes to developmental trajectories in polyploid evolution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11421573PMC
http://dx.doi.org/10.1093/molbev/msae178DOI Listing

Publication Analysis

Top Keywords

subgenome divergence
20
quantitative computational
8
computational framework
8
single-cell data
8
divergence development
8
cellular differentiation
8
high error
8
error rates
8
data
6
development
6

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!