Gene duplication is a source of genetic materials and evolutionary changes, and has been associated with gene family expansion. Functional divergence of duplicated genes is strongly directed by natural selections such as organism diversification and novel feature acquisition. We show that, plant α-amylase gene family (AMY) is comprised of six subfamilies (AMY1-AMY6) that fell into two ancient phylogenetic lineages (AMY3 and AMY4). Both AMY1 and AMY2 are grass-specific and share a single-copy ancestor, which is derived from grass AMY3 genes that have undergone massive tandem and whole-genome duplications during evolution. Ancestral features of AMY4 and AMY5/AMY6 genes have been retained among four green algal sequences (Chrein_08.g362450, Vocart_0021s0194, Dusali_0430s00012 and Monegl_16464), suggesting a gene duplication event following Chlorophyceae diversification. The observed horizontal gene transfers between plant and bacterial AMYs, and chromosomal locations of AMY3 and AMY4 genes in the most ancestral green body (C. reinhardtii), provide evidences for the monophyletic origin of plant AMYs. Despite subfamily-specific sequence divergence driven by natural selections, the active site and SBS1 are well-conserved across different AMY isoforms. The differentiated electrostatic potentials and hydrogen bands-forming residue polymorphisms, further imply variable digestive abilities for a broad substrates in particular tissues or subcellular localizations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6426938PMC
http://dx.doi.org/10.1038/s41598-019-41420-wDOI Listing

Publication Analysis

Top Keywords

gene duplication
8
gene family
8
natural selections
8
amy3 amy4
8
genes
5
gene
5
insights origin
4
origin evolution
4
evolution α-amylase
4
α-amylase genes
4

Similar Publications

Meiosis is generally a fair process: each chromosome has a 50% chance of being included into each gamete. However, meiosis can become aberrant with some chromosomes having a higher chance of making it into gametes than others. Yet, why and how such systems evolve remains unclear.

View Article and Find Full Text PDF

Male reproductive proteins frequently evolve rapidly in animals, potentially due to adaptive evolution driven by sperm competition, polyspermy avoidance, or pathogen defense. Alternatively, elevated rates of protein change may be due to relaxed constraint. The prostate-specific protease KLK3 has experienced dynamic evolution since its origin stemming from a gene duplication in the ancestor of all Old World primates, with instances of rapid evolution, stasis, and pseudogenization.

View Article and Find Full Text PDF

Pontederia cordata L. is an aquatic ornamental plant native to the Americas, but has been widely distributed in South Asia, Australia, and Europe. The genetic mechanisms behind its rapid adaptation and spread have not yet been well understood.

View Article and Find Full Text PDF

In the past decade, our understanding of how new genes originate in diverse organisms has advanced substantially, and more than a dozen molecular mechanisms for generating initial gene structures were identified, in addition to gene duplication. These new genes have been found to integrate into and modify pre-existing gene networks primarily through mutation and selection, revealing new patterns and rules with stable origination rates across various organisms. This progress has challenged the prevailing belief that new proteins evolve from pre-existing genes, as new genes may arise de novo from noncoding DNA sequences in many organisms, with high rates observed in flowering plants.

View Article and Find Full Text PDF

Organismal complexity strongly correlates with the number of protein families and domains.

Proc Natl Acad Sci U S A

February 2025

Duncan and Nancy MacMillan Cancer Immunology and Metabolism Center of Excellence, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ 08901.

In the pregenomic era, scientists were puzzled by the observation that haploid genome size (the C-value) did not correlate well with organismal complexity. This phenomenon, called the "C-value paradox," is mostly explained by the fact that protein-coding genes occupy only a small fraction of eukaryotic genomes. When the first genome sequences became available, scientists were even more surprised by the fact that the number of genes (G-value) was also a poor predictor of complexity, which gave rise to the "G-value paradox.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!