We previously introduced a numerical quantity called the stability (Ps) of an inferred tree and showed that for the tree to be reliable this stability as well as the reliability of the tree, which is usually computed as the bootstrap probability (Pb), must be high. However, if genome duplication occurs in a species, a gene family of the genome also duplicates, and for this reason alone some Ps values can be high in a tree of the duplicated gene families. In addition, the topology of the duplicated gene family can be similar to that of the original gene family if such gene families are identifiable. After genome duplication, however, the gene families are often partially deleted or partially duplicated, and the duplicated gene family may not show the same topology as that of the original family. It is therefore necessary to compute the similarity of the topologies of the duplicated and the original gene families. In this paper, we introduce another quantity called the reproducibility (Pr) for measuring the similarity of the two gene families. To show how to compute the Pr values, we first compute the Pb and Ps values for each of the MHC class II α and β chain gene families, which were apparently generated by genome duplication. We then compute the Pr values for the MHC class II α and β chain gene families. The Pr values for the α and β chain gene families are now low, and this suggests that the diploidization of gene segregation has occurred after the genome duplication. Currently higher animals, defined as animals with complex phenotypic characters, generally have a higher genome size, and this increase in genome size appears to have been caused by genome duplication and diploidization of gene segregation after genome duplication.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7012300 | PMC |
http://dx.doi.org/10.1093/gbe/evz272 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!