The loss of information accompanying assessment of absolute fit of substitution models to phylogenetic data negatively affects the discriminatory power of previous methods and can make them insensitive to lineage-specific changes in the substitution process. As an alternative, I propose evaluating absolute fit of substitution models based on a novel statistic which describes the observed data without information loss and which is unlikely to become zero-inflated with increasing numbers of taxa. This method can accommodate gaps and is sensitive to lineage-specific shifts in the substitution process.
View Article and Find Full Text PDFThough it is well accepted that mitochondria originated from an alphaproteobacteria-like ancestor, the phylogenetic relationship of the mitochondrial endosymbiont to extant Alphaproteobacteria is yet unresolved. The focus of much debate is whether the affinity between mitochondria and fast-evolving alphaproteobacterial lineages reflects true homology or artefacts. Approaches such as site exclusion have been claimed to mitigate compositional heterogeneity between taxa, but this comes at the cost of information loss, and the reliability of such methods is so far unproven.
View Article and Find Full Text PDFA novel test is described that visualizes the absolute model-data fit of the substitution and tree components of an evolutionary model. The test utilizes statistics based on counts of character state matches and mismatches in alignments of observed and simulated sequences. This comparison is used to assess model-data fit.
View Article and Find Full Text PDFThe development of new resistant varieties to the oomycete (Berk.& Curt) is a promising way to combat downy mildew (DM), one of the major diseases threatening the cultivated grapevine ( L.).
View Article and Find Full Text PDFIn grapevine, flavonoids constitute one of the most abundant subgroups of secondary metabolites, influencing the quality, health value, and typicity of wines. Their synthesis in many plant species is mainly regulated at the transcriptional level by modulation of flavonoid pathway genes either by single regulators or by complexes of different regulators. In particular, bZIP and MYB factors interact synergistically in the recognition of light response units present in the promoter of some genes of the pathway, thus mediating light-dependent flavonoid biosynthesis.
View Article and Find Full Text PDFThe phylogenetic branching order of the green algal groups that gave rise to land plants remains uncertain despite its fundamental importance to understanding plant evolution. Previous studies have demonstrated that land plants evolved from streptophyte algae, but different lineages of streptophytes have been suggested to be the sister group of land plants. To better understand the evolutionary history of land plants and to determine the potential effects of "long-branch attraction" in phylogenetic reconstruction, we analyzed a chloroplast genome data set including three new chloroplast genomes from streptophyte algae: Coleochaetae orbicularis (Coleochaetales), Nitella hookeri (Charales), and Spirogyra communis (Zygnematales).
View Article and Find Full Text PDFBoth the origin of domesticated apple and the overall phylogeny of the genus Malus are still not completely resolved. Having this as a target, we built a 134,553-position-long alignment including two previously published chloroplast DNAs (cpDNAs) and 45 de novo sequenced, fully colinear chloroplast genomes from cultivated apple varieties and wild apple species. The data produced are free from compositional heterogeneity and from substitutional saturation, which can adversely affect phylogeny reconstruction.
View Article and Find Full Text PDFCyanobacteria forged two major evolutionary transitions with the invention of oxygenic photosynthesis and the bestowal of photosynthetic lifestyle upon eukaryotes through endosymbiosis. Information germane to understanding those transitions is imprinted in cyanobacterial genomes, but deciphering it is complicated by lateral gene transfer (LGT). Here, we report genome sequences for the morphologically most complex true-branching cyanobacteria, and for Scytonema hofmanni PCC 7110, which with 12,356 proteins is the most gene-rich prokaryote currently known.
View Article and Find Full Text PDFCorrect rooting of the angiosperm radiation is both challenging and necessary for understanding the origins and evolution of physiological and phenotypic traits in flowering plants. The problem is known to be difficult due to the large genetic distance separating flowering plants from other seed plants and the sparse taxon sampling among basal angiosperms. Here, we provide further evidence for concern over substitution model misspecification in analyses of chloroplast DNA sequences.
View Article and Find Full Text PDFMitochondrial genomes of spermatophytes are the largest of all organellar genomes. Their large size has been attributed to various factors; however, the relative contribution of these factors to mitochondrial DNA (mtDNA) expansion remains undetermined. We estimated their relative contribution in Malus domestica (apple).
View Article and Find Full Text PDFFor almost a decade, our knowledge on the organisation of the family 1 UDP-glycosyltransferases (UGTs) has been limited to the model plant A. thaliana. The availability of other plant genomes represents an opportunity to obtain a broader view of the family in terms of evolution and organisation.
View Article and Find Full Text PDFResolving the closest relatives of Gnetales has been an enigmatic problem in seed plant phylogeny. The problem is known to be difficult because of the extent of divergence between this diverse group of gymnosperms and their closest phylogenetic relatives. Here, we investigate the evolutionary properties of conifer chloroplast DNA sequences.
View Article and Find Full Text PDFNoisy data, especially in combination with misalignment and model misspecification can have an adverse effect on phylogeny reconstruction; however, effective methods to identify such data are few. One particularly important class of noisy data is saturated positions. To avoid potential errors related to saturation in phylogenomic analyses, we present an automated procedure involving the step-wise removal of the most variable positions in a given data set coupled with a stopping criterion derived from correlation analyses of pairwise ML distances calculated from the deleted (saturated) and the remaining (conserved) subsets of the alignment.
View Article and Find Full Text PDFWe report a high-quality draft genome sequence of the domesticated apple (Malus × domestica). We show that a relatively recent (>50 million years ago) genome-wide duplication (GWD) has resulted in the transition from nine ancestral chromosomes to 17 chromosomes in the Pyreae. Traces of older GWDs partly support the monophyly of the ancestral paleohexaploidy of eudicots.
View Article and Find Full Text PDFIt is widely appreciated that noisy, highly variable data can impede phylogeney reconstruction. Researchers have for a long time omitted problematic data from phylogenetic analyses, such as the third-codon positions and variable regions. In the analyses of the phylogenetic relations of the angiosperms; however, inclusion of complete gene sequences into genomic-scale alignments has become a common practice.
View Article and Find Full Text PDFThe mitochondrial genome of grape (Vitis vinifera), the largest organelle genome sequenced so far, is presented. The genome is 773,279 nt long and has the highest coding capacity among known angiosperm mitochondrial DNAs (mtDNAs). The proportion of promiscuous DNA of plastid origin in the genome is also the largest ever reported for an angiosperm mtDNA, both in absolute and relative terms.
View Article and Find Full Text PDFThe tropical Andes harbor a major part of the world's plant biodiversity. The montane cacti of the tribes Browningieae, Cereeae, and Trichocereeae underwent extensive radiation and thus are well suited as a model group to study the diversification of Andean plants. We reconstructed their phylogeny employing three noncoding chloroplast regions and explained it in the context of the geological history of South America.
View Article and Find Full Text PDFWe readdress the issue of phylogeny of the basal extant angiosperms employing a source previously not systematically investigated, specifically, the non-coding sequences of cpDNA. Comparison of trees with and without grasses or the outgroup (Pinus) in our analyses revealed no rearrangements in tree topology that might be expected if LBA were distorting the position of the magnoliids. For each model applied, irrespective of whether monocots or ANITA members appeared basally divergent, the orchid Phalaenopsis assumed the same position on the trees with the reduced taxon set as did the branch bearing the orchid plus the grasses in the full alignment.
View Article and Find Full Text PDFDetermining the phylogenetic relationships among the major lines of angiosperms is a long-standing problem, yet the uncertainty as to the phylogenetic affinity of these lines persists. While a number of studies have suggested that the ANITA (Amborella-Nymphaeales-Illiciales-Trimeniales-Aristolochiales) grade is basal within angiosperms, studies of complete chloroplast genome sequences also suggested an alternative tree, wherein the line leading to the grasses branches first among the angiosperms. To improve taxon sampling in the existing chloroplast genome data, we sequenced the chloroplast genome of the monocot Acorus calamus.
View Article and Find Full Text PDFThe traditional approach to plant molecular phylogenetics involves amplifying, sequencing and analyzing one or a few genes from many species and is conducive to broad taxon sampling. An independent approach involves chloroplast genome sequencing, providing much larger amounts of data per taxon but for a smaller number of species. In principle, the two strategies can inform each other but in practice their results sometimes conflict for reasons that are currently debated.
View Article and Find Full Text PDFAngiosperms (flowering plants) dominate contemporary terrestrial flora with roughly 250,000 species, but their origin and early evolution are still poorly understood. In recent years, molecular evidence has accumulated suggesting a dicotyledonous origin of monocots. Phylogenetic reconstructions have suggested that several dicotyledonous groups that include taxa such as Amborella, Austrobaileya, and Nymphaea branch off as the most basal among angiosperms.
View Article and Find Full Text PDFPhylogenetic analyses based on comparison of a limited number of genes recently suggested that Amborella trichopoda is the most ancient angiosperm. Here we present the complete sequence of the chloroplast genome of this plant. It does not display any of the genes characteristic of chloroplast DNA of the gymnosperm Pinus thunbergii (chlB, chlL, chlN, psaM, and ycf12).
View Article and Find Full Text PDF