Pangenomes are collections of annotated genome sequences of multiple individuals of a species. The structural variants uncovered by these datasets are a major asset to genetic analysis in crop plants. Here we report a pangenome of barley comprising long-read sequence assemblies of 76 wild and domesticated genomes and short-read sequence data of 1,315 genotypes.
View Article and Find Full Text PDFUnder-utilised orphan crops hold the key to diversified and climate-resilient food systems. Here, we report on orphan crop genomics using the case of Lablab purpureus (L.) Sweet (lablab) - a legume native to Africa and cultivated throughout the tropics for food and forage.
View Article and Find Full Text PDFIncreasing the proportion of locally produced plant protein in currently meat-rich diets could substantially reduce greenhouse gas emissions and loss of biodiversity. However, plant protein production is hampered by the lack of a cool-season legume equivalent to soybean in agronomic value. Faba bean (Vicia faba L.
View Article and Find Full Text PDFParasitism is a successful life strategy that has evolved independently in several families of vascular plants. The genera Cuscuta and Orobanche represent examples of the two profoundly different groups of parasites: one parasitizing host shoots and the other infecting host roots. In this study, we sequenced and described the overall repertoire of small RNAs from Cuscuta campestris and Orobanche aegyptiaca.
View Article and Find Full Text PDFCultivated oat (Avena sativa L.) is an allohexaploid (AACCDD, 2n = 6x = 42) thought to have been domesticated more than 3,000 years ago while growing as a weed in wheat, emmer and barley fields in Anatolia. Oat has a low carbon footprint, substantial health benefits and the potential to replace animal-based food products.
View Article and Find Full Text PDFTo unlock the genetic potential in crops, multi-genome comparisons are an essential tool. Decreasing costs and improved sequencing technologies have democratized plant genome sequencing and led to a vast increase in the amount of available reference sequences on the one hand and enabled the assembly of even the largest and most complex and repetitive crops genomes such as wheat and barley. These developments have led to the era of pan-genomics in recent years.
View Article and Find Full Text PDFWheat has one of the largest and most repetitive genomes among major crop plants, containing over 85% transposable elements (TEs). TEs populate genomes much in the way that individuals populate ecosystems, diversifying into different lineages, sub-families and sub-populations. The recent availability of high-quality, chromosome-scale genome sequences from ten wheat lines enables a detailed analysis how TEs evolved in allohexaploid wheat, its diploids progenitors, and in various chromosomal haplotype segments.
View Article and Find Full Text PDFWe have established a high-quality, chromosome-level genome assembly for the hexaploid common wheat cultivar 'Fielder', an American, soft, white, pastry-type wheat released in 1974 and known for its amenability to Agrobacterium tumefaciens-mediated transformation and genome editing. Accurate, long-read sequences were obtained using PacBio circular consensus sequencing with the HiFi approach. Sequence reads from 16 SMRT cells assembled using the hifiasm assembler produced assemblies with N50 greater than 20 Mb.
View Article and Find Full Text PDFRye (Secale cereale L.) is an exceptionally climate-resilient cereal crop, used extensively to produce improved wheat varieties via introgressive hybridization and possessing the entire repertoire of genes necessary to enable hybrid breeding. Rye is allogamous and only recently domesticated, thus giving cultivated ryes access to a diverse and exploitable wild gene pool.
View Article and Find Full Text PDFSequence assembly of large and repeat-rich plant genomes has been challenging, requiring substantial computational resources and often several complementary sequence assembly and genome mapping approaches. The recent development of fast and accurate long-read sequencing by circular consensus sequencing (CCS) on the PacBio platform may greatly increase the scope of plant pan-genome projects. Here, we compare current long-read sequencing platforms regarding their ability to rapidly generate contiguous sequence assemblies in pan-genome studies of barley (Hordeum vulgare).
View Article and Find Full Text PDFAdvances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the size and complexity of the wheat genome, and the lack of genome-assembly data for multiple wheat lines.
View Article and Find Full Text PDFThe diversity of maize (Zea mays) is the backbone of modern heterotic patterns and hybrid breeding. Historically, US farmers exploited this variability to establish today's highly productive Corn Belt inbred lines from blends of dent and flint germplasm pools. Here, we report de novo genome sequences of four European flint lines assembled to pseudomolecules with scaffold N50 ranging from 6.
View Article and Find Full Text PDFChromosome-scale genome sequence assemblies underpin pan-genomic studies. Recent genome assembly efforts in the large-genome Triticeae crops wheat and barley have relied on the commercial closed-source assembly algorithm DeNovoMagic. We present TRITEX, an open-source computational workflow that combines paired-end, mate-pair, 10X Genomics linked-read with chromosome conformation capture sequencing data to construct sequence scaffolds with megabase-scale contiguity ordered into chromosomal pseudomolecules.
View Article and Find Full Text PDFParasitic plants in the genus Striga, commonly known as witchweeds, cause major crop losses in sub-Saharan Africa and pose a threat to agriculture worldwide. An understanding of Striga parasite biology, which could lead to agricultural solutions, has been hampered by the lack of genome information. Here, we report the draft genome sequence of Striga asiatica with 34,577 predicted protein-coding genes, which reflects gene family contractions and expansions that are consistent with a three-phase model of parasitic plant genome evolution.
View Article and Find Full Text PDFThe domestication of wild emmer wheat led to the selection of modern durum wheat, grown mainly for pasta production. We describe the 10.45 gigabase (Gb) assembly of the genome of durum wheat cultivar Svevo.
View Article and Find Full Text PDFRecent advances in genomics technologies have greatly accelerated the progress in both fundamental plant science and applied breeding research. Concurrently, high-throughput plant phenotyping is becoming widely adopted in the plant community, promising to alleviate the phenotypic bottleneck. While these technological breakthroughs are significantly accelerating quantitative trait locus (QTL) and causal gene identification, challenges to enable even more sophisticated analyses remain.
View Article and Find Full Text PDFBackground: Transposable elements (TEs) are major components of large plant genomes and main drivers of genome evolution. The most recent assembly of hexaploid bread wheat recovered the highly repetitive TE space in an almost complete chromosomal context and enabled a detailed view into the dynamics of TEs in the A, B, and D subgenomes.
Results: The overall TE content is very similar between the A, B, and D subgenomes, although we find no evidence for bursts of TE amplification after the polyploidization events.
A parasitic lifestyle, where plants procure some or all of their nutrients from other living plants, has evolved independently in many dicotyledonous plant families and is a major threat for agriculture globally. Nevertheless, no genome sequence of a parasitic plant has been reported to date. Here we describe the genome sequence of the parasitic field dodder, Cuscuta campestris.
View Article and Find Full Text PDFBackground: While transposable elements (TEs) comprise the bulk of plant genomic DNA, how they contribute to genome structure and organization is still poorly understood. Especially in large genomes where TEs make the majority of genomic DNA, it is still unclear whether TEs target specific chromosomal regions or whether they simply accumulate where they are best tolerated.
Results: Here, we present an analysis of the repetitive fraction of the 5100 Mb barley genome, the largest angiosperm genome to have a near-complete sequence assembly.
The draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome-scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle.
View Article and Find Full Text PDFPseudogenes have a reputation of being 'evolutionary relics' or 'junk DNA'. While they are well characterized in mammals, studies in more complex plant genomes have so far been hampered by the absence of reference genome sequences. Barley is one of the economically most important cereals and has a genome size of 5.
View Article and Find Full Text PDFPlant genetic resources are a substantial opportunity for plant breeding, preservation and maintenance of biological diversity. As part of the German Network for Bioinformatics Infrastructure (de.NBI) the German Crop BioGreenformatics Network (GCBN) focuses mainly on crop plants and provides both data and software infrastructure which are tailored to the needs of the plant research community.
View Article and Find Full Text PDFWheat ( spp.) is one of the founder crops that likely drove the Neolithic transition to sedentary agrarian societies in the Fertile Crescent more than 10,000 years ago. Identifying genetic modifications underlying wheat's domestication requires knowledge about the genome of its allo-tetraploid progenitor, wild emmer ( ssp.
View Article and Find Full Text PDF