Simulations demonstrated that estimates of realized genetic gain from linear mixed models using regional trials are biased to some degree. Thus, we recommend multiple selected models to obtain a range of reasonable estimates. Genetic improvements of discrete characteristics are obvious and easy to demonstrate, while quantitative traits require reliable and accurate methods to disentangle the confounding genetic and non-genetic components.
View Article and Find Full Text PDFSoybean is grown primarily for the protein and oil extracted from its seed and its value is influenced by these components. The objective of this study was to map marker-trait associations (MTAs) for the concentration of seed protein, oil, and meal protein using the soybean nested association mapping (SoyNAM) population. The composition traits were evaluated on seed harvested from over 5000 inbred lines of the SoyNAM population grown in 10 field locations across 3 years.
View Article and Find Full Text PDFPhotosynthesis is a key target to improve crop production in many species including soybean [Glycine max (L.) Merr.].
View Article and Find Full Text PDFThe Beavis effect in quantitative trait locus (QTL) mapping describes a phenomenon that the estimated effect size of a statistically significant QTL (measured by the QTL variance) is greater than the true effect size of the QTL if the sample size is not sufficiently large. This is a typical example of the Winners' curse applied to molecular quantitative genetics. Theoretical evaluation and correction for the Winners' curse have been studied for interval mapping.
View Article and Find Full Text PDFPlant breeding is a decision-making discipline based on understanding project objectives. Genetic improvement projects can have two competing objectives: maximize the rate of genetic improvement and minimize the loss of useful genetic variance. For commercial plant breeders, competition in the marketplace forces greater emphasis on maximizing immediate genetic improvements.
View Article and Find Full Text PDFIn soybean variety development and genetic improvement projects, iron deficiency chlorosis (IDC) is visually assessed as an ordinal response variable. Linear Mixed Models for Genomic Prediction (GP) have been developed, compared, and used to select continuous plant traits such as yield, height, and maturity, but can be inappropriate for ordinal traits. Generalized Linear Mixed Models have been developed for GP of ordinal response variables.
View Article and Find Full Text PDFTrait introgression is a complex process that plant breeders use to introduce desirable alleles from one variety or species to another. Two of the major types of decisions that must be made during this sophisticated and uncertain workflow are: parental selection and resource allocation. We formulated the trait introgression problem as an engineering process and proposed a Markov Decision Processes (MDP) model to optimize the resource allocation procedure.
View Article and Find Full Text PDFMaize has for many decades been both one of the most important crops worldwide and one of the primary genetic model organisms. More recently, maize breeding has been impacted by rapid technological advances in sequencing and genotyping technology, transformation including genome editing, doubled haploid technology, parallelled by progress in data sciences and the development of novel breeding approaches utilizing genomic information. Herein, we report on past, current and future developments relevant for maize breeding with regard to (1) genome analysis, (2) germplasm diversity characterization and utilization, (3) manipulation of genetic diversity by transformation and genome editing, (4) inbred line development and hybrid seed production, (5) understanding and prediction of hybrid performance, (6) breeding methodology and (7) synthesis of opportunities and challenges for future maize breeding.
View Article and Find Full Text PDFSummary: We present GWASpro, a high-performance web server for the analyses of large-scale genome-wide association studies (GWAS). GWASpro was developed to provide data analyses for large-scale molecular genetic data, coupled with complex replicated experimental designs such as found in plant science investigations and to overcome the steep learning curves of existing GWAS software tools. GWASpro supports building complex design matrices, by which complex experimental designs that may include replications, treatments, locations and times, can be accounted for in the linear mixed model.
View Article and Find Full Text PDFGenetic improvement toward optimized and stable agronomic performance of soybean genotypes is desirable for food security. Understanding how genotypes perform in different environmental conditions helps breeders develop sustainable cultivars adapted to target regions. Complex traits of importance are known to be controlled by a large number of genomic regions with small effects whose magnitude and direction are modulated by environmental factors.
View Article and Find Full Text PDFA set of nested association mapping (NAM) families was developed by crossing 40 diverse soybean [ (L.) Merr.] genotypes to the common cultivar.
View Article and Find Full Text PDFAn epistatic genetic architecture can have a significant impact on prediction accuracies of genomic prediction (GP) methods. Machine learning methods predict traits comprised of epistatic genetic architectures more accurately than statistical methods based on additive mixed linear models. The differences between these types of GP methods suggest a diagnostic for revealing genetic architectures underlying traits of interest.
View Article and Find Full Text PDFUsing an Operations Research approach, we demonstrate design of optimal trait introgression projects with respect to competing objectives. We demonstrate an innovative approach for designing Trait Introgression (TI) projects based on optimization principles from Operations Research. If the designs of TI projects are based on clear and measurable objectives, they can be translated into mathematical models with decision variables and constraints that can be translated into Pareto optimality plots associated with any arbitrary selection strategy.
View Article and Find Full Text PDFThe objective of this study was to explore the potential of genomic prediction (GP) for soybean resistance against Sclerotinia sclerotiorum (Lib.) de Bary, the causal agent of white mold (WM). A diverse panel of 465 soybean plant introduction accessions was phenotyped for WM resistance in replicated field and greenhouse tests.
View Article and Find Full Text PDFWe introduce software, Numericware i, to compute identical by state (IBS) matrix based on genotypic data. Calculating an IBS matrix with a large dataset requires large computer memory and takes lengthy processing time. Numericware i addresses these challenges with 2 algorithmic methods: multithreading and forward chopping.
View Article and Find Full Text PDFWe consider the plant genetic improvement challenge of introgressing multiple alleles from a homozygous donor to a recipient. First, we frame the project as an algorithmic process that can be mathematically formulated. We then introduce a novel metric for selecting breeding parents that we refer to as the predicted cross value (PCV).
View Article and Find Full Text PDFWe present the generalized numerator relationship matrix (GNRM) algorithm and Numericware N as a software tool for calculating the numerator relationship matrix (NRM). The GNRM algorithm aims to build the NRM based on plant pedigrees. Customary plant pedigrees have a sparse format representing multiple ancestors and offspring.
View Article and Find Full Text PDFCotton fiber represents the largest single cell in plants and they serve as models to study cell development. This study investigated the distribution and evolution of fiber Unigenes anchored to recombination hotspots between tetraploid cotton (Gossypium hirsutum) At and Dt subgenomes, and within a parental diploid cotton (Gossypium raimondii) D genome. Comparative analysis of At vs D and Dt vs D showed that 1) the D genome provides many fiber genes after its merger with another parental diploid cotton (Gossypium arboreum) A genome although the D genome itself does not produce any spinnable fiber; 2) similarity of fiber genes is higher between At vs D than between Dt vs D genomic hotspots.
View Article and Find Full Text PDFParametric and nonparametric methods have been developed for purposes of predicting phenotypes. These methods are based on retrospective analyses of empirical data consisting of genotypic and phenotypic scores. Recent reports have indicated that parametric methods are unable to predict phenotypes of traits with known epistatic genetic architectures.
View Article and Find Full Text PDFIdentification of allelic variants associated with complex traits provides molecular genetic information associated with variability upon which both artificial and natural selections are based. Family-based association mapping (FBAM) takes advantage of linkage disequilibrium among segregating progeny within crosses and among parents to provide greater power than association mapping and greater resolution than linkage mapping. Herein, we discuss the potential adaption of human family-based association tests and quantitative transmission disequilibrium tests for use in crop species.
View Article and Find Full Text PDFDetection of quantitative trait loci (QTL) controlling complex traits followed by selection has become a common approach for selection in crop plants. The QTL are most often identified by linkage mapping using experimental F(2), backcross, advanced inbred, or doubled haploid families. An alternative approach for QTL detection are genome-wide association studies (GWAS) that use pre-existing lines such as those found in breeding programs.
View Article and Find Full Text PDFNested Association Mapping (NAM) has been proposed as a means to combine the power of linkage mapping with the resolution of association mapping. It is enabled through sequencing or array genotyping of parental inbred lines while using low-cost, low-density genotyping technologies for their segregating progenies. For purposes of data analyses of NAM populations, parental genotypes at a large number of Single Nucleotide Polymorphic (SNP) loci need to be projected to their segregating progeny.
View Article and Find Full Text PDFIdentification of functional markers (FMs) provides information about the genetic architecture underlying complex traits. An approach that combines the strengths of linkage and association mapping, referred to as nested association mapping (NAM), has been proposed to identify FMs in many plant species. The ability to identify and resolve FMs for complex traits depends upon a number of factors including frequency of FM alleles, magnitudes of their genetic effects, disequilibrium among functional and nonfunctional markers, statistical analysis methods, and mating design.
View Article and Find Full Text PDF