Publications by authors named "Marco Lopez-Cruz"

Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023 the first open-to-the-public Genomes to Fields (G2F) initiative Genotype by Environment (GxE) prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements and field management notes, gathered by the project over nine years.

View Article and Find Full Text PDF
Article Synopsis
  • - Predicting how genetic and environmental factors influence traits (phenotypes) is a critical challenge in biology, with potential benefits like improved health, food security, and environmental care.
  • - The Genomes to Fields (G2F) initiative hosted a competition in 2022 and 2023, inviting global participants from various disciplines to develop models using a comprehensive dataset gathered over nine years, including genetic and environmental data.
  • - Winning methods combined machine learning with traditional breeding techniques, showcasing a variety of approaches such as quantitative genetics and deep learning, indicating that no single strategy was universally superior in predicting phenotypes in this context.
View Article and Find Full Text PDF

Many genetic models (including models for epistatic effects as well as genetic-by-environment) involve covariance structures that are Hadamard products of lower rank matrices. Implementing these models requires factorizing large Hadamard product matrices. The available algorithms for factorization do not scale well for big data, making the use of some of these models not feasible with large sample sizes.

View Article and Find Full Text PDF

Genotype-by-environment (G×E) interactions can significantly affect crop performance and stability. Investigating G×E requires extensive data sets with diverse cultivars tested over multiple locations and years. The Genomes-to-Fields (G2F) Initiative has tested maize hybrids in more than 130 year-locations in North America since 2014.

View Article and Find Full Text PDF

Objectives: The Genomes to Fields (G2F) 2022 Maize Genotype by Environment (GxE) Prediction Competition aimed to develop models for predicting grain yield for the 2022 Maize GxE project field trials, leveraging the datasets previously generated by this project and other publicly available data.

Data Description: This resource used data from the Maize GxE project within the G2F Initiative [1]. The dataset included phenotypic and genotypic data of the hybrids evaluated in 45 locations from 2014 to 2022.

View Article and Find Full Text PDF
Article Synopsis
  • Oil palm is a critical oil crop, with Colombia being the fourth largest producer, primarily using interspecific hybrids (OxG) for production.
  • Conventional breeding takes a long time (up to 20 years) to develop new varieties, making genomic selection (GS) a promising alternative for faster and improved breeding results.
  • The study evaluated various traits in hybrids and backcrosses, finding that GS models showed better prediction accuracies for certain traits, indicating the importance of genetic correlations and the potential for GS in selecting superior parent lines for oil palm breeding.
View Article and Find Full Text PDF

The success of genomic selection (GS) in breeding schemes relies on its ability to provide accurate predictions of unobserved lines at early stages. Multigeneration data provides opportunities to increase the training data size and thus, the likelihood of extracting useful information from ancestors to improve prediction accuracy. The genomic best linear unbiased predictions (GBLUPs) are performed by borrowing information through kinship relationships between individuals.

View Article and Find Full Text PDF

Genomic prediction models are often calibrated using multi-generation data. Over time, as data accumulates, training data sets become increasingly heterogeneous. Differences in allele frequency and linkage disequilibrium patterns between the training and prediction genotypes may limit prediction accuracy.

View Article and Find Full Text PDF

Genomic prediction uses DNA sequences and phenotypes to predict genetic values. In homogeneous populations, theory indicates that the accuracy of genomic prediction increases with sample size. However, differences in allele frequencies and linkage disequilibrium patterns can lead to heterogeneity in SNP effects.

View Article and Find Full Text PDF

High-throughput phenotyping (HTP) technologies can produce data on thousands of phenotypes per unit being monitored. These data can be used to breed for economically and environmentally relevant traits (e.g.

View Article and Find Full Text PDF

Sorghum is one of the world's major crops, expresses traits for resilience to climate change, and can be used for several purposes including food and clean fuels. Multiple-trait genomic prediction and selection models were implemented using genotyping-by-sequencing single nucleotide polymorphism markers and phenotypic data information. We demonstrated for the first time the efficiency genomic selection modelling of index selection including biofuel traits such as aboveground biomass yield, plant height, and dry mass fraction of the fresh material.

View Article and Find Full Text PDF

The purpose of this work was to assess the performance of four genomic selection (GS) models (GBLUP, BRR, Bayesian LASSO and BayesB) in 4 sorghum grain antioxidant traits (phenols, flavonoids, total antioxidant capacity and condensed tannins) using whole-genome SNP markers in a novel diversity panel of lines and landraces and recombinant inbred lines. One key breeding problem modelled was predicting the performance in the antioxidant production of new and unphenotyped sorghum genotypes (validation set). The population was weakly structured (analysis of molecular variance, AMOVA R = 9%), showed a significant genetic diversity and expressed antioxidant traits with a good level of variability and high correlation.

View Article and Find Full Text PDF

Potato () is a staple food crop and is considered one of the main sources of carbohydrates worldwide. Late blight () and common scab () are two of the primary production constraints faced by potato farming. Previous studies have identified a few resistance genes for both late blight and common scab; however, these genes explain only a limited fraction of the heritability of these diseases.

View Article and Find Full Text PDF

Genomic selection (GS) models use genome-wide genetic information to predict genetic values of candidates of selection. Originally, these models were developed without considering genotype × environment interaction(G×E). Several authors have proposed extensions of the single-environment GS model that accommodate G×E using either covariance functions or environmental covariates.

View Article and Find Full Text PDF

Genotyping-by-sequencing (GBS) technologies have proven capacity for delivering large numbers of marker genotypes with potentially less ascertainment bias than standard single nucleotide polymorphism (SNP) arrays. Therefore, GBS has become an attractive alternative technology for genomic selection. However, the use of GBS data poses important challenges, and the accuracy of genomic prediction using GBS is currently undergoing investigation in several crops, including maize, wheat, and cassava.

View Article and Find Full Text PDF