Publications by authors named "Perez-Rodriguez Paulino"

Analyzing human genomic data from biobanks and large-scale genetic evaluations often requires fitting models with a sample size exceeding the number of DNA markers used (n > p). For instance, developing Polygenic Scores (PGS) for humans and genomic prediction for genetic evaluations of agricultural species may require fitting models involving a few thousand SNPs using data with hundreds of thousands of samples. In such cases, computations based on sufficient statistics are more efficient than those based on individual genotype-phenotype data.

View Article and Find Full Text PDF
Article Synopsis
  • Statistical machine learning (ML) analyzes large volumes of genomic, phenotypic, and environmental data to uncover patterns and improve prediction models in plant breeding.
  • By investigating genotype-by-environment (G×E) interactions, ML helps identify genetic factors that influence performance in various environments.
  • This review emphasizes how big data and ML enhance prediction accuracy and streamline breeding strategies through comprehensive analysis of diverse datasets.
View Article and Find Full Text PDF

This study presents a novel approach for the optimization of genomic parental selection in breeding programs involving categorical and continuous-categorical multi-trait mixtures (CMs and CCMMs). Utilizing the Bayesian decision theory (BDT) and latent trait models within a multivariate normal distribution framework, we address the complexities of selecting new parental lines across ordinal and continuous traits for breeding. Our methodology enhances precision and flexibility in genetic selection, validated through extensive simulations.

View Article and Find Full Text PDF

The use of plant genetic resources (PGR)-wild relatives, landraces, and isolated breeding gene pools-has had substantial impacts on wheat breeding for resistance to biotic and abiotic stresses, while increasing nutritional value, end-use quality, and grain yield. In the Global South, post-Green Revolution genetic yield gains are generally achieved with minimal additional inputs. As a result, production has increased, and millions of hectares of natural ecosystems have been spared.

View Article and Find Full Text PDF

Bread wheat (Triticum aestivum L.) is a globally important food crop, which was domesticated about 8-10,000 years ago. Bread wheat is an allopolyploid, and it evolved from two hybridization events of three species.

View Article and Find Full Text PDF

A rapid, eco-friendly, and simple method for the synthesis of long-lasting (2 years) silver nanoparticles (AgNPs) is reported using aqueous leaf and petal extracts of L. The particles were characterized using UV-Visible spectrophotometry and the analytical and crystallographic techniques of transmission electron microscopy (TEM). The longevity of the AgNPs was studied using UV-Vis and high-resolution TEM.

View Article and Find Full Text PDF

In the field of plant breeding, various machine learning models have been developed and studied to evaluate the genomic prediction (GP) accuracy of unseen phenotypes. Deep learning has shown promise. However, most studies on deep learning in plant breeding have been limited to small datasets, and only a few have explored its application in moderate-sized datasets.

View Article and Find Full Text PDF

Development of high yielding cowpea varieties coupled with good taste and rich in essential minerals can promote consumption and thus nutrition and profitability. The sweet taste of cowpea grain is determined by its sugar content, which comprises mainly sucrose and galacto-oligosaccharides (GOS) including raffinose and stachyose. However, GOS are indigestible and their fermentation in the colon can produce excess intestinal gas, causing undesirable bloating and flatulence.

View Article and Find Full Text PDF

Selecting and mating parents in conventional phenotypic and genomic selection are crucial. Plant breeding programs aim to improve the economic value of crops, considering multiple traits simultaneously. When traits are negatively correlated and/or when there are missing records in some traits, selection becomes more complex.

View Article and Find Full Text PDF

Many genetic models (including models for epistatic effects as well as genetic-by-environment) involve covariance structures that are Hadamard products of lower rank matrices. Implementing these models requires factorizing large Hadamard product matrices. The available algorithms for factorization do not scale well for big data, making the use of some of these models not feasible with large sample sizes.

View Article and Find Full Text PDF

Inbreeding depression (ID) is caused by increased homozygosity in the offspring after selfing. Although the self-compatible, highly heterozygous, tetrasomic polyploid potato ( L.) suffers from ID, some argue that the potential genetic gains from using inbred lines in a sexual propagation system of potato are too large to be ignored.

View Article and Find Full Text PDF

It is of paramount importance in plant breeding to have methods dealing with large numbers of predictor variables and few sample observations, as well as efficient methods for dealing with high correlation in predictors and measured traits. This paper explores in terms of prediction performance the partial least squares (PLS) method under single-trait (ST) and multi-trait (MT) prediction of potato traits. The first prediction was for tested lines in tested environments under a five-fold cross-validation (5FCV) strategy and the second prediction was for tested lines in untested environments (herein denoted as leave one environment out cross validation, LOEO).

View Article and Find Full Text PDF

Objective: The objective was to compare (pedigree-based) best linear unbiased prediction (BLUP), genomic BLUP (GBLUP), and single-step GBLUP (ssGBLUP) methods for genomic evaluation of growth traits in a Mexican Braunvieh cattle population.

Methods: Birth (BW), weaning (WW), and yearling weight (YW) data of a Mexican Braunvieh cattle population were analyzed with BLUP, GBLUP, and ssGBLUP methods. These methods are differentiated by the additive genetic relationship matrix included in the model and the animals under evaluation.

View Article and Find Full Text PDF

While several statistical machine learning methods have been developed and studied for assessing the genomic prediction (GP) accuracy of unobserved phenotypes in plant breeding research, few methods have linked genomics and phenomics (imaging). Deep learning (DL) neural networks have been developed to increase the GP accuracy of unobserved phenotypes while simultaneously accounting for the complexity of genotype-environment interaction (GE); however, unlike conventional GP models, DL has not been investigated for when genomics is linked with phenomics. In this study we used 2 wheat data sets (DS1 and DS2) to compare a novel DL method with conventional GP models.

View Article and Find Full Text PDF

Development and deployment of high-yielding maize varieties with native resistance to Fall armyworm (FAW), turcicum leaf blight (TLB), and gray leaf spot (GLS) infestation is critical for addressing the food insecurity in sub-Saharan Africa. The objectives of this study were to determine the inheritance of resistance for FAW, identity hybrids which in addition to FAW resistance, also show resistance to TLB and GLS, and investigate the usefulness of models based on general combining ability (GCA) and SNP markers in predicting the performance of new untested hybrids. Half-diallel mating scheme was used to generate 105 F hybrids from 15 parents and another 55 F hybrids from 11 parents.

View Article and Find Full Text PDF

Genomic selection (GS) in wheat breeding programs is of great interest for predicting the genotypic values of individuals, where both additive and nonadditive effects determine the final breeding value of lines. While several simulation studies have shown the efficiency of rapid-cycling GS strategies for parental selection or population improvement, their practical implementations are still lacking in wheat and other crops. In this study, we demonstrate the potential of rapid-cycle recurrent GS (RCRGS) to increase genetic gain for grain yield (GY) in wheat.

View Article and Find Full Text PDF

Background: As a result of the technological progress, the use of sensors for crop survey has substantially increased, generating valuable information for modelling agricultural data. Plant spectroscopy jointly with statistical modeling can potentially help to assess certain chemical components of interest present in plants, which may be laborious and expensive to obtain by direct measurements. In this research, the phosphorus content in wheat grain is modeled using reflectance information measured by a hyperspectral sensor at different wavelengths.

View Article and Find Full Text PDF

The success of genomic selection (GS) in breeding schemes relies on its ability to provide accurate predictions of unobserved lines at early stages. Multigeneration data provides opportunities to increase the training data size and thus, the likelihood of extracting useful information from ancestors to improve prediction accuracy. The genomic best linear unbiased predictions (GBLUPs) are performed by borrowing information through kinship relationships between individuals.

View Article and Find Full Text PDF

Spot blotch (SB) caused by (Sacc.) Shoem is a destructive fungal disease affecting wheat and many other crops. Synthetic hexaploid wheat (SHW) offers opportunities to explore new resistance genes for SB for introgression into elite bread wheat.

View Article and Find Full Text PDF

The BGLR-R package implements various types of single-trait shrinkage/variable selection Bayesian regressions. The package was first released in 2014, since then it has become a software very often used in genomic studies. We recently develop functionality for multitrait models.

View Article and Find Full Text PDF

Modern GWAS studies use an enormous sample size and ultra-high density SNP genotypes. These conditions reduce the mapping resolution of marginal association tests-the method most often used in GWAS. Multi-locus Bayesian Variable Selection (BVS) offers a one-stop solution for powerful and precise mapping of risk variants and polygenic risk score (PRS) prediction.

View Article and Find Full Text PDF

Braunvieh is an important dual-purpose breed in the Mexican tropics. The study of its genetic diversity is key to implementing genetic improvement programs. This study was conducted to determine genetic diversity of reproductive traits in a Mexican Braunvieh beef cattle population using single nucleotide polymorphisms in candidate genes.

View Article and Find Full Text PDF

Background: Coffee quality is an important selection criterion for coffee breeding. Metabolite profiling and Genome-Wide Association Studies (GWAS) effectively dissect the genetic background of complex traits such as metabolites content (caffeine, trigonelline, and 5-caffeoylquinic acid (5-CQA)) in coffee that affect quality. Therefore, it is important to determine the metabolic profiles of Coffea spp.

View Article and Find Full Text PDF

Genomic-enabled prediction models are of paramount importance for the successful implementation of genomic selection (GS) based on breeding values. As opposed to animal breeding, plant breeding includes extensive multienvironment and multiyear field trial data. Hence, genomic-enabled prediction models should include genotype × environment (G × E) interaction, which most of the time increases the prediction performance when the response of lines are different from environment to environment.

View Article and Find Full Text PDF