Genomic prediction models are often calibrated using multi-generation data. Over time, as data accumulates, training data sets become increasingly heterogeneous. Differences in allele frequency and linkage disequilibrium patterns between the training and prediction genotypes may limit prediction accuracy. This leads to the question of whether all available data or a subset of it should be used to calibrate genomic prediction models. Previous research on training set optimization has focused on identifying a subset of the available data that is optimal for a given prediction set. However, this approach does not contemplate the possibility that different training sets may be optimal for different prediction genotypes. To address this problem, we recently introduced a sparse selection index (SSI) that identifies an optimal training set for each individual in a prediction set. Using additive genomic relationships, the SSI can provide increased accuracy relative to genomic-BLUP (GBLUP). Non-parametric genomic models using Gaussian kernels (KBLUP) have, in some cases, yielded higher prediction accuracies than standard additive models. Therefore, here we studied whether combining SSIs and kernel methods could further improve prediction accuracy when training genomic models using multi-generation data. Using four years of doubled haploid maize data from the International Maize and Wheat Improvement Center (CIMMYT), we found that when predicting grain yield the KBLUP outperformed the GBLUP, and that using SSI with additive relationships (GSSI) lead to 5-17% increases in accuracy, relative to the GBLUP. However, differences in prediction accuracy between the KBLUP and the kernel-based SSI were smaller and not always significant.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8551287PMC
http://dx.doi.org/10.1038/s41437-021-00474-1DOI Listing

Publication Analysis

Top Keywords

genomic prediction
12
prediction accuracy
12
prediction
11
sparse selection
8
prediction models
8
multi-generation data
8
prediction genotypes
8
training set
8
optimal prediction
8
prediction set
8

Similar Publications

Alopecia areata (AA) is an autoimmune condition marked by hair loss, linked to inflammatory processes involving the interleukin-1 receptor type 1 (IL-1R1) pathway. This study aims to explore the relationship between IL-1R1 gene expression, serum IL-1R1 levels, and hsa-miR-19b-3p in relation to AA severity. Using a case-control design, we assessed 100 AA patients and 100 healthy controls, measuring serum IL-1R1 through enzyme-linked immunosorbent assay (ELISA) and analyzing IL-1R1 gene and hsa-miR-19b-3p expression levels via quantitative real-time PCR (qRT-PCR).

View Article and Find Full Text PDF

The neurobiological mechanisms driving the ictal-interictal fluctuations and the chronification of migraine remain elusive. We aimed to construct a composite genetic-microRNA model that could reflect the dynamic perturbations of the disease course and inform the pathogenesis of migraine. We prospectively recruited four groups of participants, including interictal episodic migraine (i.

View Article and Find Full Text PDF

Characterising patterns of genetic diversity including evidence of local adaptation is relevant for predicting and managing species recovering from overexploitation in the face of climate change. Red abalone (Haliotis rufescens) is a species of conservation concern due to recent declines from overharvesting, disease and climate change, resulting in the closure of commercial and recreational fisheries. Using whole-genome resequencing data from 23 populations spanning their entire range (southern Oregon, USA, to Baja California, MEX) we investigated patterns of population connectivity and genotype-environment associations that would reveal local adaptation across the mosaic of coastal environments that define the California Current System (CCS).

View Article and Find Full Text PDF

Background: A recent prospective phase II study (ECOG-ACRIN E2211) demonstrated that MGMT deficiency was associated with a significant response to capecitabine and temozolomide (CAPTEM) in pancreatic neuroendocrine neoplasms (NENs); however, routine MGMT analysis in NENs was not recommended. Our study sought to demonstrate whether loss of MGMT protein expression is associated with improved overall survival (OS) in patients receiving CAPTEM for NENs from various tumor sites.

Materials And Methods: Paraffin-embedded tumor samples were evaluated by immunohistochemistry (IHC) using an MGMT monoclonal antibody.

View Article and Find Full Text PDF

The transcriptomic classification of primary colorectal cancer (CRC) into distinct consensus molecular subtypes (CMSs) is a well-described strategy for patient stratification. However, the molecular nature of CRC metastases remains poorly investigated. To this end, this study aimed to identify and compare organotropic CMS frequencies in CRC liver and brain metastases.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!