Background: Despite ongoing reductions in the cost of sequencing technologies, whole genome SNP genotype imputation is often used as an alternative for obtaining abundant SNP genotypes for genome wide association studies. Several existing genotype imputation methods can be efficient for this purpose, while achieving various levels of imputation accuracy. Recent empirical results have shown that the two-step imputation may improve accuracy by imputing the low density genotyped study animals to a medium density array first and then to the target density. We are interested in building a series of staircase arrays that lead the low density array to the high density array or even the whole genome, such that genotype imputation along these staircases can achieve the highest accuracy.
Results: For genotype imputation from a lower density to a higher density, we first show how to select untyped SNPs to construct a medium density array. Subsequently, we determine for each selected SNP those untyped SNPs to be imputed in the add-one two-step imputation, and lastly how the clusters of imputed genotype are pieced together as the final imputation result. We design extensive empirical experiments using several hundred sequenced and genotyped animals to demonstrate that our novel two-step piecemeal imputation always achieves an improvement compared to the one-step imputation by the state-of-the-art methods Beagle and FImpute. Using the two-step piecemeal imputation, we present some preliminary success on whole genome SNP genotype imputation for genotyped animals via a series of staircase arrays.
Conclusions: From a low SNP density to the whole genome, intermediate pseudo-arrays can be computationally constructed by selecting the most informative SNPs for untyped SNP genotype imputation. Such pseudo-array staircases are able to impute more accurately than the classic one-step imputation.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619096 | PMC |
http://dx.doi.org/10.1186/s12859-015-0770-2 | DOI Listing |
One of the major challenges in genomic data sharing is protecting participants' privacy in collaborative studies and when genomic data is outsourced to perform analysis tasks, e.g., genotype imputation services and federated collaborations genomic analysis.
View Article and Find Full Text PDFPlant Genome
March 2025
Plant Breeding Graduate Program, Horticultural Sciences Department, University of Florida, IFAS Gulf Coast Research and Education Center, Wimauma, Florida, USA.
Genomic selection is a widely used quantitative method of determining the genetic value of an individual from genomic information and phenotypic data. In this study, we used a large, multi-year training population of 3248 individuals from the University of Florida strawberry (Fragaria × ananassa Duchesne) breeding program. We coupled this training population with a test population of 1460 individuals derived from 20 biparental families.
View Article and Find Full Text PDFJ Dairy Sci
January 2025
Confederación de Asociaciones de Frisona Española (CONAFE), Ctra. de Andalucía km 23600 Valdemoro, 28340 Madrid, Spain.
Epizootic hemorrhagic disease (EHD) is a non-contagious viral infection that can cause important economic losses in dairy farms. This study aimed to identify epidemiological and genetic factors influencing the susceptibility and severity of EHD in Holstein dairy cattle during the 2023 outbreak in Spain. Data from 2852 animals in 7 affected farms from 5 Spanish provinces were used.
View Article and Find Full Text PDFJ Dairy Sci
January 2025
College of Animal Science and Technology, Northwest A&F University, 22 nt, Xinong Road, Yangling, Shaanxi, China. Electronic address:
Low-coverage whole-genome sequencing (LcWGS), a cost-effective genotyping method, offers greater flexibility in variant detection than does single-nucleotide polymorphism (SNP) chips. However, to our knowledge, no studies have explored the application of LcWGS in sheep. This study aimed to evaluate the feasibility of implementing LcWGS and genotype imputation and assess their applicability in genomic studies of body weight and milk yield in sheep.
View Article and Find Full Text PDFJ Anim Sci
January 2025
USDA-Agricultural Research Service, U.S. Meat Animal Research Center (USMARC), Clay Center, NE, USA.
Sow lameness results in premature culling, causing economic loss and well-being issues. A study, utilizing a pressure-sensing mat (GAIT4) and video monitoring system (NUtrack), was conducted to identify objective measurements on gilts that are predictive of future lameness. Gilts (N = 3656) were categorized to describe their lifetime soundness: SOUND, retained for breeding with no detected mobility issues; LAME_SOW, retained for breeding and detected lame as a sow; CULL_STR, not retained due to poor leg structure; LAME_GILT, not retained due to visible signs of lameness; and CULL, not retained due to reasons other than leg structure.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!