Background: Genotype imputation is an important process of predicting unknown genotypes, which uses reference population with dense genotypes to predict missing genotypes for both human and animal genetic variations at a low cost. Machine learning methods specially boosting methods have been used in genetic studies to explore the underlying genetic profile of disease and build models capable of predicting missing values of a marker.
Methods: In this study strategies and factors affecting the imputation accuracy of parent-offspring trios compared from lower-density SNP panels (5 K) to high density (10 K) SNP panel using three different Boosting methods namely TotalBoost (TB), LogitBoost (LB) and AdaBoost (AB). The methods employed using simulated data to impute the un-typed SNPs in parent-offspring trios. Four different datasets of G1 (100 trios with 5 k SNPs), G2 (100 trios with 10 k SNPs), G3 (500 trios with 5 k SNPs), and G4 (500 trio with 10 k SNPs) were simulated. In four datasets all parents were genotyped completely, and offspring genotyped with a lower density panel.
Results: Comparison of the three methods for imputation showed that the LB outperformed AB and TB for imputation accuracy. The time of computation were different between methods. The AB was the fastest algorithm. The higher SNP densities resulted the increase of the accuracy of imputation. Larger trios (i.e. 500) was better for performance of LB and TB.
Conclusions: The conclusion is that the three methods do well in terms of imputation accuracy also the dense chip is recommended for imputation of parent-offspring trios.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4702368 | PMC |
http://dx.doi.org/10.1186/s40781-015-0081-1 | DOI Listing |
Mol Biol Rep
December 2024
Production Engineering Division, Momoshima Field Station, Fisheries Technology Institute, Japan Fisheries Research and Education Agency, 1760 Momoshima, Onomichi, Hiroshima, 722-0061, Japan.
Background: As part of stock enhancement programs for marine fishery species, the stocking of hatchery-produced seedlings into sea areas has been implemented worldwide. DNA markers are vital for responsible stock enhancement practices that aim to conserve the genetic diversity of recipient wild populations. We report novel single-nucleotide polymorphism (SNP) markers and multiplex SNP panels developed for the west Pacific pen shell Atrina lischkeana (Clessin, 1891), a large bivalve that is expected to be a subject of stock enhancement activity as the natural resource has dwindled, especially in Japan.
View Article and Find Full Text PDFGenet Med Open
January 2024
Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom.
Purpose: Structural variants such as multiexon deletions and duplications are an important cause of disease but are often overlooked in standard exome/genome sequencing analysis. We aimed to evaluate the detection of copy-number variants (CNVs) from exome sequencing (ES) in comparison with genome-wide low-resolution and exon-resolution chromosomal microarrays (CMAs) and to characterize the properties of de novo CNVs in a large clinical cohort.
Methods: We performed CNV detection using ES of 9859 parent-offspring trios in the Deciphering Developmental Disorders (DDD) study and compared them with CNVs detected from exon-resolution array comparative genomic hybridization in 5197 probands from the DDD study.
bioRxiv
November 2024
Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA.
Given the many levels of biological variation in mutation rates observed to date in primates - spanning from species to individuals to genomic regions - future steps in our understanding of mutation rate evolution will be aided by both a greater breadth of species coverage across the primate clade, but also by a greater depth as afforded by an evaluation of multiple trios within individual species. In order to help bridge these gaps, we here present an analysis of a species representing one of the most basal splits on the primate tree (aye-ayes), combining whole-genome sequencing of seven parent-offspring trios from a three-generation pedigree with a novel computational pipeline that takes advantage of recently developed pan-genome graphs, thereby circumventing the application of (highly subjective) quality metrics that has previously been shown to result in notable differences in the detection of mutations, and ultimately estimates of mutation rates. This deep sampling has enabled both a detailed picture of parental age effects as well as sex dependency in mutation rates which we here compare with previously studied primates, but has also provided unique insights into the nature of genetic variation in one of the most endangered primates on the planet.
View Article and Find Full Text PDFCommun Biol
November 2024
Environmental Health Science and Research Bureau, Health Canada, Ottawa, ON, Canada.
Whole-genome sequencing studies of parent-offspring trios have provided valuable insights into the potential impact of de novo mutations (DNMs) on human health and disease. However, the molecular mechanisms that drive DNMs are unclear. Studies with multi-child families can provide important insight into the causes of inter-family variability in DNM rates but they are highly limited.
View Article and Find Full Text PDFJ Neurosci
December 2024
Department of Child and Adolescent Psychiatry, Erasmus University Medical Center, 3015 GD Rotterdam, the Netherlands
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!