Robust estimation of allele frequencies in pools of DNA has the potential to reduce genotyping costs and/or increase the number of individuals contributing to a study where hundreds of thousands of genetic markers need to be genotyped in very large populations sample sets, such as genome wide association studies. In order to make accurate allele frequency estimations from pooled samples a correction for unequal allele representation must be applied. We have developed the polynomial based probe specific correction (PPC) which is a novel correction algorithm for accurate estimation of allele frequencies in data from high-density microarrays. This algorithm was validated through comparison of allele frequencies from a set of 10 individually genotyped DNA's and frequencies estimated from pools of these 10 DNAs using GeneChip 10K Mapping Xba 131 arrays. Our results demonstrate that when using the PPC to correct for allelic biases the accuracy of the allele frequency estimates increases dramatically.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1240117PMC
http://dx.doi.org/10.1093/nar/gni142DOI Listing

Publication Analysis

Top Keywords

allele frequencies
16
algorithm accurate
8
accurate estimation
8
pools dna
8
estimation allele
8
allele frequency
8
allele
7
frequencies
5
ppc algorithm
4
estimation snp
4

Similar Publications

Background: Primary hyperoxaluria type 1 (PH 1) is a rare genetic condition due to mutations in the AGXT gene. This leads to an overproduction of oxalate in the liver. Hyperoxaluria often causes kidney stones, nephrocalcinosis, and chronic kidney disease.

View Article and Find Full Text PDF

Epidemiological data suggest the population distribution of thyrotropin (TSH) values is shifted toward lower values in self-identified Black non-Hispanic individuals compared with self-identified White non-Hispanic individuals. It is unknown whether genetic differences between individuals with genetic similarities to African reference populations (GSA) and those with similarities to European reference populations (GSE) contribute to these observed differences. We aimed to compare genome-wide associations with TSH and putative causal TSH-associated variants between GSA and GSE groups.

View Article and Find Full Text PDF

Background And Aims: Acute liver failure (ALF) is a serious condition, typically in individuals without prior liver disease. Drug-induced ALF (DIALF) constitutes a major portion of ALF cases. Our research aimed to identify potential genetic predispositions to DIALF.

View Article and Find Full Text PDF

Breast cancer (BC) is a malignant tumor that occurs in breast tissue. This project aims to predict the prognosis of BC patients using genes related to hypoxia and endoplasmic reticulum stress (ERS). RNA-seq and clinical data for BC were downloaded from TCGA and GEO databases.

View Article and Find Full Text PDF

Background: Colon cancer is a leading cause of mortality in Appalachian Kentucky. Studies suggest that the microbiome may influence cancer outcomes. We investigate differential gene expression, the tumor microbiome, and the association between the two as potential drivers of disparities in colon cancer outcomes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!