The number of human genomes being genotyped or sequenced increases exponentially and efficient haplotype estimation methods able to handle this amount of data are now required. Here we present a method, SHAPEIT4, which substantially improves upon other methods to process large genotype and high coverage sequencing datasets. It notably exhibits sub-linear running times with sample size, provides highly accurate haplotypes and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants and long sequencing reads.
View Article and Find Full Text PDFThe 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.
View Article and Find Full Text PDFWe carried out a genome-wide association study of schizophrenia (479 cases, 2,937 controls) and tested loci with P < 10(-5) in up to 16,726 additional subjects. Of 12 loci followed up, 3 had strong independent support (P < 5 x 10(-4)), and the overall pattern of replication was unlikely to occur by chance (P = 9 x 10(-8)). Meta-analysis provided strongest evidence for association around ZNF804A (P = 1.
View Article and Find Full Text PDFGenome-wide association (GWA) studies have identified multiple loci at which common variants modestly but reproducibly influence risk of type 2 diabetes (T2D). Established associations to common and rare variants explain only a small proportion of the heritability of T2D. As previously published analyses had limited power to identify variants with modest effects, we carried out meta-analysis of three T2D GWA scans comprising 10,128 individuals of European descent and approximately 2.
View Article and Find Full Text PDFNat Genet
November 2007
We have genotyped 14,436 nonsynonymous SNPs (nsSNPs) and 897 major histocompatibility complex (MHC) tag SNPs from 1,000 independent cases of ankylosing spondylitis (AS), autoimmune thyroid disease (AITD), multiple sclerosis (MS) and breast cancer (BC). Comparing these data against a common control dataset derived from 1,500 randomly selected healthy British individuals, we report initial association and independent replication in a North American sample of two new loci related to ankylosing spondylitis, ARTS1 and IL23R, and confirmation of the previously reported association of AITD with TSHR and FCRL3. These findings, enabled in part by increased statistical power resulting from the expansion of the control reference group to include individuals from the other disease groups, highlight notable new possibilities for autoimmune regulation and suggest that IL23R may be a common susceptibility factor for the major 'seronegative' diseases.
View Article and Find Full Text PDFFor fMRI time-series analysis to be statistically valid, it is important to deal correctly with temporal autocorrelation in the noise. Most of the approaches in the literature adopt a two-stage approach in which the autocorrelation structure is estimated using the residuals of an initial model fit. This estimate is then used to "prewhiten" the data and the model before the model is refit to obtain final activation parameter estimates.
View Article and Find Full Text PDF