Genomic studies in African populations provide unique opportunities to understand disease etiology, human diversity, and population history. In the largest study of its kind, comprising genome-wide data from 6,400 individuals and whole-genome sequences from 1,978 individuals from rural Uganda, we find evidence of geographically correlated fine-scale population substructure. Historically, the ancestry of modern Ugandans was best represented by a mixture of ancient East African pastoralists.
View Article and Find Full Text PDFThe linear mixed model (LMM) is now routinely used to estimate heritability. Unfortunately, as we demonstrate, LMM estimates of heritability can be inflated when using a standard model. To help reduce this inflation, we used a more general LMM with two random effects-one based on genomic variants and one based on easily measured spatial location as a proxy for environmental effects.
View Article and Find Full Text PDFWe examine improvements to the linear mixed model (LMM) that better correct for population structure and family relatedness in genome-wide association studies (GWAS). LMMs rely on the estimation of a genetic similarity matrix (GSM), which encodes the pairwise similarity between every two individuals in a cohort. These similarities are estimated from single nucleotide polymorphisms (SNPs) or other genetic variants.
View Article and Find Full Text PDFUnlabelled: We investigated the hypothesis that the correlation between the class I HLA types of an individual and whether that individual spontaneously controls HIV-1 is mediated by the targeting of specific epitopes by CD8(+) T cells. By measuring gamma interferon enzyme-linked immunosorbent spot (ELISPOT) assay responses to a panel of 257 optimally defined epitopes in 341 untreated HIV-infected persons, including persons who spontaneously control viremia, we found that the correlation between HLA types and control is mediated by the targeting of specific epitopes. Moreover, we performed a graphical model-based analysis that suggested that the targeting of specific epitopes is a cause of such control--that is, some epitopes are protective rather than merely associated with control--and identified eight epitopes that are significantly protective.
View Article and Find Full Text PDFMotivation: Set-based variance component tests have been identified as a way to increase power in association studies by aggregating weak individual effects. However, the choice of test statistic has been largely ignored even though it may play an important role in obtaining optimal power. We compared a standard statistical test-a score test-with a recently developed likelihood ratio (LR) test.
View Article and Find Full Text PDFApplications of linear mixed models (LMMs) to problems in genomics include phenotype prediction, correction for confounding in genome-wide association studies, estimation of narrow sense heritability, and testing sets of variants (e.g., rare variants) for association.
View Article and Find Full Text PDFMotivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power.
View Article and Find Full Text PDFWe present an approach for genome-wide association analysis with improved power on the Wellcome Trust data consisting of seven common phenotypes and shared controls. We achieved improved power by expanding the control set to include other disease cohorts, multiple races, and closely related individuals. Within this setting, we conducted exhaustive univariate and epistatic interaction association analyses.
View Article and Find Full Text PDFUnlabelled: The development of immunomonitoring models to determine HIV-1 vaccine efficacy is a major challenge. Studies suggest that HIV-1–specific CD8 T cells play a critical role in subjects achieving spontaneous viral control (HIV-1 controllers) and that they will be important in immune interventions. However, no single CD8 T-cell function is uniquely associated with controller status and the heterogeneity of responses targeting different epitopes further complicates the discovery of determinants of protective immunity.
View Article and Find Full Text PDFA small proportion of human immunodeficiency virus-1 (HIV-1) infected individuals, termed HIV-1 controllers, suppress viral replication to very low levels in the absence of therapy. Genetic investigations of this phenotype have strongly implicated variation in the class I major histocompatibility complex (MHC) region as key to HIV-1 control. We collected sequence-based classical class I HLA genotypes at 4-digit resolution in HIV-1-infected African American controllers and progressors (n = 1107), and tested them for association with host control using genome-wide single nucleotide polymorphism data to account for population structure.
View Article and Find Full Text PDFUnderstanding the organization and function of transcriptional regulatory networks by analyzing high-throughput gene expression profiles is a key problem in computational biology. The challenges in this work are 1) the lack of complete knowledge of the regulatory relationship between the regulators and the associated genes, 2) the potential for spurious associations due to confounding factors, and 3) the number of parameters to learn is usually larger than the number of available microarray experiments. We present a sparse (L1 regularized) graphical model to address these challenges.
View Article and Find Full Text PDFThe promiscuous presentation of epitopes by similar HLA class I alleles holds promise for a universal T-cell-based HIV-1 vaccine. However, in some instances, cytotoxic T lymphocytes (CTL) restricted by HLA alleles with similar or identical binding motifs are known to target epitopes at different frequencies, with different functional avidities and with different apparent clinical outcomes. Such differences may be illuminated by the association of similar HLA alleles with distinctive escape pathways.
View Article and Find Full Text PDFWe describe factored spectrally transformed linear mixed models (FaST-LMM), an algorithm for genome-wide association studies (GWAS) that scales linearly with cohort size in both run time and memory use. On Wellcome Trust data for 15,000 individuals, FaST-LMM ran an order of magnitude faster than current efficient algorithms. Our algorithm can analyze data for 120,000 individuals in just a few hours, whereas current algorithms fail on data for even 20,000 individuals (http://mscompbio.
View Article and Find Full Text PDFStrong statistical associations between polymorphisms in HIV-1 population sequences and carriage of HLA class I alleles have been widely used to identify possible sites of CD8 T cell immune selection in vivo. However, there have been few attempts to prospectively and systematically test these genetic hypotheses arising from population-based studies at a cellular, functional level. We assayed CD8 T cell epitope-specific IFN-γ responses in 290 individuals from the same cohort, which gave rise to 874 HLA-HIV associations in genetic analyses, taking into account autologous viral sequences and individual HLA genotypes.
View Article and Find Full Text PDFNatural killer (NK) cells have an important role in the control of viral infections, recognizing virally infected cells through a variety of activating and inhibitory receptors. Epidemiological and functional studies have recently suggested that NK cells can also contribute to the control of HIV-1 infection through recognition of virally infected cells by both activating and inhibitory killer immunoglobulin-like receptors (KIRs). However, it remains unknown whether NK cells can directly mediate antiviral immune pressure in vivo in humans.
View Article and Find Full Text PDFUnderstanding the role of genetic variation in human diseases remains an important problem to be solved in genomics. An important component of such variation consist of variations at single sites in DNA, or single nucleotide polymorphisms (SNPs). Typically, the problem of associating particular SNPs to phenotypes has been confounded by hidden factors such as the presence of population structure, family structure or cryptic relatedness in the sample of individuals being analyzed.
View Article and Find Full Text PDFInduction of virus-specific CD8⁺ T cell responses is critical for the success of vaccines against chronic viral infections. Despite the large number of potential MHC-I-restricted epitopes located in viral proteins, MHC-I-restricted epitope generation is inefficient, and factors defining the production and presentation of MHC-I-restricted viral epitopes are poorly understood. Here, we have demonstrated that the half-lives of HIV-derived peptides in cytosol from primary human cells were highly variable and sequence dependent, and significantly affected the efficiency of cell recognition by CD8⁺ T cells.
View Article and Find Full Text PDFBackground: Identifying viral and host determinants of HIV-1 elite control may help inform novel therapeutic and/or vaccination strategies. Previously, we observed decreased replication capacity in controller-derived viruses suggesting that fitness consequences of human leukocyte antigen (HLA) class I-associated escape mutations in Gag may contribute to this phenotype. This study examines whether similar functional defects occur in Pol proteins of elite controllers.
View Article and Find Full Text PDFScience
December 2010
Infectious and inflammatory diseases have repeatedly shown strong genetic associations within the major histocompatibility complex (MHC); however, the basis for these associations remains elusive. To define host genetic effects on the outcome of a chronic viral infection, we performed genome-wide association analysis in a multiethnic cohort of HIV-1 controllers and progressors, and we analyzed the effects of individual amino acids within the classical human leukocyte antigen (HLA) proteins. We identified >300 genome-wide significant single-nucleotide polymorphisms (SNPs) within the MHC and none elsewhere.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
September 2010
Understanding the genetic underpinnings of disease is important for screening, treatment, drug development, and basic biological insight. One way of getting at such an understanding is to find out which parts of our DNA, such as single-nucleotide polymorphisms, affect particular intermediary processes such as gene expression. Naively, such associations can be identified using a simple statistical test on all paired combinations of genetic variants and gene transcripts.
View Article and Find Full Text PDFMutations that allow escape from CD8 T-cell responses are common in HIV-1 and may attenuate pathogenesis by reducing viral fitness. While this has been demonstrated for individual cases, a systematic investigation of the consequence of HLA class I-mediated selection on HIV-1 in vitro replication capacity (RC) has not been undertaken. We examined this question by generating recombinant viruses expressing plasma HIV-1 RNA-derived Gag-Protease sequences from 66 acute/early and 803 chronic untreated subtype B-infected individuals in an NL4-3 background and measuring their RCs using a green fluorescent protein (GFP) reporter CD4 T-cell assay.
View Article and Find Full Text PDFThe mechanisms underlying HIV-1 control by protective HLA class I alleles are not fully understood and could involve selection of escape mutations in functionally important Gag epitopes resulting in fitness costs. This study was undertaken to investigate, at the population level, the impact of HLA-mediated immune pressure in Gag on viral fitness and its influence on HIV-1 pathogenesis. Replication capacities of 406 recombinant viruses encoding plasma-derived Gag-protease from patients chronically infected with HIV-1 subtype C were assayed in an HIV-1-inducible green fluorescent protein reporter cell line.
View Article and Find Full Text PDFPrevious studies have identified a central role for HLA-B alleles in influencing control of HIV infection. An alternative possibility is that a small number of HLA-B alleles may have a very strong impact on HIV disease outcome, dominating the contribution of other HLA alleles. Here, we find that even following the exclusion of subjects expressing any of the HLA-B class I alleles (B*57, B*58, and B*18) identified to have the strongest influence on control, the dominant impact of HLA-B alleles on virus set point and absolute CD4 count variation remains significant.
View Article and Find Full Text PDFSince HLA-restricted cytotoxic T-cell responses select specific polymorphisms in HIV-1 sequences and HLA diversity is relatively static in human populations, we investigated the use of peptide epitopes based on sites of HLA-associated adaptation in HIV-1 sequences to stimulate and detect T-cell responses ex vivo. These "HLA-optimised" peptides captured more HIV-1 Nef-specific responses compared with overlapping peptides of a single consensus sequence, in interferon-gamma enzyme linked immunospot assays. Sites of immune selection can reveal more immunogenic epitopes in HLA-diverse populations and offer insights into the nature of HLA-epitope targeting, which could be applied in vaccine design.
View Article and Find Full Text PDF