RNAseq data can be used to infer genetic variants, yet its use for estimating genetic population structure remains underexplored. Here, we construct a freely available computational tool (RGStraP) to estimate RNAseq-based genetic principal components (RG-PCs) and assess whether RG-PCs can be used to control for population structure in gene expression analyses. Using whole blood samples from understudied Nepalese populations and the Geuvadis study, we show that RG-PCs had comparable results to paired array-based genotypes, with high genotype concordance and high correlations of genetic principal components, capturing subpopulations within the dataset.
View Article and Find Full Text PDFGenomic researchers increasingly utilize commercial cloud service providers (CSPs) to manage data and analytics needs. CSPs allow researchers to grow Information Technology (IT) infrastructure on demand to overcome bottlenecks when combining large datasets. However, without adequate security controls, the risk of unauthorized access may be higher for data stored on the cloud.
View Article and Find Full Text PDFThe therapeutic efficacy of tamoxifen is predominantly mediated by its active metabolites 4-hydroxy-tamoxifen and endoxifen, whose formation is catalyzed by the polymorphic cytochrome P450 2D6 (CYP2D6). Yet, known CYP2D6 polymorphisms only partially determine metabolite concentrations in vivo. We performed the first cross-ancestry genome-wide association study with well-characterized patients of European, Middle-Eastern, and Asian descent (n = 497) to identify genetic factors impacting active and parent metabolite formation.
View Article and Find Full Text PDFGenetic factors underlying leukocyte telomere length (LTL) may provide insights into telomere homeostasis, with direct links to disease susceptibility. Genetic evaluation of 23,096 Singaporean Chinese samples identifies 10 genome-wide loci (P < 5 × 10). Several of these contain candidate genes (TINF2, PARP1, TERF1, ATM and POT1) with potential roles in telomere biology and DNA repair mechanisms.
View Article and Find Full Text PDFMeningococcal disease (MD) remains an important infectious cause of life threatening infection in both industrialized and resource poor countries. Genetic factors influence both occurrence and severity of presentation, but the genes responsible are largely unknown. We performed a genome-wide association study (GWAS) examining 5,440,063 SNPs in 422 Spanish MD patients and 910 controls.
View Article and Find Full Text PDFPolypoidal choroidal vasculopathy (PCV), a subtype of 'wet' age-related macular degeneration (AMD), constitutes up to 55% of cases of wet AMD in Asian patients. In contrast to the choroidal neovascularization (CNV) subtype, the genetic risk factors for PCV are relatively unknown. Exome sequencing analysis of a Han Chinese cohort followed by replication in four independent cohorts identified a rare c.
View Article and Find Full Text PDFAge-related macular degeneration (AMD) is a major cause of blindness, but presents differently in Europeans and Asians. Here, we perform a genome-wide and exome-wide association study on 2,119 patients with exudative AMD and 5,691 controls, with independent replication in 4,226 patients and 10,289 controls, all of East Asian descent, as part of The Genetics of AMD in Asians (GAMA) Consortium. We find a strong association between CETP Asp442Gly (rs2303790), an East Asian-specific mutation, and increased risk of AMD (odds ratio (OR)=1.
View Article and Find Full Text PDFEnteric fever affects more than 25 million people annually and results from systemic infection with Salmonella enterica serovar Typhi or Paratyphi pathovars A, B or C(1). We conducted a genome-wide association study of 432 individuals with blood culture-confirmed enteric fever and 2,011 controls from Vietnam. We observed strong association at rs7765379 (odds ratio (OR) for the minor allele = 0.
View Article and Find Full Text PDFWhole-genome sequencing across multiple samples in a population provides an unprecedented opportunity for comprehensively characterizing the polymorphic variants in the population. Although the 1000 Genomes Project (1KGP) has offered brief insights into the value of population-level sequencing, the low coverage has compromised the ability to confidently detect rare and low-frequency variants. In addition, the composition of populations in the 1KGP is not complete, despite the fact that the study design has been extended to more than 2,500 samples from more than 20 population groups.
View Article and Find Full Text PDFPrimary angle closure glaucoma (PACG) is a major cause of blindness worldwide. We conducted a genome-wide association study including 1,854 PACG cases and 9,608 controls across 5 sample collections in Asia. Replication experiments were conducted in 1,917 PACG cases and 8,943 controls collected from a further 6 sample collections.
View Article and Find Full Text PDFWe performed a two-stage genome-wide association study of IgA nephropathy (IgAN) in Han Chinese, with 1,434 affected individuals (cases) and 4,270 controls in the discovery phase and follow-up of the top 61 SNPs in an additional 2,703 cases and 3,464 controls. We identified associations at 17p13 (rs3803800, P = 9.40 × 10(-11), OR = 1.
View Article and Find Full Text PDFKawasaki disease is a systemic vasculitis of unknown etiology, with clinical observations suggesting a substantial genetic contribution to disease susceptibility. We conducted a genome-wide association study and replication analysis in 2,173 individuals with Kawasaki disease and 9,383 controls from five independent sample collections. Two loci exceeded the formal threshold for genome-wide significance.
View Article and Find Full Text PDFRecent reports have identified a north-south cline in genetic variation in East and South-East Asia, but these studies have not formally explored the basis of these clinical differences. Understanding the origins of these variations may provide valuable insights in tracking down the functional variants in genomic regions identified by genetic association studies. Here we investigate the genetic basis of these differences with genome-wide data from the HapMap, the Human Genome Diversity Project and the Singapore Genome Variation Project.
View Article and Find Full Text PDF