Genome analysis of individuals affected by retinitis pigmentosa (RP) identified two rare nucleotide substitutions at the same genomic location on chromosome 11 (g.61392563 [GRCh38]), 69 base pairs upstream of the start codon of the ciliopathy gene TMEM216 (c.-69G>A, c.
The impact of genetic regulatory variation active in early pancreatic development on adult pancreatic disease and traits is not well understood. Here, we generate a panel of 107 fetal-like iPSC-derived pancreatic progenitor cells (iPSC-PPCs) from whole genome-sequenced individuals and identify 4065 genes and 4016 isoforms whose expression and/or alternative splicing are affected by regulatory variation. We integrate eQTLs identified in adult islets and whole pancreas samples, which reveal 1805 eQTL associations that are unique to the fetal-like iPSC-PPCs and 1043 eQTLs that exhibit regulatory plasticity across the fetal-like and adult pancreas tissues.
The causal variants and genes underlying thousands of cardiac GWAS signals have yet to be identified. Here, we leverage spatiotemporal information on 966 RNA-seq cardiac samples and perform an expression quantitative trait locus (eQTL) analysis detecting eQTLs considering both eGenes and eIsoforms. We identify 2,578 eQTLs associated with a specific developmental stage-, tissue- and/or cell type.
Reactivation of fetal-specific genes and isoforms occurs during heart failure. However, the underlying molecular mechanisms and the extent to which the fetal program switch occurs remain unclear. Limitations hindering transcriptome-wide analyses of alternative splicing differences (i.
Variability in SARS-CoV-2 susceptibility and COVID-19 disease severity between individuals is partly due to genetic factors. Here, we identify 4 genomic loci with suggestive associations for SARS-CoV-2 susceptibility and 19 for COVID-19 disease severity. Four of these 23 loci likely have an ethnicity-specific component.
Patients with inherited retinal dystrophies (IRDs) were recruited from two understudied populations, Mexico and Pakistan, as well as a third, well-studied population of European Americans, to define the genetic architecture of IRD by whole-genome sequencing (WGS). Whole-genome analysis was performed on 409 individuals from 108 unrelated pedigrees with IRDs. All patients underwent an ophthalmic evaluation to establish the retinal phenotype.
Variability in SARS-CoV-2 susceptibility and COVID-19 disease severity between individuals is partly due to genetic factors. Here, we applied colocalization to compare summary statistics for 16 GWASs from the COVID-19 Host Genetics Initiative to investigate similarities and differences in their genetic signals. We identified 9 loci associated with susceptibility (one with two independent GWAS signals; one with an ethnicity-specific signal), 14 associated with severity (one with two independent GWAS signals; two with ethnicity-specific signals) and one harboring two discrepant GWAS signals (one for susceptibility; one for severity).
Cancer-derived iPSCs have provided valuable insight into oncogenesis, but human cancer cells can often be difficult to reprogram, especially in cases of complex genetic abnormalities. Here we report, to our knowledge, the first successful generation of an iPSC line from a human immortalized acute myeloid leukemia (AML) cell line, the cell line HL-60. This iPSC line retains a majority of the leukemic genotype and displays defects in myeloid differentiation, thus providing a tool for modeling and studying AML.
Inherited retinal degenerations (IRDs) are a group of genetically heterogeneous conditions with a broad phenotypic heterogeneity. Here, we report detection and validation of the underlying cause of progressive retinal degeneration in a nuclear family of European descent with a single affected individual. Whole genome sequencing of the proband and her unaffected sibling identified a novel intron 8 donor splice site variant (c.
Structural variants (SVs) and short tandem repeats (STRs) are important sources of genetic diversity but are not routinely analyzed in genetic studies because they are difficult to accurately identify and genotype. Because SVs and STRs range in size and type, it is necessary to apply multiple algorithms that incorporate different types of evidence from sequencing data and employ complex filtering strategies to discover a comprehensive set of high-quality and reproducible variants. Here we assemble a set of 719 deep whole genome sequencing (WGS) samples (mean 42×) from 477 distinct individuals which we use to discover and genotype a wide spectrum of SV and STR variants using five algorithms.
Structural variants (SVs) and short tandem repeats (STRs) comprise a broad group of diverse DNA variants which vastly differ in their sizes and distributions across the genome. Here, we identify genomic features of SV classes and STRs that are associated with gene expression and complex traits, including their locations relative to eGenes, likelihood of being associated with multiple eGenes, associated eGene types (e.g.
Background: Febrile neonates and young infants presenting with seizure require immediate evaluation and treatment. Herein we report two young infants with parechovirus-A3 (PeV-A3) encephalitis who initially presented with focal seizures, raising suspicion of herpes simplex virus (HSV) encephalitis.
Cases: We encountered two infants who initially presented with focal seizures.
The MHC region is highly associated with autoimmune and infectious diseases. Here we conduct an in-depth interrogation of associations between genetic variation, gene expression and disease. We create a comprehensive map of regulatory variation in the MHC region using WGS from 419 individuals to call eight-digit HLA types and RNA-seq data from matched iPSCs.
Using iPSCs to study cancer has been complicated by the fact that many cancer cells are difficult to reprogram, which has been attributed to the genomic abnormalities present. Acute Myeloid Leukemia (AML) is a complex disease that presents with various types of genomic aberrations that affect prognosis. Here we reprogrammed CD34+ cells from an AML patient containing a rare der(7)t(7;13) translocation associated with poor prognosis, who had relapsed and was refractory to current treatments.
Despite the importance of understanding how variability across induced pluripotent stem cell (iPSC) lines due to non-genetic factors (clone and passage) influences their differentiation outcome, large-scale studies capable of addressing this question have not yet been conducted. Here, we differentiated 191 iPSC lines to generate iPSC-derived cardiovascular progenitor cells (iPSC-CVPCs). We observed cellular heterogeneity across the iPSC-CVPC samples due to varying fractions of two cell types: cardiomyocytes (CMs) and epicardium-derived cells (EPDCs).
The cardiac transcription factor (TF) gene NKX2-5 has been associated with electrocardiographic (EKG) traits through genome-wide association studies (GWASs), but the extent to which differential binding of NKX2-5 at common regulatory variants contributes to these traits has not yet been studied. We analyzed transcriptomic and epigenomic data from induced pluripotent stem cell-derived cardiomyocytes from seven related individuals, and identified ~2,000 single-nucleotide variants associated with allele-specific effects (ASE-SNVs) on NKX2-5 binding. NKX2-5 ASE-SNVs were enriched for altered TF motifs, for heart-specific expression quantitative trait loci and for EKG GWAS signals.
We evaluate whether human induced pluripotent stem cell-derived retinal pigment epithelium (iPSC-RPE) cells can be used to prioritize and functionally characterize causal variants at age-related macular degeneration (AMD) risk loci. We generated iPSC-RPE from six subjects and show that they have morphological and molecular characteristics similar to those of native RPE. We generated RNA-seq, ATAC-seq, and H3K27ac ChIP-seq data and observed high similarity in gene expression and enriched transcription factor motif profiles between iPSC-RPE and human fetal RPE.
Background: Identifying genetic variation associated with plasma protein levels, and the mechanisms by which they act, could provide insight into alterable processes involved in regulation of protein levels. Although protein levels can be affected by genetic variants, their estimation can also be biased by missense variants in coding exons causing technical artifacts. Integrating genome sequence genotype data with mass spectrometry-based protein level estimation could reduce bias, thereby improving detection of variation that affects RNA or protein metabolism.
To understand the mutational burden of human induced pluripotent stem cells (iPSCs), we sequenced genomes of 18 fibroblast-derived iPSC lines and identified different classes of somatic mutations based on structure, origin, and frequency. Copy-number alterations affected 295 kb in each sample and strongly impacted gene expression. UV-damage mutations were present in ∼45% of the iPSCs and accounted for most of the observed heterogeneity in mutation rates across lines.