The uncovering of protein-RNA interactions enables a deeper understanding of RNA processing. Recent multiplexed crosslinking and immunoprecipitation (CLIP) technologies such as antibody-barcoded eCLIP (ABC) dramatically increase the throughput of mapping RNA binding protein (RBP) binding sites. However, multiplex CLIP datasets are multivariate, and each RBP suffers non-uniform signal-to-noise ratio.
View Article and Find Full Text PDFHere, we present a protocol for using Skipper, a pipeline designed to process crosslinking and immunoprecipitation (CLIP) data into annotated binding sites. We describe steps for partitioning annotated transcript regions and fitting data to a beta-binomial model to call windows of enriched binding. From raw CLIP data, we detail how users can map reproducible RNA-binding sites to call enriched windows and perform downstream analysis.
View Article and Find Full Text PDFRNA binding proteins (RBPs) are key regulators of RNA processing and cellular function. Technologies to discover RNA targets of RBPs such as TRIBE (targets of RNA binding proteins identified by editing) and STAMP (surveying targets by APOBEC1 mediated profiling) utilize fusions of RNA base-editors (rBEs) to RBPs to circumvent the limitations of immunoprecipitation (CLIP)-based methods that require enzymatic digestion and large amounts of input material. To broaden the repertoire of rBEs suitable for editing-based RBP-RNA interaction studies, we have devised experimental and computational assays in a framework called PRINTER (protein-RNA interaction-based triaging of enzymes that edit RNA) to assess over thirty A-to-I and C-to-U rBEs, allowing us to identify rBEs that expand the characterization of binding patterns for both sequence-specific and broad-binding RBPs.
View Article and Find Full Text PDFRNA-binding proteins (RBPs) modulate alternative splicing outcomes to determine isoform expression and cellular survival. To identify RBPs that directly drive alternative exon inclusion, we developed tethered function luciferase-based splicing reporters that provide rapid, scalable and robust readouts of exon inclusion changes and used these to evaluate 718 human RBPs. We performed enhanced cross-linking immunoprecipitation, RNA sequencing and affinity purification-mass spectrometry to investigate a subset of candidates with no prior association with splicing.
View Article and Find Full Text PDFRNA binding proteins (RBPs) are key regulators of RNA processing and cellular function. Technologies to discover RNA targets of RBPs such as TRIBE (targets of RNA binding proteins identified by editing) and STAMP (surveying targets by APOBEC1 mediated profiling) utilize fusions of RNA base-editors (rBEs) to RBPs to circumvent the limitations of immunoprecipitation (CLIP)-based methods that require enzymatic digestion and large amounts of input material. To broaden the repertoire of rBEs suitable for editing-based RBP-RNA interaction studies, we have devised experimental and computational assays in a framework called PRINTER (protein-RNA interaction-based triaging of enzymes that edit RNA) to assess over thirty A-to-I and C-to-U rBEs, allowing us to identify rBEs that expand the characterization of binding patterns for both sequence-specific and broad-binding RBPs.
View Article and Find Full Text PDFTechnology for crosslinking and immunoprecipitation (CLIP) followed by sequencing (CLIP-seq) has identified the transcriptomic targets of hundreds of RNA-binding proteins in cells. To increase the power of existing and future CLIP-seq datasets, we introduce Skipper, an end-to-end workflow that converts unprocessed reads into annotated binding sites using an improved statistical framework. Compared with existing methods, Skipper on average calls 210%-320% more transcriptomic binding sites and sometimes >1,000% more sites, providing deeper insight into post-transcriptional gene regulation.
View Article and Find Full Text PDFThe RNA-guided nuclease Cas9 has unlocked powerful methods for perturbing both the genome through targeted DNA cleavage and the regulome through targeted DNA binding, but limited biochemical data have hampered efforts to quantitatively model sequence perturbation of target binding and cleavage across diverse guide sequences. We present scalable, sequencing-based platforms for high-throughput filter binding and cleavage and then perform 62,444 quantitative binding and cleavage assays on 35,047 on- and off-target DNA sequences across 90 Cas9 ribonucleoproteins (RNPs) loaded with distinct guide RNAs. We observe that binding and cleavage efficacy, as well as specificity, vary substantially across RNPs; canonically studied guides often have atypically high specificity; sequence context surrounding the target modulates Cas9 on-rate; and Cas9 RNPs may sequester targets in nonproductive states that contribute to "proofreading" capability.
View Article and Find Full Text PDFUnlabelled: Several recent papers have reported strong signals of selection on European polygenic height scores. These analyses used height effect estimates from the GIANT consortium and replication studies. Here, we describe a new analysis based on the the UK Biobank (UKB), a large, independent dataset.
View Article and Find Full Text PDFPowerful new technologies for perturbing genetic elements have recently expanded the study of genetic interactions in model systems ranging from yeast to human cell lines. However, technical artifacts can confound signal across genetic screens and limit the immense potential of parallel screening approaches. To address this problem, we devised a novel PCA-based method for correcting genome-wide screening data, bolstering the sensitivity and specificity of detection for genetic interactions.
View Article and Find Full Text PDFBackground: Cell-free DNA (cfDNA) diagnostics are emerging as a new paradigm of disease monitoring and therapy management. The clinical utility of these diagnostics is relatively limited by a low signal-to-noise ratio, such as with low allele frequency (AF) mutations in cancer. While enriching for rare alleles to increase their AF before sample analysis is one strategy that can greatly improve detection capability, current methods are limited in their generalizability, ease of use, and applicability to point mutations.
View Article and Find Full Text PDFThe accumulation of amyloid beta (Aβ) peptide (Amyloid cascade hypothesis), an APP protein cleavage product, is a leading hypothesis in the etiology of Alzheimer's disease (AD). In order to identify additional AD risk genes, we performed targeted sequencing and rare variant burden association study for nine candidate genes involved in the amyloid metabolism in 1886 AD cases and 1700 controls. We identified a significant variant burden association for the gene encoding caspase-8, CASP8 (p = 8.
View Article and Find Full Text PDFThe challenge of linking intergenic mutations to target genes has limited molecular understanding of human diseases. Here we show that H3K27ac HiChIP generates high-resolution contact maps of active enhancers and target genes in rare primary human T cell subtypes and coronary artery smooth muscle cells. Differentiation of naive T cells into T helper 17 cells or regulatory T cells creates subtype-specific enhancer-promoter interactions, specifically at regions of shared DNA accessibility.
View Article and Find Full Text PDFA central goal of genetics is to understand the links between genetic variation and disease. Intuitively, one might expect disease-causing variants to cluster into key pathways that drive disease etiology. But for complex traits, association signals tend to be spread across most of the genome-including near many genes without an obvious connection to disease.
View Article and Find Full Text PDFThe bacterial adaptive immune system CRISPR-Cas9 has been appropriated as a versatile tool for editing genomes, controlling gene expression, and visualizing genetic loci. To analyze Cas9's ability to bind DNA rapidly and specifically, we generated multiple libraries of potential binding partners for measuring the kinetics of nuclease-dead Cas9 (dCas9) interactions. Using a massively parallel method to quantify protein-DNA interactions on a high-throughput sequencing flow cell, we comprehensively assess the effects of combinatorial mismatches between guide RNA (gRNA) and target nucleotides, both in the seed and in more distal nucleotides, plus disruption of the protospacer adjacent motif (PAM).
View Article and Find Full Text PDFCRISPR-Cas9 screens are powerful tools for high-throughput interrogation of genome function, but can be confounded by nuclease-induced toxicity at both on- and off-target sites, likely due to DNA damage. Here, to test potential solutions to this issue, we design and analyse a CRISPR-Cas9 library with 10 variable-length guides per gene and thousands of negative controls targeting non-functional, non-genic regions (termed safe-targeting guides), in addition to non-targeting controls. We find this library has excellent performance in identifying genes affecting growth and sensitivity to the ricin toxin.
View Article and Find Full Text PDFMosaicism is increasingly recognized as a cause of developmental disorders with the advent of next-generation sequencing (NGS). Mosaic mutations of have been associated with the widest spectrum of phenotypes associated with overgrowth and vascular malformations. We performed targeted NGS using 2 independent deep-coverage methods that utilize molecular inversion probes and amplicon sequencing in a cohort of 241 samples from 181 individuals with brain and/or body overgrowth.
View Article and Find Full Text PDFImportance: Focal cortical dysplasia (FCD), hemimegalencephaly, and megalencephaly constitute a spectrum of malformations of cortical development with shared neuropathologic features. These disorders are associated with significant childhood morbidity and mortality.
Objective: To identify the underlying molecular cause of FCD, hemimegalencephaly, and diffuse megalencephaly.
Background: Bilateral perisylvian polymicrogyria (BPP), the most common form of regional polymicrogyria, causes the congenital bilateral perisylvian syndrome, featuring oromotor dysfunction, cognitive impairment, and epilepsy. The causes of BPP are heterogeneous, but only a few genetic causes have been reported. The aim of this study was to identify additional genetic causes of BPP and characterise their frequency in this population.
View Article and Find Full Text PDFDefects in primary cilium biogenesis underlie the ciliopathies, a growing group of genetic disorders. We describe a whole-genome siRNA-based reverse genetics screen for defects in biogenesis and/or maintenance of the primary cilium, obtaining a global resource. We identify 112 candidate ciliogenesis and ciliopathy genes, including 44 components of the ubiquitin-proteasome system, 12 G-protein-coupled receptors, and 3 pre-mRNA processing factors (PRPF6, PRPF8 and PRPF31) mutated in autosomal dominant retinitis pigmentosa.
View Article and Find Full Text PDFJoubert syndrome (JS) is a recessive neurodevelopmental disorder characterized by a distinctive mid-hindbrain malformation. JS is part of a group of disorders called ciliopathies based on their overlapping phenotypes and common underlying pathophysiology linked to primary cilium dysfunction. Biallelic mutations in one of 28 genes, all encoding proteins localizing to the primary cilium or basal body, can cause JS.
View Article and Find Full Text PDFThe genetic cause underlying the development of multiple colonic adenomas, the premalignant precursors of colorectal cancer (CRC), frequently remains unresolved in patients with adenomatous polyposis. Here we applied whole-exome sequencing to 51 individuals with multiple colonic adenomas from 48 families. In seven affected individuals from three unrelated families, we identified a homozygous germline nonsense mutation in the base-excision repair (BER) gene NTHL1.
View Article and Find Full Text PDFFreeman-Sheldon syndrome, or distal arthrogryposis type 2A (DA2A), is an autosomal-dominant condition caused by mutations in MYH3 and characterized by multiple congenital contractures of the face and limbs and normal cognitive development. We identified a subset of five individuals who had been putatively diagnosed with "DA2A with severe neurological abnormalities" and for whom congenital contractures of the limbs and face, hypotonia, and global developmental delay had resulted in early death in three cases; this is a unique condition that we now refer to as CLIFAHDD syndrome. Exome sequencing identified missense mutations in the sodium leak channel, non-selective (NALCN) in four families affected by CLIFAHDD syndrome.
View Article and Find Full Text PDF