Structural variation, such as deletions, duplications, inversions and complex rearrangements, can have profound effects on gene expression, genome stability, phenotypic diversity and disease susceptibility. Structural variants can encompass up to millions of bases and have the potential to rearrange substantial segments of the genome. They contribute considerably more to genetic diversity in human populations and have larger effects on phenotypic traits than point mutations.
View Article and Find Full Text PDFDrug resistance is a principal limitation to the long-term efficacy of cancer therapies. Cancer genome sequencing can retrospectively delineate the genetic basis of drug resistance, but this requires large numbers of post-treatment samples to nominate causal variants. Here we prospectively identify genetic mechanisms of resistance to ten oncology drugs from CRISPR base editing mutagenesis screens in four cancer cell lines using a guide RNA library predicted to install 32,476 variants in 11 cancer genes.
View Article and Find Full Text PDFGene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors.
View Article and Find Full Text PDFGenetic screens in cancer cell lines inform gene function and drug discovery. More comprehensive screen datasets with multi-omics data are needed to enhance opportunities to functionally map genetic vulnerabilities. Here, we construct a second-generation map of cancer dependencies by annotating 930 cancer cell lines with multi-omic data and analyze relationships between molecular markers and cancer dependencies derived from CRISPR-Cas9 screens.
View Article and Find Full Text PDFGene duplication is common across the tree of life, including yeast and humans, and contributes to genomic robustness. In this study, we examined changes in the subcellular localization and abundance of proteins in response to the deletion of their paralogs originating from the whole-genome duplication event, which is a largely unexplored mechanism of functional divergence. We performed a systematic single-cell imaging analysis of protein dynamics and screened subcellular redistribution of proteins, capturing their localization and abundance changes, providing insight into forces determining paralog retention.
View Article and Find Full Text PDFAssigning functions to genes and learning how to control their expression are part of the foundation of cell biology and therapeutic development. An efficient and unbiased method to accomplish this is genetic screening, which historically required laborious clone generation and phenotyping and is still limited by scale today. The rapid technological progress on modulating gene function with CRISPR-Cas and measuring it in individual cells has now relaxed the major experimental constraints and enabled pooled screening with complex readouts from single cells.
View Article and Find Full Text PDFMutagenic outcomes of CRISPR/Cas9-generated double-stranded breaks depend on both the sequence flanking the cut and cellular DNA damage repair. The interaction of these features has been largely unexplored, limiting our ability to understand and manipulate the outcomes. Here, we measured how the absence of 18 repair genes changed frequencies of 83,680 unique mutational outcomes generated by Cas9 double-stranded breaks at 2,838 synthetic target sequences in mouse embryonic stem cells.
View Article and Find Full Text PDFThe first fruits of the CRISPR-Cas revolution are starting to enter the clinic, with gene editing therapies offering solutions to previously incurable genetic diseases. The success of such applications hinges on control over the mutations that are generated, which are known to vary depending on the targeted locus. In this review, we present the current state of understanding and predicting CRISPR-Cas cutting, base editing, and prime editing outcomes in mammalian cells.
View Article and Find Full Text PDFPerturbing expression is a powerful way to understand the role of individual genes, but can be challenging in important models. CRISPR-Cas screens in human induced pluripotent stem cells (iPSCs) are of limited efficiency due to DNA break-induced stress, while the less stressful silencing with an inactive Cas9 has been considered less effective so far. Here, we developed the dCas9-KRAB-MeCP2 fusion protein for screening in iPSCs from multiple donors.
View Article and Find Full Text PDFMost short sequences can be precisely written into a selected genomic target using prime editing; however, it remains unclear what factors govern insertion. We design a library of 3,604 sequences of various lengths and measure the frequency of their insertion into four genomic sites in three human cell lines, using different prime editor systems in varying DNA repair contexts. We find that length, nucleotide composition and secondary structure of the insertion sequence all affect insertion rates.
View Article and Find Full Text PDFBrightfield cell microscopy is a foundational tool in life sciences. The acquired images are prone to contain visual artifacts that hinder downstream analysis, and automatically removing them is therefore of great practical interest. Deep convolutional neural networks are state-of-the-art for image segmentation, but require pixel-level annotations, which are time-consuming to produce.
View Article and Find Full Text PDFCRISPR/Cas base editors promise nucleotide-level control over DNA sequences, but the determinants of their activity remain incompletely understood. We measured base editing frequencies in two human cell lines for two cytosine and two adenine base editors at ∼14 000 target sequences and find that base editing activity is sequence-biased, with largest effects from nucleotides flanking the target base. Whether a base is edited depends strongly on the combination of its position in the target and the preceding base, acting to widen or narrow the effective editing window.
View Article and Find Full Text PDFEscherichia coli is an important cause of bacterial infections worldwide, with multidrug-resistant strains incurring substantial costs on human lives. Besides therapeutic concentrations of antimicrobials in health care settings, the presence of subinhibitory antimicrobial residues in the environment and in clinics selects for antimicrobial resistance (AMR), but the underlying genetic repertoire is less well understood. Here, we used machine learning to predict the population doubling time and cell growth yield of 1,407 genetically diverse E.
View Article and Find Full Text PDFAdvances in microscopy have increased output data volumes, and powerful image analysis methods are required to match. In particular, finding and characterizing nuclei from microscopy images, a core cytometry task, remains difficult to automate. While deep learning models have given encouraging results on this problem, the most powerful approaches have not yet been tested for attacking it.
View Article and Find Full Text PDFIdentifying nuclei is a standard first step when analysing cells in microscopy images. The traditional approach relies on signal from a DNA stain, or fluorescent transgene expression localised to the nucleus. However, imaging techniques that do not use fluorescence can also carry useful information.
View Article and Find Full Text PDFThe consequence of a mutation can be influenced by the context in which it operates. For example, loss of gene function may be tolerated in one genetic background, and lethal in another. The extent to which mutant phenotypes are malleable, the architecture of modifiers and the identities of causal genes remain largely unknown.
View Article and Find Full Text PDFCRISPR guide RNA libraries have been iteratively improved to provide increasingly efficient reagents, although their large size is a barrier for many applications. We design an optimised minimal genome-wide human CRISPR-Cas9 library (MinLibCas9) by mining existing large-scale gene loss-of-function datasets, resulting in a greater than 42% reduction in size compared to other CRISPR-Cas9 libraries while preserving assay sensitivity and specificity. MinLibCas9 provides backward compatibility with existing datasets, increases the dynamic range of CRISPR-Cas9 screens and extends their application to complex models and assays.
View Article and Find Full Text PDFis a common bacterial species in the gastrointestinal tracts of warm-blooded animals and humans. Pathogenicity and antimicrobial resistance in may emerge via host switching from animal reservoirs. Despite its potential clinical importance, knowledge of the population structure of commensal within wild hosts and the epidemiological links between in nonhuman hosts and in humans is still scarce.
View Article and Find Full Text PDFThe Klebsiella pneumoniae species complex includes important opportunistic pathogens which have become public health priorities linked to major hospital outbreaks and the recent emergence of multidrug-resistant hypervirulent strains. Bacterial virulence and the spread of multidrug resistance have previously been linked to toxin-antitoxin (TA) systems. TA systems encode a toxin that disrupts essential cellular processes, and a cognate antitoxin which counteracts this activity.
View Article and Find Full Text PDFGenome-scale CRISPR-Cas9 viability screens performed in cancer cell lines provide a systematic approach to identify cancer dependencies and new therapeutic targets. As multiple large-scale screens become available, a formal assessment of the reproducibility of these experiments becomes necessary. We analyze data from recently published pan-cancer CRISPR-Cas9 screens performed at the Broad and Sanger Institutes.
View Article and Find Full Text PDFThe eukaryotic translation initiation factor 2α (eIF2α) kinase GCN2 is activated by amino acid starvation to elicit a rectifying physiological program known as the Integrated Stress Response (ISR). A role for uncharged tRNAs as activating ligands of yeast GCN2 is supported experimentally. However, mouse GCN2 activation has recently been observed in circumstances associated with ribosome stalling with no global increase in uncharged tRNAs.
View Article and Find Full Text PDFGenome-wide CRISPR/Cas9 knockout screens are revolutionizing mammalian functional genomics. However, their range of applications remains limited by signal variability from different guide RNAs that target the same gene, which confounds gene effect estimation and dictates large experiment sizes. To address this problem, we report JACKS, a Bayesian method that jointly analyzes screens performed with the same guide RNA library.
View Article and Find Full Text PDF