Numerous high-specificity Cas9 variants have been engineered for precision genome editing. These variants typically harbor multiple mutations designed to alter the Cas9-single guide RNA (sgRNA)-DNA complex interactions for reduced off-target cleavage. By dissecting the contributions of individual mutations, we attempt to derive principles for designing high-specificity Cas9 variants.
View Article and Find Full Text PDFCag type IV secretion system (CagT4SS) translocates oncoprotein cytotoxin-associated gene A (CagA) into host cells and plays a key role in the pathogenesis of Helicobacter pylori. The structure of the outer membrane core complex (OMCC) in CagT4SS consists of CagX, CagY, CagM, CagT, and Cag3 in a stoichiometric ratio of 1:1:2:2:5 with 14-fold symmetry. However, the assembly pathway of OMCC remains elusive.
View Article and Find Full Text PDFA strategy to obtain the greatest number of best-performing variants with least amount of experimental effort over the vast combinatorial mutational landscape would have enormous utility in boosting resource producibility for protein engineering. Toward this goal, we present a simple and effective machine learning-based strategy that outperforms other state-of-the-art methods. Our strategy integrates zero-shot prediction and multi-round sampling to direct active learning via experimenting with only a few predicted top variants.
View Article and Find Full Text PDFMapping mutations and discovering cellular determinants that cause the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to induce infected cells to form syncytia would facilitate the development of strategies for blocking the formation of such cell-cell fusion. Here we describe high-throughput screening methods based on droplet microfluidics and the size-exclusion selection of syncytia, coupled with large-scale mutagenesis and genome-wide knockout screening via clustered regularly interspaced short palindromic repeats (CRISPR), for the large-scale identification of determinants of cell-cell fusion. We used the methods to perform deep mutational scans in spike-presenting cells to pinpoint mutable syncytium-enhancing substitutions in two regions of the spike protein (the fusion peptide proximal region and the furin-cleavage site).
View Article and Find Full Text PDFSelecting the most suitable existing base editors and engineering new variants for installing specific base conversions with maximal efficiency and minimal undesired edits are pivotal for precise genome editing applications. Here, we present a platform for creating and analyzing a library of engineered base editor variants to enable head-to-head evaluation of their editing performance at scale. Our comprehensive comparison provides quantitative measures on each variant's editing efficiency, purity, motif preference, and bias in generating single and multiple base conversions, while uncovering undesired higher indel generation rate and noncanonical base conversion for some of the existing base editors.
View Article and Find Full Text PDFThe genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Considering only the residues in proximity to the target DNA as potential sites to optimise Cas9's activity, the number of combinatorial variants to screen through is too massive for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as high as 95% while enriching top-performing variants by ∼7.
View Article and Find Full Text PDFThe Cas9 nuclease from Staphylococcus aureus (SaCas9) holds great potential for use in gene therapy, and variants with increased fidelity have been engineered. However, we find that existing variants have not reached the greatest accuracy to discriminate base mismatches and exhibited much reduced activity when their mutations were grafted onto the KKH mutant of SaCas9 for editing an expanded set of DNA targets. We performed structure-guided combinatorial mutagenesis to re-engineer KKH-SaCas9 with enhanced accuracy.
View Article and Find Full Text PDFAdv Genet (Hoboken)
December 2021
Protein design plays an important role in recent medical advances from antibody therapy to vaccine design. Typically, exhaustive mutational screens or directed evolution experiments are used for the identification of the best design or for improvements to the wild-type variant. Even with a high-throughput screening on pooled libraries and Next-Generation Sequencing to boost the scale of read-outs, surveying all the variants with combinatorial mutations for their empirical fitness scores is still of magnitudes beyond the capacity of existing experimental settings.
View Article and Find Full Text PDFSystematic testing of existing drugs and their combinations is an attractive strategy to exploit approved drugs for repurposing and identifying the best actionable treatment options. To expedite the search among many possible drug combinations, we designed a combinatorial CRISPR-Cas9 screen to inhibit druggable targets. Coblockade of the N-methyl-d-aspartate receptor (NMDAR) with targets of first-line kinase inhibitors reduced hepatocellular carcinoma (HCC) cell growth.
View Article and Find Full Text PDFAn amendment to this paper has been published and can be accessed via a link at the top of the paper.
View Article and Find Full Text PDFThe combined effect of multiple mutations on protein function is hard to predict; thus, the ability to functionally assess a vast number of protein sequence variants would be practically useful for protein engineering. Here we present a high-throughput platform that enables scalable assembly and parallel characterization of barcoded protein variants with combinatorial modifications. We demonstrate this platform, which we name CombiSEAL, by systematically characterizing a library of 948 combination mutants of the widely used Streptococcus pyogenes Cas9 (SpCas9) nuclease to optimize its genome-editing activity in human cells.
View Article and Find Full Text PDFBackground: Recombination is widespread across the tree of life, because it helps purge deleterious mutations and creates novel adaptive traits. In prokaryotes, it often takes the form of horizontal gene transfer from a donor to a recipient bacterium. While such transfer is widespread in natural communities, its immediate fitness benefits are usually unknown.
View Article and Find Full Text PDFGene clusters for the synthesis of secondary metabolites are a common feature of microbial genomes. Well-known examples include clusters for the synthesis of antibiotics in actinomycetes, and also for the synthesis of antibiotics and toxins in filamentous fungi. Until recently it was thought that genes for plant metabolic pathways were not clustered, and this is certainly true in many cases; however, five plant secondary metabolic gene clusters have now been discovered, all of them implicated in synthesis of defence compounds.
View Article and Find Full Text PDF