Publications by Hoi Yee Chu

Publications by authors named "Hoi Yee Chu"

Page 1 of 1

Quantifying Protein-Nucleic Acid Interactions for Engineering Useful CRISPR-Cas9 Genome-Editing Variants.

Hoi Yee Chu Jiaxing Peng Yuanbiao Mou Alan S L Wong

Methods Mol Biol

November 2024

Numerous high-specificity Cas9 variants have been engineered for precision genome editing. These variants typically harbor multiple mutations designed to alter the Cas9-single guide RNA (sgRNA)-DNA complex interactions for reduced off-target cleavage. By dissecting the contributions of individual mutations, we attempt to derive principles for designing high-specificity Cas9 variants.

View Article and Find Full Text PDF

Structural insights into the assembly pathway of the Helicobacter pylori CagT4SS outer membrane core complex.

Chin Yu Mok Hoi Yee Chu Wendy Wai Ling Lam Shannon Wing Ngor Au

Structure

October 2024

Cag type IV secretion system (CagT4SS) translocates oncoprotein cytotoxin-associated gene A (CagA) into host cells and plays a key role in the pathogenesis of Helicobacter pylori. The structure of the outer membrane core complex (OMCC) in CagT4SS consists of CagX, CagY, CagM, CagT, and Cag3 in a stoichiometric ratio of 1:1:2:2:5 with 14-fold symmetry. However, the assembly pathway of OMCC remains elusive.

View Article and Find Full Text PDF

Accurate top protein variant discovery via low-N pick-and-validate machine learning.

Hoi Yee Chu John H C Fong Dawn G L Thean Peng Zhou Frederic K C Fung

Cell Syst

February 2024

A strategy to obtain the greatest number of best-performing variants with least amount of experimental effort over the vast combinatorial mutational landscape would have enormous utility in boosting resource producibility for protein engineering. Toward this goal, we present a simple and effective machine learning-based strategy that outperforms other state-of-the-art methods. Our strategy integrates zero-shot prediction and multi-round sampling to direct active learning via experimenting with only a few predicted top variants.

View Article and Find Full Text PDF

Author Correction: High-throughput screening of genetic and cellular drivers of syncytium formation induced by the spike protein of SARS-CoV-2.

Charles W F Chan Bei Wang Lang Nan Xiner Huang Tianjiao Mao Hoi Yee Chu Hin Chu

Nat Biomed Eng

March 2024

View Article and Find Full Text PDF

High-throughput screening of genetic and cellular drivers of syncytium formation induced by the spike protein of SARS-CoV-2.

Charles W F Chan Bei Wang Lang Nan Xiner Huang Tianjiao Mao Hoi Yee Chu Hin Chu

Nat Biomed Eng

March 2024

Mapping mutations and discovering cellular determinants that cause the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to induce infected cells to form syncytia would facilitate the development of strategies for blocking the formation of such cell-cell fusion. Here we describe high-throughput screening methods based on droplet microfluidics and the size-exclusion selection of syncytia, coupled with large-scale mutagenesis and genome-wide knockout screening via clustered regularly interspaced short palindromic repeats (CRISPR), for the large-scale identification of determinants of cell-cell fusion. We used the methods to perform deep mutational scans in spike-presenting cells to pinpoint mutable syncytium-enhancing substitutions in two regions of the spike protein (the fusion peptide proximal region and the furin-cleavage site).

View Article and Find Full Text PDF

Parallel engineering and activity profiling of a base editor system.

John H C Fong Hoi Yee Chu Peng Zhou Alan S L Wong

Cell Syst

May 2023

Selecting the most suitable existing base editors and engineering new variants for installing specific base conversions with maximal efficiency and minimal undesired edits are pivotal for precise genome editing applications. Here, we present a platform for creating and analyzing a library of engineered base editor variants to enable head-to-head evaluation of their editing performance at scale. Our comprehensive comparison provides quantitative measures on each variant's editing efficiency, purity, motif preference, and bias in generating single and multiple base conversions, while uncovering undesired higher indel generation rate and noncanonical base conversion for some of the existing base editors.

View Article and Find Full Text PDF

Machine learning-coupled combinatorial mutagenesis enables resource-efficient engineering of CRISPR-Cas9 genome editor activities.

Dawn G L Thean Hoi Yee Chu John H C Fong Becky K C Chan Peng Zhou

Nat Commun

April 2022

The genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Considering only the residues in proximity to the target DNA as potential sites to optimise Cas9's activity, the number of combinatorial variants to screen through is too massive for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as high as 95% while enriching top-performing variants by ∼7.

View Article and Find Full Text PDF

High-fidelity KKH variant of Staphylococcus aureus Cas9 nucleases with improved base mismatch discrimination.

Chaya T L Yuen Dawn G L Thean Becky K C Chan Peng Zhou Cynthia C S Kwok Hoi Yee Chu

Nucleic Acids Res

February 2022

The Cas9 nuclease from Staphylococcus aureus (SaCas9) holds great potential for use in gene therapy, and variants with increased fidelity have been engineered. However, we find that existing variants have not reached the greatest accuracy to discriminate base mismatches and exhibited much reduced activity when their mutations were grafted onto the KKH mutant of SaCas9 for editing an expanded set of DNA targets. We performed structure-guided combinatorial mutagenesis to re-engineer KKH-SaCas9 with enhanced accuracy.

View Article and Find Full Text PDF

Facilitating Machine Learning-Guided Protein Engineering with Smart Library Design and Massively Parallel Assays.

Hoi Yee Chu Alan S L Wong

Adv Genet (Hoboken)

December 2021

Protein design plays an important role in recent medical advances from antibody therapy to vaccine design. Typically, exhaustive mutational screens or directed evolution experiments are used for the identification of the best design or for improvements to the wild-type variant. Even with a high-throughput screening on pooled libraries and Next-Generation Sequencing to boost the scale of read-outs, surveying all the variants with combinatorial mutations for their empirical fitness scores is still of magnitudes beyond the capacity of existing experimental settings.

View Article and Find Full Text PDF

A Combinatorial CRISPR-Cas9 Screen Identifies Ifenprodil as an Adjunct to Sorafenib for Liver Cancer Treatment.

Feng Xu Man Tong Cindy S W Tong Becky K C Chan Hoi Yee Chu

Cancer Res

December 2021

Systematic testing of existing drugs and their combinations is an attractive strategy to exploit approved drugs for repurposing and identifying the best actionable treatment options. To expedite the search among many possible drug combinations, we designed a combinatorial CRISPR-Cas9 screen to inhibit druggable targets. Coblockade of the N-methyl-d-aspartate receptor (NMDAR) with targets of first-line kinase inhibitors reduced hepatocellular carcinoma (HCC) cell growth.

View Article and Find Full Text PDF

Publisher Correction: Combinatorial mutagenesis en masse optimizes the genome editing activities of SpCas9.

Gigi C G Choi Peng Zhou Chaya T L Yuen Becky K C Chan Feng Xu Hoi Yee Chu

Nat Methods

August 2019

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

View Article and Find Full Text PDF

Combinatorial mutagenesis en masse optimizes the genome editing activities of SpCas9.

Gigi C G Choi Peng Zhou Chaya T L Yuen Becky K C Chan Feng Xu Hoi Yee Chu

Nat Methods

August 2019

The combined effect of multiple mutations on protein function is hard to predict; thus, the ability to functionally assess a vast number of protein sequence variants would be practically useful for protein engineering. Here we present a high-throughput platform that enables scalable assembly and parallel characterization of barcoded protein variants with combinatorial modifications. We demonstrate this platform, which we name CombiSEAL, by systematically characterizing a library of 948 combination mutants of the widely used Streptococcus pyogenes Cas9 (SpCas9) nuclease to optimize its genome-editing activity in human cells.

View Article and Find Full Text PDF

Assessing the benefits of horizontal gene transfer by laboratory evolution and genome sequencing.

Hoi Yee Chu Kathleen Sprouffske Andreas Wagner

BMC Evol Biol

April 2018

Background: Recombination is widespread across the tree of life, because it helps purge deleterious mutations and creates novel adaptive traits. In prokaryotes, it often takes the form of horizontal gene transfer from a donor to a recipient bacterium. While such transfer is widespread in natural communities, its immediate fitness benefits are usually unknown.

View Article and Find Full Text PDF

From hormones to secondary metabolism: the emergence of metabolic gene clusters in plants.

Hoi Yee Chu Eva Wegel Anne Osbourn

Plant J

April 2011

Gene clusters for the synthesis of secondary metabolites are a common feature of microbial genomes. Well-known examples include clusters for the synthesis of antibiotics in actinomycetes, and also for the synthesis of antibiotics and toxins in filamentous fungi. Until recently it was thought that genes for plant metabolic pathways were not clustered, and this is certainly true in many cases; however, five plant secondary metabolic gene clusters have now been discovered, all of them implicated in synthesis of defence compounds.

View Article and Find Full Text PDF