Publications by authors named "Francesca-Zhoufan Li"

Sequence-function data provides valuable information about the protein functional landscape but is rarely obtained during directed evolution campaigns. Here, we present Long-read every variant Sequencing (LevSeq), a pipeline that combines a dual barcoding strategy with nanopore sequencing to rapidly generate sequence-function data for entire protein-coding genes. LevSeq integrates into existing protein engineering workflows and comes with open-source software for data analysis and visualization.

View Article and Find Full Text PDF

Enzymes can be engineered at the level of their amino acid sequences to optimize key properties such as expression, stability, substrate range, and catalytic efficiency-or even to unlock new catalytic activities not found in nature. Because the search space of possible proteins is vast, enzyme engineering usually involves discovering an enzyme starting point that has some level of the desired activity followed by directed evolution to improve its "fitness" for a desired application. Recently, machine learning (ML) has emerged as a powerful tool to complement this empirical process.

View Article and Find Full Text PDF

With advances in machine learning (ML)-assisted protein engineering, models based on data, biophysics, and natural evolution are being used to propose informed libraries of protein variants to explore. Synthesizing these libraries for experimental screens is a major bottleneck, as the cost of obtaining large numbers of exact gene sequences is often prohibitive. Degenerate codon (DC) libraries are a cost-effective alternative for generating combinatorial mutagenesis libraries where mutations are targeted to a handful of amino acid sites.

View Article and Find Full Text PDF
Article Synopsis
  • Antibody responses are crucial for defending against SARS-CoV-2 by stopping the virus from entering cells, and a new assay called 2D-MBBA has been developed to measure various antibody isotypes simultaneously.
  • This assay was used to analyze IgG, IgM, and IgA levels against the spike protein and its variants, and machine learning significantly improved predictions of how well these antibodies neutralize the virus in convalescent patients.
  • The method can differentiate between antibody profiles in convalescent and vaccinated individuals and offers the potential for rapid testing of neutralization efficacy against new variants and pathogens using just a small blood sample.
View Article and Find Full Text PDF

Optimizing microbial hosts for the large-scale production of valuable metabolites often requires multiple mutations and modifications to the host's genome. We describe a three-round screen for increased L-DOPA production in S. cerevisiae using FACS enrichment of an enzyme-coupled biosensor for L-DOPA.

View Article and Find Full Text PDF