Publications by authors named "Fajie Yuan"

High-throughput DNA sequencing technologies decode tremendous amounts of microbial protein-coding gene sequences. However, accurately assigning protein functions to novel gene sequences remain a challenge. To this end, we developed FunGeneTyper, an extensible framework with two new deep learning models (i.

View Article and Find Full Text PDF

Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using PLMs. We describe steps for "zero-shot" enzyme optimization, construction of plasmids, double plasmid transfection, and high-throughput sequencing and data analysis.

View Article and Find Full Text PDF

Current base editors (BEs) use DNA deaminases, including cytidine deaminase in cytidine BE (CBE) or adenine deaminase in adenine BE (ABE), to facilitate transition nucleotide substitutions. Combining CBE or ABE with glycosylase enzymes can induce limited transversion mutations. Nonetheless, a critical demand remains for BEs capable of generating alternative mutation types, such as T>G corrections.

View Article and Find Full Text PDF