The structures and activities of enzymes are influenced by pH of the environment. Understanding and distinguishing the adaptation mechanisms of enzymes to extreme pH values is of great significance for elucidating the molecular mechanisms and promoting the industrial applications of enzymes. In this study, the ESM-2 protein language model was used to encode the secreted microbial proteins with the optimal performance above pH 9 and below pH 5, which yielded 47 725 high-pH protein sequences and 66 079 low-pH protein sequences, respectively. A deep learning model was constructed to identify protein acid-base tolerance based on amino acid sequences. The model showcased significantly higher accuracy than other methods, with the overall accuracy of 94.8%, precision of 91.8%, and a recall rate of 93.4% on the test set. Furthermore, we built a website (https://enzymepred.biodesign.ac.cn), which enabled users to predict the acid-base tolerance by submitting the protein sequences of enzymes. This study has accelerated the application of enzymes in various fields, including biotechnology, pharmaceuticals, and chemicals. It provides a powerful tool for the rapid screening and optimization of industrial enzymes.

Download full-text PDF

Source
http://dx.doi.org/10.13345/j.cjb.240255DOI Listing

Publication Analysis

Top Keywords

acid-base tolerance
12
protein sequences
12
protein acid-base
8
enzymes study
8
enzymes
6
protein
5
[acidbasepred protein
4
tolerance prediction
4
prediction platform
4
platform based
4

Similar Publications

Enterococcus species, natural inhabitants of the human gut, have become major causes of life-threatening bloodstream infections (BSIs) and the third most frequent cause of hospital-acquired bacteremia. The rise of high-level gentamicin resistance (HLGR) in enterococcal isolates complicates treatment and revives bacteriophage therapy. This study isolated and identified forty E.

View Article and Find Full Text PDF

An N,N,N-type Cu(Ⅱ) complex-catalyzed desaturation method for converting alcohols, ketones, lactones, and lactams to their α,β-unsaturated carbonyl compounds is reported. The dehydrogenation reaction can be conducted with a green terminal oxidant O2 without requiring strong acid/base or stoichiometric oxidants. The Cu(Ⅱ) complex/TEMPO/O2 system uses a non-noble catalyst, and a green terminal oxidant as well as demonstrates high activity and functional group tolerance.

View Article and Find Full Text PDF

Entropy engineering activation of UiO-66 for boosting catalytic transfer hydrogenation.

Nat Commun

January 2025

State Key Laboratory of Inorganic Synthesis and Preparative Chemistry, College of Chemistry, Jilin University, 130012, Changchun, P. R. China.

High-entropy metal-organic frameworks (HE-MOFs) hold promise as versatile materials, yet current rare examples are confined to low-valence elements in the fourth period, constraining their design and optimization for diverse applications. Here, a novel high-entropy, defect-rich and small-sized (32 nm) UiO-66 (ZrHfCeSnTi HE-UiO-66) has been synthesized for the first time, leveraging increased configurational entropy to achieve high tolerance to doping with diverse metal ions. The lattice distortion of HE-UiO-66 induces high exposure of metal nodes to create coordination unsaturated metal sites with a concentration of 322.

View Article and Find Full Text PDF

Superoxide dismutase (SOD) plays important roles in the balance of oxidation and antioxidation in body mostly by scavenging superoxide anion free radicals (O). Previously, we reported a novel Cu/Zn SOD from jellyfish Cyanea capillata, named CcSOD1, which exhibited excellent SOD activity and high stability. TAT peptide is a common type of cell penetrating peptides (CPPs) that efficiently deliver extracellular biomacromolecules into cytoplasm.

View Article and Find Full Text PDF

The structures and activities of enzymes are influenced by pH of the environment. Understanding and distinguishing the adaptation mechanisms of enzymes to extreme pH values is of great significance for elucidating the molecular mechanisms and promoting the industrial applications of enzymes. In this study, the ESM-2 protein language model was used to encode the secreted microbial proteins with the optimal performance above pH 9 and below pH 5, which yielded 47 725 high-pH protein sequences and 66 079 low-pH protein sequences, respectively.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!