Publications by authors named "Jike Wang"

Large language models (LLMs) have shown remarkable advancements in chemistry and biomedical research, acting as versatile foundation models for various tasks. We introduce AMP-Designer, an LLM-based approach, for swiftly designing antimicrobial peptides (AMPs) with desired properties. Within 11 days, AMP-Designer achieved the de novo design of 18 AMPs with broad-spectrum activity against Gram-negative bacteria.

View Article and Find Full Text PDF

Accurate estimation of protein-ligand (PL) binding free energies is a crucial task in medicinal chemistry and a critical measure of PL interaction modeling effectiveness. However, traditional computational methods are often computationally expensive and prone to errors. Recently, deep learning (DL)-based approaches for predicting PL interactions have gained enormous attention, but their accuracy and generalizability are hindered by data scarcity.

View Article and Find Full Text PDF

The generation of three-dimensional (3D) molecules based on target structures represents a cutting-edge challenge in drug discovery. Many existing approaches often produce molecules with invalid configurations, unphysical conformations, suboptimal drug-like qualities, limited synthesizability, and require extensive generation times. To address these challenges, we present 3DSMILES-GPT, a fully language-model-driven framework for 3D molecular generation that utilizes tokens exclusively.

View Article and Find Full Text PDF

Despite the significant potential of generative models, low synthesizability of many generated molecules limits their real-world applications. In response to this issue, we develop ClickGen, a deep learning model that utilizes modular reactions like click chemistry to assemble molecules and incorporates reinforcement learning along with inpainting technique to ensure that the proposed molecules display high diversity, novelty and strong binding tendency. ClickGen demonstrates superior performance over the other reaction-based generative models in terms of novelty, synthesizability, and docking conformation similarity for existing binders targeting the three proteins.

View Article and Find Full Text PDF
Article Synopsis
  • A TOPAS-based optimization program has been created to accurately focus very-high-energy electron (VHEE) beams on deep-seated targets by adjusting magnetic gradients and the arrangement of quadrupole magnets.
  • The program uses three quadrupole magnets to achieve a maximum dose position deeper than 17 cm while keeping entrance and exit doses within 25% and limiting lateral size to 1 cm at the highest dose location.
  • It also combines two VHEE beams to enhance dose distribution across a target depth of 12-17 cm, achieving high dose flatness and may be useful for advanced treatment plans in radiation therapy.
View Article and Find Full Text PDF

Molecular generation stands at the forefront of AI-driven technologies, playing a crucial role in accelerating the development of small molecule drugs. The intricate nature of practical drug discovery necessitates the development of a versatile molecular generation framework that can tackle diverse drug design challenges. However, existing methodologies often struggle to encompass all aspects of small molecule drug design, particularly those rooted in language models, especially in tasks like linker design, due to the autoregressive nature of large language model-based approaches.

View Article and Find Full Text PDF

Annotating active sites in enzymes is crucial for advancing multiple fields including drug discovery, disease research, enzyme engineering, and synthetic biology. Despite the development of numerous automated annotation algorithms, a significant trade-off between speed and accuracy limits their large-scale practical applications. We introduce EasIFA, an enzyme active site annotation algorithm that fuses latent enzyme representations from the Protein Language Model and 3D structural encoder, and then aligns protein-level information with the knowledge of enzymatic reactions using a multi-modal cross-attention framework.

View Article and Find Full Text PDF

Constructing high-entropy alloys (HEAs) with core-shell (CS) nanostructure is efficient for enhancing catalytic activity. However, it is extremely challenging to incorporate the CS structure with HEAs. Herein, PtCoNiMoRh@Rh CS nanoparticles (PtCoNiMoRh@Rh) with ∼5.

View Article and Find Full Text PDF

The Circular Electron-Positron Collider (CEPC) in China can also work as an excellent powerful synchrotron light source, which can generate high-quality synchrotron radiation. This synchrotron radiation has potential advantages in the medical field as it has a broad spectrum, with energies ranging from visible light to X-rays used in conventional radiotherapy, up to several megaelectronvolts. FLASH radiotherapy is one of the most advanced radiotherapy modalities.

View Article and Find Full Text PDF

Protein loop modeling is a challenging yet highly nontrivial task in protein structure prediction. Despite recent progress, existing methods including knowledge-based, ab initio, hybrid, and deep learning (DL) methods fall substantially short of either atomic accuracy or computational efficiency. To overcome these limitations, we present KarmaLoop, a novel paradigm that distinguishes itself as the first DL method centered on full-atom (encompassing both backbone and side-chain heavy atoms) protein loop modeling.

View Article and Find Full Text PDF

The optimization of therapeutic antibodies through traditional techniques, such as candidate screening via hybridoma or phage display, is resource-intensive and time-consuming. In recent years, computational and artificial intelligence-based methods have been actively developed to accelerate and improve the development of therapeutic antibodies. In this study, we developed an end-to-end sequence-based deep learning model, termed AttABseq, for the predictions of the antigen-antibody binding affinity changes connected with antibody mutations.

View Article and Find Full Text PDF

Artificial intelligence (AI)-aided drug design has demonstrated unprecedented effects on modern drug discovery, but there is still an urgent need for user-friendly interfaces that bridge the gap between these sophisticated tools and scientists, particularly those who are less computer savvy. Herein, we present DrugFlow, an AI-driven one-stop platform that offers a clean, convenient, and cloud-based interface to streamline early drug discovery workflows. By seamlessly integrating a range of innovative AI algorithms, covering molecular docking, quantitative structure-activity relationship modeling, molecular generation, ADMET (absorption, distribution, metabolism, excretion and toxicity) prediction, and virtual screening, DrugFlow can offer effective AI solutions for almost all crucial stages in early drug discovery, including hit identification and hit/lead optimization.

View Article and Find Full Text PDF

Organic frameworks face a trade-off between the framework stability and the bond dynamics, which necessitates the development of innovative linkages that can generate stable frameworks without hindering efficient synthesis. Although iodine(I)-based halogen-bonded organic frameworks (XOFs) have been developed, constructing XOFs based on bromine(I) is desirable yet challenging due to the high sensitivity of bromine(I) species. In this work, we present the inaugural construction of stable bromine(I)-bridged two-dimensional (2D) halogen-bonded organic frameworks, XOF(Br)-TPy-BF/OTf, based on sensitive [N⋅⋅⋅Br⋅⋅⋅N] halogen bonds.

View Article and Find Full Text PDF

Ribonucleic acid (RNA)-ligand interactions play a pivotal role in a wide spectrum of biological processes, ranging from protein biosynthesis to cellular reproduction. This recognition has prompted the broader acceptance of RNA as a viable candidate for drug targets. Delving into the atomic-scale understanding of RNA-ligand interactions holds paramount importance in unraveling intricate molecular mechanisms and further contributing to RNA-based drug discovery.

View Article and Find Full Text PDF

Deep learning-based molecular design has recently gained significant attention. While numerous DL-based generative models have been successfully developed for designing novel compounds, the majority of the generated molecules lack sufficiently novel scaffolds or high drug-like profiles. The aforementioned issues may not be fully captured by commonly used metrics for the assessment of molecular generative models, such as novelty, diversity, and quantitative estimation of the drug-likeness score.

View Article and Find Full Text PDF

Immunotherapy is a promising mainstream approach in anti-tumor therapy. It boasts advantages such as durable responses and lower side effects. However, there are still some limitations to be addressed.

View Article and Find Full Text PDF
Article Synopsis
  • Protein loops are vital for protein dynamics and various biological processes, yet there's a lack of comprehensive evaluation on loop modeling methods.
  • Researchers created two datasets to assess the accuracy and efficiency of 13 loop modeling approaches based on factors like loop length and protein types.
  • The knowledge-based method FREAD generally performed best, while Rosetta NGK excelled with short loops; AlphaFold2 and RoseTTAFold showed promise for longer loops but require more resources.
View Article and Find Full Text PDF

Effective synthesis planning powered by deep learning (DL) can significantly accelerate the discovery of new drugs and materials. However, most DL-assisted synthesis planning methods offer either none or very limited capability to recommend suitable reaction conditions (RCs) for their reaction predictions. Currently, the prediction of RCs with a DL framework is hindered by several factors, including: (a) lack of a standardized dataset for benchmarking, (b) lack of a general prediction model with powerful representation, and (c) lack of interpretability.

View Article and Find Full Text PDF

Target identification and bioactivity prediction are critical steps in the drug discovery process. Here we introduce CODD-Pred (COmprehensive Drug Design Predictor), an online web server with well-curated data sets from the GOSTAR database, which is designed with a dual purpose of predicting potential protein drug targets and computing bioactivity values of small molecules. We first designed a double molecular graph perception (DMGP) framework for target prediction based on a large library of 646 498 small molecules interacting with 640 human targets.

View Article and Find Full Text PDF

Nucleic acid (NA)-ligand interactions are of paramount importance in a variety of biological processes, including cellular reproduction and protein biosynthesis, and therefore, NAs have been broadly recognized as potential drug targets. Understanding NA-ligand interactions at the atomic scale is essential for investigating the molecular mechanism and further assisting in NA-targeted drug discovery. Molecular docking is one of the predominant computational approaches for predicting the interactions between NAs and small molecules.

View Article and Find Full Text PDF

Recently, deep generative models have been regarded as promising tools in fragment-based drug design (FBDD). Despite the growing interest in these models, they still face challenges in generating molecules with desired properties in low data regimes. In this study, we propose a novel flow-based autoregressive model named FFLOM for linker and R-group design.

View Article and Find Full Text PDF

Machine-learning-based scoring functions (MLSFs) have gained attention for their potential to improve accuracy in binding affinity prediction and structure-based virtual screening (SBVS) compared to classical SFs. Developing accurate MLSFs for SBVS requires a large and unbiased dataset that includes structurally diverse actives and decoys. Unfortunately, most datasets suffer from hidden biases and data insufficiency.

View Article and Find Full Text PDF

In the past few years, a number of machine learning (ML)-based molecular generative models have been proposed for generating molecules with desirable properties, but they all require a large amount of label data of pharmacological and physicochemical properties. However, experimental determination of these labels, especially bioactivity labels, is very expensive. In this study, we analyze the dependence of various multi-property molecule generation models on biological activity label data and propose Frag-G/M, a fragment-based multi-constraint molecular generation framework based on conditional transformer, recurrent neural networks (RNNs), and reinforcement learning (RL).

View Article and Find Full Text PDF

Graph neural networks (GNNs) have been widely used in molecular property prediction, but explaining their black-box predictions is still a challenge. Most existing explanation methods for GNNs in chemistry focus on attributing model predictions to individual nodes, edges or fragments that are not necessarily derived from a chemically meaningful segmentation of molecules. To address this challenge, we propose a method named substructure mask explanation (SME).

View Article and Find Full Text PDF

With the introduction of carbon divacancy, trivacancy, and tetravacancy defects near the Co-N site, we have explored the 2e ORR activity at the Co-N site from the perspective of spatial structure and the atomic orbital by DFT calculations. We demonstrate the hybridization strength between Co 3dyz (3dxz) and O 2py (2px) orbitals is the origin of 2e ORR activity at the Co-N site and the hybridization strength relates to the height of the Co 3d projected orbital in the Z direction. The bond length (L, L), the charge transfer from the Co site to the *OOH adsorbate (ΔQ), the d-band center of the Co site (ε), and the ICOHP value between Co 3d and O 2p orbitals as descriptors can well predict the 2e ORR activity at the Co-N site.

View Article and Find Full Text PDF

A PHP Error was encountered

Severity: Warning

Message: fopen(/var/lib/php/sessions/ci_session1mrht0p7abk9bi644h7c7skfee87mmhq): Failed to open stream: No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 177

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once

A PHP Error was encountered

Severity: Warning

Message: session_start(): Failed to read session data: user (path: /var/lib/php/sessions)

Filename: Session/Session.php

Line Number: 137

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once