Recent years have seen tremendous success in the design of novel drug molecules through deep generative models. Nevertheless, existing methods generate only drug-like molecules, which require additional structural optimization before they can be developed into actual drugs. This study proposes a deep learning method for generating target-specific ligands that remains useful when the dataset of target-specific ligands is limited. Deep learning methods can extract and learn features (representations) in a data-driven way with little or no human participation. A generative pre-trained transformer (GPT) was used to extract the contextual features of the molecule. Three different protein-encoding methods were used to extract the physicochemical properties and amino acid information of the target protein. The protein encoding and the molecular sequence information were combined to guide molecule generation. Transfer learning was used to fine-tune the pretrained model so that it generates molecules with better binding ability to the target protein. The model was validated on three different targets. The docking results show that the model is capable of generating new molecules with higher docking scores for the target proteins.
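
As a rough illustration of what encoding the amino acid information and a physicochemical property of a target protein can look like, the sketch below builds a 21-dimensional vector from a sequence: the 20 amino acid frequencies plus the mean Kyte-Doolittle hydropathy. This is an assumption made for illustration and is not one of the three encodings used in the study; the function names `composition`, `mean_hydropathy`, and `encode_protein` are hypothetical.

```python
# Illustrative sketch only: NOT the protein encodings used in the study.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KD_HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def _standard_residues(seq):
    """Return the standard residues of seq, ignoring any other characters."""
    residues = [aa for aa in seq.upper() if aa in KD_HYDROPATHY]
    if not residues:
        raise ValueError("sequence contains no standard amino acids")
    return residues


def composition(seq):
    """Fraction of each of the 20 amino acids, in AMINO_ACIDS order."""
    residues = _standard_residues(seq)
    return [residues.count(aa) / len(residues) for aa in AMINO_ACIDS]


def mean_hydropathy(seq):
    """Average Kyte-Doolittle hydropathy over the sequence."""
    residues = _standard_residues(seq)
    return sum(KD_HYDROPATHY[aa] for aa in residues) / len(residues)


def encode_protein(seq):
    """Concatenate composition (20 values) and mean hydropathy (1 value)."""
    return composition(seq) + [mean_hydropathy(seq)]
```

A vector of this kind could be supplied to a generative model as conditioning input alongside the molecular sequence; how the study actually combines the two is not specified in the abstract.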

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9865828
DOI: http://dx.doi.org/10.3390/ijms24021146
