GLDM: hit molecule generation with constrained graph latent diffusion model.

Brief Bioinform

School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Ave, 639798, Singapore.

Published: March 2024

Discovering hit molecules with desired biological activity in a directed manner is a promising but challenging task in computer-aided drug discovery. Inspired by recent generative AI approaches, particularly Diffusion Models (DM), we propose the Graph Latent Diffusion Model (GLDM), a latent DM that preserves both the effectiveness of autoencoders in compressing complex chemical data and the DM's capability to generate novel molecules. Specifically, we first develop an autoencoder to encode the molecular data into low-dimensional latent representations and then train the DM on the latent space to generate molecules that induce targeted biological activity defined by gene expression profiles. Operating the DM in the latent space rather than the input space avoids complicated operations to map molecule decomposition and reconstruction to diffusion processes, and thus improves training efficiency. Experiments show that GLDM not only achieves outstanding performance on molecular generation benchmarks, but also generates samples with optimal chemical properties and the potential to induce desired biological activity.
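To make the idea of diffusing in the latent space concrete, the following is a minimal sketch of the standard DDPM-style forward (noising) step applied to latent vectors. It is not taken from the paper: the linear noise schedule, the number of steps, the latent dimension of 64, and the function names make_alpha_bar and noise_latent are all assumptions chosen for illustration, and the random array z0 merely stands in for latents that a trained molecular autoencoder would produce. Conditioning on gene expression profiles is omitted.

```python
import numpy as np

def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal-retention coefficients for a linear beta schedule.

    The schedule and its endpoints are illustrative assumptions,
    not values reported for GLDM.
    """
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def noise_latent(z0, t, alpha_bar, rng):
    """Sample z_t ~ q(z_t | z_0) = N(sqrt(a_t) * z_0, (1 - a_t) * I)."""
    eps = rng.standard_normal(z0.shape)
    a_t = alpha_bar[t]
    z_t = np.sqrt(a_t) * z0 + np.sqrt(1.0 - a_t) * eps
    return z_t, eps

rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()

# Hypothetical batch of 4 latent vectors; in practice these would come
# from the encoder of a trained molecular autoencoder.
z0 = rng.standard_normal((4, 64))

# Noise the latents at an intermediate timestep. A denoising network would
# be trained to predict `eps` from `z_t` (and, for conditional generation,
# from a conditioning signal such as a gene expression profile).
z_t, eps = noise_latent(z0, t=500, alpha_bar=alpha_bar, rng=rng)
```

Because the noising acts on fixed-size continuous vectors, no graph-specific corruption or reconstruction rule is needed at this stage, which is the efficiency argument the abstract makes for working in the latent space.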

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10998532
DOI: http://dx.doi.org/10.1093/bib/bbae142
