Small molecule generation via disentangled representation learning.

Bioinformatics

Department of Computer Science, Emory University, Atlanta, GA 30322, USA.

Published: June 2022

AI Article Synopsis

  • This text discusses the potential of expanding knowledge in small molecules for advancements in fields like drug discovery and biotechnology through improved molecular design techniques.
  • It highlights the challenges in understanding the complex relationship between chemical structures and biological properties, with a focus on using deep graph generative frameworks to enhance model interpretability in small molecule generation.
  • The authors present their disentangled representation learning framework, which has shown superior performance compared to existing models in generating biologically relevant small molecules, and provide resources for data and code availability.

Article Abstract

Motivation: Expanding our knowledge of small molecules beyond what is known in nature or designed in wet laboratories promises to significantly advance cheminformatics, drug discovery, biotechnology and material science. In silico molecular design remains challenging, primarily due to the complexity of the chemical space and the non-trivial relationship between chemical structures and biological properties. Deep generative models that learn directly from data are intriguing, but they have yet to demonstrate interpretability in the learned representation, so we can learn more about the relationship between the chemical and biological space. In this article, we advance research on disentangled representation learning for small molecule generation. We build on recent work by us and others on deep graph generative frameworks, which capture atomic interactions via a graph-based representation of a small molecule. The methodological novelty is how we leverage the concept of disentanglement in the graph variational autoencoder framework both to generate biologically relevant small molecules and to enhance model interpretability.

Results: Extensive qualitative and quantitative experimental evaluation in comparison with state-of-the-art models demonstrate the superiority of our disentanglement framework. We believe this work is an important step to address key challenges in small molecule generation with deep generative frameworks.

Availability And Implementation: Training and generated data are made available at https://ieee-dataport.org/documents/dataset-disentangled-representation-learning-interpretable-molecule-generation. All code is made available at https://anonymous.4open.science/r/D-MolVAE-2799/.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btac296DOI Listing

Publication Analysis

Top Keywords

small molecule
16
molecule generation
12
disentangled representation
8
representation learning
8
small molecules
8
relationship chemical
8
deep generative
8
small
6
generation disentangled
4
representation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!