Exhaustive local chemical space exploration using a transformer model.

Nat Commun

Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.

Published: August 2024

How many near-neighbors does a molecule have? This fundamental question in chemistry is crucial for molecular optimization problems under the similarity principle assumption. Generative models can sample molecules from a vast chemical space but lack explicit knowledge about molecular similarity. Therefore, these models need guidance from reinforcement learning to sample a relevant similar chemical space. However, they still miss a mechanism to measure the coverage of a specific region of the chemical space. To overcome these limitations, a source-target molecular transformer model, regularized via a similarity kernel function, is proposed. Trained on a largest dataset of ≥200 billion molecular pairs, the model enforces a direct relationship between generating a target molecule and its similarity to a source molecule. Results indicate that the regularization term significantly improves the correlation between generation probability and molecular similarity, enabling exhaustive exploration of molecule near-neighborhoods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11345417PMC
http://dx.doi.org/10.1038/s41467-024-51672-4DOI Listing

Publication Analysis

Top Keywords

chemical space
16
transformer model
8
molecular similarity
8
molecular
5
similarity
5
exhaustive local
4
chemical
4
local chemical
4
space
4
space exploration
4

Similar Publications

Concomitant formation of protocells and prebiotic compounds under a plausible early Earth atmosphere.

Proc Natl Acad Sci U S A

January 2025

Laboratory of Crystallographic Studies, Instituto Andaluz de Ciencias de la Tierra, Consejo Superior de Investigaciones Científica, Armilla 18100, Spain.

Revealing the origin of life and unambiguously detecting fossil remains of the earliest organisms are closely related aspects of the same scientific research. The synthesis of prebiotic molecular building blocks of life and the first compartmentalization into protocells have been considered two events apart in time, space, or both. We conducted lightning experiments in borosilicate reactors filled with a mixture of gases mimicking plausible geochemical conditions of early Earth.

View Article and Find Full Text PDF

A hybrid meta on-top functional for multiconfiguration pair-density functional theory.

Proc Natl Acad Sci U S A

January 2025

Department of Chemistry, Chemical Theory Center, University of Minnesota, Minneapolis, MN 55455-0431.

Multiconfiguration pair-density functional theory (MC-PDFT) was proposed a decade ago, but it is still in the early stage of density functional development. MC-PDFT uses functionals that are called on-top functionals; they depend on the density and the on-top pair density. Most MC-PDFT calculations to date have been unoptimized translations of generalized gradient approximations (GGAs) of Kohn-Sham density functional theory (KS-DFT).

View Article and Find Full Text PDF

Dual-Anion-Rich Polymer Electrolytes for High-Voltage Solid-State Lithium Metal Batteries.

ACS Nano

January 2025

Department of Physics, JC STEM Lab of Energy and Materials Physics, City University of Hong Kong, Hong Kong 999077, P. R. China.

Solid polymer electrolytes (SPEs) are promising candidates for lithium metal batteries (LMBs) owing to their safety features and compatibility with lithium metal anodes. However, the inferior ionic conductivity and electrochemical stability of SPEs hinder their application in high-voltage solid-state LMBs (HVSSLMBs). Here, a strategy is proposed to develop a dual-anion-rich solvation structure by implementing ferroelectric barium titanate (BTO) nanoparticles (NPs) and dual lithium salts into poly(vinylidene fluoride) (PVDF)-based SPEs for HVSSLMBs.

View Article and Find Full Text PDF

From Monocyclization to Pentacyclization: A Versatile Plant Cyclase Produces Diverse Sesterterpenes with Anti-Liver Fibrosis Potential.

Adv Sci (Weinh)

January 2025

State Key Laboratory of Southwestern Chinese Medicine Resources, and Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, 611137, P. R. China.

A prolific multi-product sesterterpene synthase CbTPS1 is characterized from the medicinal Brassicaceae plant Capsella bursa-pastoris. Twenty different sesterterpenes including 16 undescribed compounds, possessing 10 different mono-/di-/tri-/tetra-/penta-carbocyclic skeletons, including the unique 15-membered macrocyclic and 24(15→14)-abeo-capbuane scaffolds, are isolated and structurally elucidated from engineered Escherichia coli strains expressing CbTPS1. Site-directed mutagenesis assisted by molecular dynamics simulations resulted in the variant L354M with up to 13.

View Article and Find Full Text PDF

MoTe Photodetector for Integrated Lithium Niobate Photonics.

Nanomaterials (Basel)

January 2025

State Key Laboratory of High Field Laser Physics and CAS Center for Excellence in Ultra-Intense Laser Science, Shanghai Institute of Optics and Fine Mechanics (SIOM), Chinese Academy of Sciences (CAS), Shanghai 201800, China.

The integration of a photodetector that converts optical signals into electrical signals is essential for scalable integrated lithium niobate photonics. Two-dimensional materials provide a potential high-efficiency on-chip detection capability. Here, we demonstrate an efficient on-chip photodetector based on a few layers of MoTe on a thin film lithium niobate waveguide and integrate it with a microresonator operating in an optical telecommunication band.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!