This paper (i) explores the internal structure of two quantum mechanics datasets (QM7b, QM9), composed of several thousand organic molecules described in terms of electronic properties, and (ii) further explores an inverse approach to molecular design that uses machine learning methods to approximate the atomic composition of molecules, using QM9 data. Understanding the structure and characteristics of this kind of data is important when predicting atomic composition from physical-chemical properties in inverse molecular design. Intrinsic dimension analysis, clustering, and outlier detection methods were used in the study. They revealed that, for both datasets, the intrinsic dimensionality is several times smaller than the number of descriptive dimensions. The QM7b data is composed of well-defined clusters related to atomic composition. The QM9 data consists of an outer region predominantly composed of outliers and an inner, core region that concentrates clustered inlier objects. A significant relationship exists between the number of atoms in a molecule and its outlier/inlier nature. The spatial structure also exhibits a relationship with molecular weight. Despite the structural differences between the two datasets, the predictability of variables of interest for inverse molecular design is high. This is exemplified by models that estimate the number of atoms in the molecule both from the original properties and from lower-dimensional embedding spaces. In the generative approach, the input is a set of desired properties of the molecule and the output is an approximation of its atomic composition in terms of its constituent chemical elements. This could serve as the starting region for further search in the huge space determined by the set of possible chemical compounds. The quantum mechanics dataset QM9, composed of 133,885 small organic molecules and 19 electronic properties, is used in the study.
Different multi-target regression approaches were considered for predicting the atomic composition from the properties, including feature engineering techniques in an automated machine learning framework. High-quality models were found that predict the atomic composition of the molecules from the full set of electronic properties, as well as from a subset containing only 52.6% of them. Feature selection worked better than feature generation. The results validate the generative approach to inverse molecular design.
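
The idea of mapping desired properties to an approximate atomic composition can be illustrated with a minimal sketch. It is not the method used in the paper: it swaps in a plain k-nearest-neighbour multi-target regressor, and the function name `knn_composition`, the two-property toy data, and the (C, H, O) target layout are all assumptions made up for illustration.

```python
import math


def knn_composition(query, properties, compositions, k=3):
    """Estimate element counts for `query` by averaging the compositions
    of the k molecules closest to it in property space.

    Hypothetical helper for illustration only; not the paper's model.
    """
    # Rank the known molecules by Euclidean distance to the query properties.
    ranked = sorted(
        range(len(properties)),
        key=lambda i: math.dist(query, properties[i]),
    )
    nearest = ranked[:k]
    n_targets = len(compositions[0])
    # Multi-target output: one averaged value per chemical element.
    return [
        sum(compositions[i][t] for i in nearest) / len(nearest)
        for t in range(n_targets)
    ]


# Toy data (property values are invented, not taken from QM9).
toy_properties = [
    [0.0, 0.0],
    [0.1, 0.0],
    [0.0, 0.1],
    [5.0, 5.0],
    [5.1, 5.0],
]
# Targets: counts of (C, H, O) atoms for each toy molecule.
toy_compositions = [
    [1, 4, 0],
    [2, 6, 0],
    [1, 4, 1],
    [6, 6, 0],
    [6, 6, 1],
]

estimate = knn_composition([0.05, 0.05], toy_properties, toy_compositions, k=3)
print(estimate)
```

The averaged counts are generally non-integer, which fits the abstract's framing of the output as an approximation that marks a starting region for further search. On real data the properties would normally be standardized first so that no single property dominates the distance.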

Source
http://dx.doi.org/10.1002/jcc.27295