Chemical (molecular, quantum) machine learning relies on representing molecules in unique and informative ways. Here, we present the matrix of orthogonalized atomic orbital coefficients (MAOC) as a quantum-inspired molecular and atomic representation containing both structural (composition and geometry) and electronic (charge and spin multiplicity) information. MAOC is based on a cost-effective localization scheme that represents localized orbitals via a predefined set of atomic orbitals. The latter can be constructed from such small atom-centered basis sets as pcseg-0 and STO-3G in conjunction with guess (non-optimized) electronic configuration of the molecule. Importantly, MAOC is suitable for representing monatomic, molecular, and periodic systems and can distinguish compounds with identical compositions and geometries but distinct charges and spin multiplicities. Using principal component analysis, we constructed a more compact but equally powerful version of MAOC-PCX-MAOC. To test the performance of full and reduced MAOC and several other representations (CM, SOAP, SLATM, and SPAHM), we used a kernel ridge regression machine learning model to predict frontier molecular orbital energy levels and ground state single-point energies for chemically diverse neutral and charged, closed- and open-shell molecules from an extended QM7b dataset, as well as two new datasets, N-HPC-1 (N-heteropolycycles) and REDOX (nitroxyl and phenoxyl radicals, carbonyl, and cyano compounds). MAOC affords accuracy that is either similar or superior to other representations for a range of chemical properties and systems.

Download full-text PDF

Source
http://dx.doi.org/10.1063/5.0151122DOI Listing

Publication Analysis

Top Keywords

matrix orthogonalized
8
orthogonalized atomic
8
atomic orbital
8
orbital coefficients
8
machine learning
8
maoc
5
atomic
4
coefficients representation
4
representation radicals
4
radicals ions
4

Similar Publications

Triplet-triplet energy transfer (TEnT) is of particular interest in various photochemical, photobiological, and energy science processes. It involves the exchange of spin and energy of electrons between two molecular fragments. Here, quasi-diabatic self-consistent field solutions were used to obtain the diabatic states involved in TEnT.

View Article and Find Full Text PDF

Exploring Brain Imaging and Genetic Risk Factors in Different Progression States of Alzheimer's Disease Through OSnetNMF-Based Methods.

J Mol Neurosci

January 2025

Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.

Alzheimer's disease (AD) is a neurodegenerative disease with no effective treatment, often preceded by mild cognitive impairment (MCI). Multimodal imaging genetics integrates imaging and genetic data to gain a deeper understanding of disease progression and individual variations. This study focuses on exploring the mechanisms that drive the transition from normal cognition to MCI and ultimately to AD.

View Article and Find Full Text PDF

Advanced tissue technologies of blood-brain barrier organoids as high throughput toxicity readouts in drug development.

Heliyon

January 2025

Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Grenzacherstrasse 124, 4070, Basel, Switzerland.

Recent advancements in engineering Complex models (CIVMs) such as Blood-brain barrier (BBB) organoids offer promising platforms for preclinical drug testing. However, their application in drug development, and especially for the regulatory purposes of toxicity assessment, requires robust and reproducible techniques. Here, we developed an adapted set of orthogonal image-based tissue methods including hematoxylin and eosin staining (HE), immunohistochemistry (IHC), multiplex immunofluorescence (mIF), and Matrix Assisted Laser Desorption/Ionization Mass Spectrometry Imaging (MALDI-MSI) to validate CIVMs for drug toxicity assessments.

View Article and Find Full Text PDF

Developing single-particle nanocomposite with aqueous-phase orthogonal multicolor phosphorescence or multimodal luminescence holds great significance for optical coding, anti-counterfeiting encryption, bioimaging, and biosensing. However, it faces challenges such as a limited range of emission wavelengths and difficulties in controlling the synthesis process. In this work, a conjugate structure manipulation integrated luminophor confinement strategy is proposed to prepare carbon dots@upconversion nanoparticles (CDs@UCNPs) featuring aqueous-phase orthogonal multicolor room-temperature phosphorescence-upconversion luminescence (RTP-UCL) through wet-chemical synthetic methods.

View Article and Find Full Text PDF

Pyrethroids are synthetic chemicals that account for 16% of the international insecticide market and have been shown to be of varying toxicity to different species. There are various methods available for detecting pyrethroids in agricultural products, but these products must be pre-treated to remove interference from the food matrix, such as through dispersion liquid-liquid microextraction (DLLME). This study employed two experimental design methods to optimize the continuous and discontinuous experimental parameters of DLLME and investigated whether DLLME combined with GC-NICI-MS is effective for detecting pyrethroids in agricultural products.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!