J Chem Inf Model
February 2025
Accurately predicting activation energies is crucial for understanding chemical reactions and modeling complex reaction systems. However, the high computational cost of quantum chemistry methods often limits the feasibility of large-scale studies, leading to a scarcity of high-quality activation energy data. In this work, we explore and compare three innovative approaches (transfer learning, delta learning, and feature engineering) to enhance the accuracy of activation energy predictions using graph neural networks, specifically focusing on methods that incorporate low-cost, low-level computational data.
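Of the three approaches named above, delta learning is the simplest to illustrate: the model learns only the difference between a cheap low-level value and the expensive high-level target. The following is a minimal sketch on synthetic numbers, not the paper's method. A linear least-squares fit stands in for the graph neural network, and the function names `fit_delta_model` and `predict_delta` are hypothetical.

```python
import numpy as np


def _with_bias(features):
    """Append a constant column so the linear fit has an intercept."""
    features = np.asarray(features, dtype=float)
    return np.hstack([features, np.ones((features.shape[0], 1))])


def fit_delta_model(features, e_low, e_high):
    """Fit weights w so that e_high is approximately e_low + [features, 1] @ w.

    Assumption: a linear model replaces the graph neural network here.
    """
    residual = np.asarray(e_high, dtype=float) - np.asarray(e_low, dtype=float)
    w, *_ = np.linalg.lstsq(_with_bias(features), residual, rcond=None)
    return w


def predict_delta(features, e_low, w):
    """High-level estimate = low-level value + learned correction."""
    return np.asarray(e_low, dtype=float) + _with_bias(features) @ w


# Synthetic demonstration data (not real activation energies).
rng = np.random.default_rng(0)
features = rng.normal(size=(20, 2))
e_low = rng.normal(loc=50.0, scale=10.0, size=20)
e_high = e_low + features @ np.array([2.0, -1.0]) + 0.5

w = fit_delta_model(features, e_low, e_high)
e_pred = predict_delta(features, e_low, w)
```

Because the model only has to capture the residual, the low-level calculation carries most of the signal, which is why this strategy can help when high-level data are scarce.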
Beilstein J Org Chem
October 2024
This review surveys the recent advances and challenges in predicting and optimizing reaction conditions using machine learning techniques. The paper emphasizes the importance of acquiring and processing large and diverse datasets of chemical reactions, and the use of both global and local models to guide the design of synthetic processes. Global models exploit the information from comprehensive databases to suggest general reaction conditions for new reactions, while local models fine-tune the specific parameters for a given reaction family to improve yield and selectivity.
Additive engineering, with its excellent ability to passivate bulk or surface perovskite defects, has become a common strategy to improve the performance and stability of perovskite solar cells (PVSCs). Among the various additives reported so far, ammonium salts are considered an important branch. It is worth noting that although both ammonium-based additives (R-NH₃⁺) and amine-based additives (R-NH₂) are derivatives of ammonia (NH₃), the functions of the two can be easily confused due to their structural similarities.
Deep graph neural networks are extensively utilized to predict chemical reactivity and molecular properties. However, because of the complexity of chemical space, such models often have difficulty extrapolating beyond the chemistry contained in the training set. Augmenting the model with quantum mechanical (QM) descriptors is anticipated to improve its generalizability.
Quantum mechanics/molecular mechanics (QM/MM) simulations offer an efficient way to model reactions occurring in complex environments. This study introduces a specialized set of charge and Lennard-Jones parameters tailored for electrostatically embedded QM/MM calculations, aiming to accurately model both adsorption processes and catalytic reactions in zirconium-based metal-organic frameworks (Zr-MOFs). To validate our approach, we compare adsorption energies derived from QM/MM simulations against experimental results and Monte Carlo simulation outcomes.
This paper presents AutoTemplate, an innovative data preprocessing protocol, addressing the crucial need for high-quality chemical reaction datasets in the realm of machine learning applications in organic chemistry. Recent advances in artificial intelligence have expanded the application of machine learning in chemistry, particularly in yield prediction, retrosynthesis, and reaction condition prediction. However, the effectiveness of these models hinges on the integrity of chemical reaction datasets, which are often plagued by inconsistencies like missing reactants, incorrect atom mappings, and outright erroneous reactions.
In the field of chemical synthesis planning, the accurate recommendation of reaction conditions is essential for achieving successful outcomes. This work introduces an innovative deep learning approach designed to address the complex task of predicting appropriate reagents, solvents, and reaction temperatures for chemical reactions. Our proposed methodology combines a multi-label classification model with a ranking model to offer tailored reaction condition recommendations based on relevance scores derived from anticipated product yields.
J Chem Theory Comput
December 2023
Geometry optimization is a crucial step in computational chemistry, and the efficiency of optimization algorithms plays a pivotal role in reducing computational costs. In this study, we introduce a novel reinforcement-learning-based optimizer that surpasses traditional methods in terms of efficiency. What sets our model apart is its ability to incorporate chemical information into the optimization process.
Water and other small molecules frequently coordinate within metal-organic frameworks (MOFs). These coordinated molecules may actively engage in mass transfer, moving together with the transport molecules, but this phenomenon has yet to be examined. In this study, we explore a unique water transfer mechanism in UTSA-280, where an incoming water molecule can displace a coordinated molecule for mass transfer.
Structural flexibility is a critical issue that limits the application of metal-organic framework (MOF) membranes for gas separation. Herein we propose a mixed-linker approach to suppress the structural flexibility of the CAU-10-based (CAU = Christian-Albrechts-University) membranes. Specifically, pure CAU-10-PDC membranes display high separation performance but at the same time are highly unstable for the separation of CO₂/CH₄.
Quantifying uncertainty in machine learning is important in new research areas with scarce high-quality data. In this work, we develop an explainable uncertainty quantification method for deep learning-based molecular property prediction. This method can capture aleatoric and epistemic uncertainties separately and attribute the uncertainties to atoms present in the molecule.
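One common way to separate the two kinds of uncertainty, shown here only as an illustration and not necessarily the scheme used in this work, is an ensemble decomposition: each ensemble member predicts a mean and a noise variance, the averaged variance is read as aleatoric uncertainty, and the spread of the means is read as epistemic uncertainty. The function name `decompose_uncertainty` and the input numbers are hypothetical, and the atom-level attribution described in the abstract is not covered.

```python
import numpy as np


def decompose_uncertainty(means, variances):
    """Split predictive variance into aleatoric and epistemic parts.

    means, variances: arrays of shape (n_models, n_molecules) holding each
    ensemble member's predicted mean and predicted noise variance.
    """
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    aleatoric = variances.mean(axis=0)  # average predicted data noise
    epistemic = means.var(axis=0)       # disagreement between ensemble members
    return aleatoric, epistemic, aleatoric + epistemic


# Two hypothetical ensemble members, two molecules.
aleatoric, epistemic, total = decompose_uncertainty(
    means=[[1.0, 2.0], [3.0, 2.0]],
    variances=[[0.1, 0.2], [0.3, 0.2]],
)
```

In this toy input the members disagree only on the first molecule, so only that molecule receives a nonzero epistemic term.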
J Chem Theory Comput
November 2022
The accurate prediction of thermochemistry and kinetic parameters is an important task for reaction modeling. Unfortunately, the commonly used harmonic oscillator model is often not accurate enough due to the absence of anharmonic effects. In this work, we improve the representation of an anharmonic potential energy surface (PES) using uncoupled mode (UM) approximations, which model the full-dimensional PES as a sum of one-dimensional potentials of each mode.
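The uncoupled mode idea described above can be written compactly. The notation here is assumed for illustration and does not come from the abstract: $q_i$ is the displacement along mode $i$, $N$ is the number of modes, $V_0$ is the energy at the reference geometry, and $k_i$ is the harmonic force constant of mode $i$.

```latex
% Uncoupled mode (UM) approximation: the full-dimensional PES is replaced
% by a sum of one-dimensional potentials, one per mode.
V(q_1, \dots, q_N) \approx V_0 + \sum_{i=1}^{N} V_i(q_i)

% The harmonic oscillator model is the special case in which every
% one-dimensional potential is quadratic.
V_i^{\mathrm{HO}}(q_i) = \tfrac{1}{2} k_i q_i^{2}
```

Anharmonicity enters through the shape of each $V_i$, while couplings between different modes are neglected by construction.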
Machine learning predictions of molecular thermochemistry, such as formation enthalpy, have been limited for large and complicated species because of the lack of available training data. Such predictions would be important in the prediction of reaction thermodynamics and the construction of kinetic models. Herein, we introduce a graph-based deep learning approach that can separately learn the enthalpy contribution of each atom in its local environment with the effect of the overall molecular structure taken into account.
Heavy metal contamination in underground water commonly occurs in industrial areas in Taiwan. Wine-processing waste sludge (WPWS) can adsorb and remove several toxic metals from aqueous solutions. In this study, WPWS particles were used to construct a permeable reactive barrier (PRB) for the remediation of a contaminant plume comprising HCrO₄⁻, Cu²⁺, Zn²⁺, Ni²⁺, Cd²⁺, and AsO in a simulated aquifer.
In this study, proton-conducting behaviors of a cerium-based metal-organic framework (MOF), Ce-MOF-808, its zirconium-based isostructural MOF, and bimetallic MOFs with various Zr-to-Ce ratios are investigated. A significantly increased proton conductivity (σ) and decreased activation energy (Eₐ) are obtained by substituting Zr with Ce in the nodes of MOF-808. Ce-MOF-808 achieves a σ of 4.
J Chem Phys
August 2021
This article summarizes technical advances contained in the fifth major release of the Q-Chem quantum chemistry program package, covering developments since 2015. A comprehensive library of exchange-correlation functionals, along with a suite of correlated many-body methods, continues to be a hallmark of the Q-Chem software. The many-body methods include novel variants of both coupled-cluster and configuration-interaction approaches along with methods based on the algebraic diagrammatic construction and variational reduced density-matrix methods.
Metal-organic framework (MOF) in biomass valorization is a promising technology developed in recent decades. By tailoring both the metal nodes and organic ligands, MOFs exhibit multiple functionalities, which not only extend their applicability in biomass conversion but also increase the complexity of material designs. To address this issue, quantum mechanical simulations have been used to provide mechanistic insights into the catalysis of biomass-derived molecules, which could potentially facilitate the development of novel MOF-based materials for biomass valorization.
J Chem Theory Comput
February 2021
Though quasi-Newton methods have been widely adopted in computational chemistry software for molecular geometry optimization, it is well known that these methods might not perform well for initial guess geometries far away from the local minima, where the quadratic approximation might be inaccurate. We propose a reinforcement learning approach to develop a model that produces a correction term for the quasi-Newton step calculated with the BFGS algorithm to improve the overall optimization performance. Our model is able to complete the optimization in about 30% fewer steps than pure BFGS for molecules starting from perturbed geometries.
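The structure described above, a BFGS quasi-Newton step plus an additive correction, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `correction_fn` is a hypothetical hook where a trained reinforcement learning policy would be called, no line search or trust radius is used, and the test problem is a small quadratic rather than a molecular energy surface.

```python
import numpy as np


def bfgs_with_correction(grad, x0, correction_fn=None, max_steps=100, tol=1e-8):
    """Minimize using unit BFGS steps plus an optional additive correction.

    grad: callable returning the gradient at x.
    correction_fn: hypothetical hook (x, g, step) -> correction vector;
        a learned model would be plugged in here.
    Returns the final point and the number of steps taken.
    """
    x = np.asarray(x0, dtype=float)
    identity = np.eye(x.size)
    h_inv = identity.copy()          # initial inverse-Hessian guess
    g = grad(x)
    for step in range(max_steps):
        if np.linalg.norm(g) < tol:
            return x, step
        p = -h_inv @ g               # plain quasi-Newton step
        if correction_fn is not None:
            p = p + correction_fn(x, g, p)
        x_new = x + p
        g_new = grad(x_new)
        s, y = x_new - x, g_new - g
        sy = s @ y
        if sy > 1e-12:               # curvature condition; otherwise skip update
            rho = 1.0 / sy
            left = identity - rho * np.outer(s, y)
            right = identity - rho * np.outer(y, s)
            h_inv = left @ h_inv @ right + rho * np.outer(s, s)
        x, g = x_new, g_new
    return x, max_steps


# Toy quadratic: f(x) = 0.5 x^T A x - b^T x, with gradient A x - b.
A = np.array([[1.0, 0.2], [0.2, 1.2]])
b = np.array([1.0, -1.0])
x_min, n_steps = bfgs_with_correction(lambda x: A @ x - b, x0=[3.0, -2.0])
```

With `correction_fn=None` the routine reduces to plain BFGS with unit steps, which gives a baseline against which any learned correction could be compared by step count.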
The heterogeneous metal-organic framework Bi-BTC successfully catalyzed the synthesis of para-xylene from bio-based 2,5-dimethylfuran and acrylic acid in a promising yield (92%), under relatively mild conditions (160 °C, 10 bar), and with a low reaction-energy barrier (47.3 kJ mol⁻¹). The proposed reaction strategy also demonstrates a remarkable versatility for furan derivatives such as furan and 2-methylfuran.
Advances in deep neural network (DNN)-based molecular property prediction have recently led to the development of models of remarkable accuracy and generalization ability, with graph convolutional neural networks (GCNNs) reporting state-of-the-art performance for this task. However, some challenges remain, and one of the most important that needs to be fully addressed concerns uncertainty quantification. DNN performance is affected by the volume and the quality of the training samples.
Machine learning provides promising new methods for accurate yet rapid prediction of molecular properties, including thermochemistry, which is an integral component of many computer simulations, particularly automated reaction mechanism generation. Often, very large data sets with tens of thousands of molecules are required for training the models, but most data sets of experimental or high-accuracy quantum mechanical quality are much smaller. To overcome these limitations, we calculate new high-level data sets and derive bond additivity corrections to significantly improve enthalpies of formation.
Because collecting precise and accurate chemistry data is often challenging, chemistry data sets usually only span a small region of chemical space, which limits the performance and the scope of applicability of data-driven models. To address this issue, we integrated an active learning machine with automatic ab initio calculations to form a self-evolving model that can continuously adapt to new species appointed by the users. In the present work, we demonstrate the self-evolving concept by modeling the formation enthalpies of stable closed-shell polycyclic species calculated at the B3LYP/6-31G(2df,p) level of theory.
Numerous studies have shown that many patients who suffer from type 2 diabetes mellitus exhibit cognitive dysfunction and neuronal synaptic impairments. Therefore, growing evidence suggests that type 2 diabetes mellitus has a close relationship with occurrence and progression of neurodegeneration and neural impairment in Alzheimer's disease. However, the relationship between metabolic disorders caused by type 2 diabetes mellitus and neurodegeneration and neural impairments in Alzheimer's disease is still not fully determined.
Ketohydroperoxides are important in liquid-phase autoxidation and in gas-phase partial oxidation and pre-ignition chemistry, but because of their low concentration, instability, and various analytical chemistry limitations, it has been challenging to experimentally determine their reactivity, and only a few pathways are known. In the present work, 75 elementary-step unimolecular reactions of the simplest γ-ketohydroperoxide, 3-hydroperoxypropanal, were discovered by a combination of density functional theory with several automated transition-state search algorithms: the Berny algorithm coupled with the freezing string method, single- and double-ended growing string methods, the heuristic KinBot algorithm, and the single-component artificial force induced reaction method (SC-AFIR). The present joint approach significantly outperforms previous manual and automated transition-state searches - 68 of the reactions of γ-ketohydroperoxide discovered here were previously unknown and completely unexpected.