What's What: The (Nearly) Definitive Guide to Reaction Role Assignment.

J Chem Inf Model

T5 Informatics GmbH, Spalenring 11, 4055 Basel, Switzerland.

Published: December 2016

When analyzing chemical reactions, it is essential to know which molecules are actively involved in the reaction and which educts form the product molecules. Assigning reaction roles, such as reactant, reagent, or product, to the molecules of a chemical reaction may be trivial for hand-curated reaction schemes, but it is more difficult to automate, and automation is essential when handling large amounts of reaction data. Here, we describe a new fingerprint-based, data-driven approach to assigning reaction roles that is also applicable to rather unbalanced and noisy reaction schemes. Given the set of molecules involved and the known product(s) of a reaction, we assign the most probable reactants and classify the remaining molecules as reagents. Our approach was validated using two data sets. The first is a hand-curated set of about 680 diverse reactions extracted from patents, which span more than 200 different reaction types and include up to 18 different reactants. The second consists of 50 000 randomly picked reactions from US patents. The results for the second data set were compared with those obtained using two different atom-to-atom mapping algorithms. For both data sets, our method assigns the reaction roles correctly for the vast majority of reactions, achieving accuracies of 88% and 97%, respectively. The median time needed, about 8 ms, indicates that the algorithm is fast enough to be applied to large collections. The new method is available as part of the RDKit toolkit, and the data sets and Jupyter notebooks used to evaluate it are available in the Supporting Information of this publication.
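The core idea in the abstract (given the product, pick the candidate molecules that most plausibly account for it and label the rest as reagents) can be illustrated with a toy sketch. This is not the published algorithm or the RDKit API: the heavy-atom count "fingerprints", the L1 distance score, the exhaustive subset search, and the function names `l1_distance` and `assign_roles` are all assumptions made for illustration only.

```python
from collections import Counter
from itertools import combinations


def l1_distance(a, b):
    """Sum of absolute count differences between two count fingerprints."""
    return sum(abs(a[key] - b[key]) for key in set(a) | set(b))


def assign_roles(candidates, product_fp):
    """Split candidates into (reactants, reagents).

    candidates: dict mapping a molecule name to a Counter of features
                (hypothetical stand-in for a real molecular fingerprint).
    product_fp: Counter of features for the known product.

    Every non-empty subset is scored by how closely its summed
    fingerprint matches the product; the closest subset is taken as
    the reactants. Ties are resolved in favour of the smaller subset.
    """
    if not candidates:
        raise ValueError("at least one candidate molecule is required")
    names = sorted(candidates)
    best_score, best_subset = None, ()
    for size in range(1, len(names) + 1):
        for subset in combinations(names, size):
            combined = Counter()
            for name in subset:
                combined.update(candidates[name])
            score = l1_distance(combined, product_fp)
            if best_score is None or score < best_score:
                best_score, best_subset = score, subset
    reactants = list(best_subset)
    reagents = [name for name in names if name not in best_subset]
    return reactants, reagents


# Toy esterification: heavy-atom counts only (assumed, simplified features).
candidates = {
    "acetic acid": Counter({"C": 2, "O": 2}),
    "ethanol": Counter({"C": 2, "O": 1}),
    "sulfuric acid": Counter({"S": 1, "O": 4}),
    "toluene": Counter({"C": 7}),
}
ethyl_acetate = Counter({"C": 4, "O": 2})

reactants, reagents = assign_roles(candidates, ethyl_acetate)
print("reactants:", reactants)
print("reagents:", reagents)
```

The exhaustive search grows exponentially with the number of candidates, so it only suits small examples like this one; a practical implementation would need real fingerprints and a smarter search or scoring strategy.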

Source: http://dx.doi.org/10.1021/acs.jcim.6b00564
