Deep neural networks are effective at learning directly from low-level encoded data without the need for feature extraction. This paper shows how QSAR models can be constructed from 2D molecular graphs without computing chemical descriptors. Two models based on graph convolutional neural networks are presented, one with and one without a Bayesian estimation of the prediction uncertainty. The property under investigation is mutagenicity: the models developed here predict the outcome of the Ames test. These models take the SMILES representation of the molecules as input, produce molecular graphs in the form of adjacency matrices, and subsequently use attention mechanisms to weight the role of their subgraphs in producing the output. The results compare favorably with current state-of-the-art models. Furthermore, interpretation of our proposed model can be enhanced by automatically extracting the substructures most important in driving the prediction, as well as by uncertainty estimates.
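The pipeline the abstract outlines (molecular graph as an adjacency matrix, graph convolution, attention weighting) can be illustrated with a minimal sketch. This is not the authors' implementation. The toy molecule (ethanol, SMILES "CCO") is hand-encoded rather than parsed, the function names are hypothetical, and the attention weights are arbitrary untrained numbers chosen only for illustration. In practice a cheminformatics toolkit would parse the SMILES string, and all weights would be learned.

```python
import math

# Hypothetical toy molecule: ethanol, SMILES "CCO", heavy atoms only.
# The atom and bond lists are written by hand; no SMILES parsing happens here.
atoms = ["C", "C", "O"]
bonds = [(0, 1), (1, 2)]


def adjacency_matrix(n_atoms, bonds):
    """Symmetric 0/1 adjacency matrix with self-loops on the diagonal."""
    adj = [[1.0 if i == j else 0.0 for j in range(n_atoms)] for i in range(n_atoms)]
    for i, j in bonds:
        adj[i][j] = adj[j][i] = 1.0
    return adj


def graph_conv(adj, features):
    """One mean-aggregation step: each atom averages its own and its neighbors' features."""
    n_feat = len(features[0])
    out = []
    for row in adj:
        degree = sum(row)
        out.append([sum(a * f[k] for a, f in zip(row, features)) / degree
                    for k in range(n_feat)])
    return out


def attention_readout(features, score_weights):
    """Softmax attention over atoms, then a weighted sum into one molecule vector."""
    scores = [sum(w * x for w, x in zip(score_weights, f)) for f in features]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    pooled = [sum(w * f[k] for w, f in zip(weights, features))
              for k in range(len(features[0]))]
    return weights, pooled


# One-hot atom features over the element set [C, O] (an illustrative choice).
features = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
adj = adjacency_matrix(len(atoms), bonds)
hidden = graph_conv(adj, features)
# score_weights are arbitrary, untrained values used only for this sketch.
weights, molecule_vector = attention_readout(hidden, score_weights=[0.5, 2.0])
```

In this sketch the per-atom attention weights sum to one, so they can be read as the relative contribution of each atom to the pooled molecule vector. That is the kind of signal an attention-based model could use to highlight important substructures.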

Source
http://dx.doi.org/10.1007/s11030-021-10250-2

Similar Publications

Strategic Integration of Machine Learning in the Design of Excellent Hybrid Perovskite Solar Cells.

J Phys Chem Lett

January 2025

College of Chemistry and Materials Science, Hebei University, Baoding 071002, P. R. China.

The photoelectric conversion efficiency (PCE) of perovskites remains below the Shockley-Queisser limit, despite their significant potential for solar cell applications. The present focus is on investigating potential multicomponent perovskite candidates, particularly on applying machine learning to expedite band gap screening. To efficiently identify high-performance perovskites, we utilized a data set of 1346 hybrid organic-inorganic perovskites and employed 11 machine learning models, including decision trees, convolutional neural networks (CNNs), and graph neural networks (GNNs).

Prediction of Thermodynamic Properties of C-Based Fullerenols Using Machine Learning.

J Chem Theory Comput

January 2025

Guizhou Provincial Engineering Technology Research Center for Chemical Drug R&D, School of Pharmacy, Guizhou Medical University, Guiyang, Guizhou 550025, P. R. China.

Traditional machine learning methods face significant challenges in predicting the properties of highly symmetric molecules. In this study, we developed a machine learning model based on graph neural networks (GNNs) to accurately and swiftly predict the thermodynamic and photochemical properties of fullerenols, such as C(OH) ( = 1 to 30). First, we established a global method for generating fullerenol isomers through isomer fingerprinting, which can generate all possible isomers or produce diverse structural types on demand.

Structural indicators, also known as structural descriptors, including order parameters, have been proposed to quantify the structural properties of water to account for its anomalous behaviors. However, these indicators, mainly designed for bulk water, are not naturally transferrable to the vicinity of ions due to disruptions in the immediate neighboring space and a resulting loss of feature completeness. To address these non-bulk defects, we introduced a structural indicator that draws on the concept of clique number from graph theory and the criterion in agglomerative clustering, denoted as the average cluster number.

Topological indices, derived from molecular graphs, provide valuable numerical descriptors for the comprehensive analysis of pharmaceuticals. These indices are pivotal in the physicochemical characterization and predictive assessment of various drugs. In this study, we calculate several degree-based topological indices for a range of migraine treatment medications, including aspirin, caffeine, eletriptan, ergotamine, sumatriptan, rizatriptan, verapamil, diclofenac, frovatriptan, and droperidol.

Covalent organic frameworks are a novel class of porous polymers, notable for their crystalline structure, intricate frameworks, defined pore sizes, and capacity for structural design, synthetic control, and functional customization. This paper provides a comprehensive analysis of graph entropies and hybrid topological descriptors, derived from geometric, harmonic, and Zagreb indices. These descriptors are applied to study two variations of Marta covalent organic frameworks based on contorted hexabenzocoronenes.
