SAF: Smart Aggregation Framework for Revealing Atoms Importance Rank and Improving Prediction Rates in Drug Discovery.

J Chem Inf Model

Department of Physiology, Biophysics & Systems Biology, Medicine Faculty, Technion IIT, Haifa 3525422, Israel.

Published: May 2024

Machine learning, and representation learning in particular, has the potential to facilitate drug discovery by screening a large chemical space in silico. A successful approach for representing molecules is to treat them as graphs and utilize graph neural networks. One of the key limitations of such methods is the necessity to represent compounds with different numbers of atoms, which requires aggregating the atom's information. Common aggregation operators, such as averaging, result in a loss of information at the atom level. In this work, we propose a novel aggregating approach where each atom is weighted nonlinearly using the Boltzmann distribution with a hyperparameter analogous to temperature. We show that using this weighted aggregation improves the ability of the gold standard message-passing neural network to predict antibiotic activity. Moreover, by changing the temperature hyperparameter, our approach can reveal the atoms that are important for activity prediction in a smooth and consistent way, thus providing a novel regulated attention mechanism for graph neural networks. We further validate our method by showing that it recapitulates the functional group in β-lactam antibiotics. The ability of our approach to rank the atoms' importance for a desired function can be used within any graph neural network to provide interpretability of the results and predictions at the node level.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11134513PMC
http://dx.doi.org/10.1021/acs.jcim.4c00107DOI Listing

Publication Analysis

Top Keywords

graph neural
12
drug discovery
8
neural networks
8
neural network
8
saf smart
4
smart aggregation
4
aggregation framework
4
framework revealing
4
revealing atoms
4
atoms rank
4

Similar Publications

Diffuse Large B-cell Lymphoma (DLBCL) is a lymphatic cancer of steadily growing incidence. Its diagnostic and follow-up rely on the analysis of clinical biomarkers and 18F-Fluorodeoxyglucose (FDG)-PET/CT images. In this context, we target the problem of assisting in the early identification of high-risk DLBCL patients from both images and tabular clinical data.

View Article and Find Full Text PDF

A universal strategy for smoothing deceleration in deep graph neural networks.

Neural Netw

January 2025

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China. Electronic address:

Graph neural networks (GNNs) have shown great promise in modeling graph-structured data, but the over-smoothing problem restricts their effectiveness in deep layers. Two key weaknesses of existing research on deep GNN models are: (1) ignoring the beneficial aspects of intra-class smoothing while focusing solely on reducing inter-class smoothing, and (2) inefficient computation of residual weights that neglect the influence of neighboring nodes' distributions. To address these weaknesses, we propose a novel Smoothing Deceleration (SD) strategy to reduce the smoothing speed rate of nodes as information propagates between layers, thereby mitigating over-smoothing.

View Article and Find Full Text PDF

Implementing the discontinuous-Galerkin finite element method using graph neural networks with application to diffusion equations.

Neural Netw

December 2024

Department of Earth Science and Engineering, Imperial College London, Prince Consort Road, London SW7 2BP, UK; Centre for AI-Physics Modelling, Imperial-X, White City Campus, Imperial College London, W12 7SL, UK.

Machine learning (ML) has benefited from both software and hardware advancements, leading to increasing interest in capitalising on ML throughout academia and industry. There have been efforts in the scientific computing community to leverage this development via implementing conventional partial differential equation (PDE) solvers with machine learning packages, most of which rely on structured spatial discretisation and fast convolution algorithms. However, unstructured meshes are favoured in problems with complex geometries.

View Article and Find Full Text PDF

Gastric cancer is a leading cause of cancer-related deaths globally. As mortality rates continue to rise, predicting cancer survival using multimodal data-including histopathological images, genomic data, and clinical information-has become increasingly crucial. However, extracting effective predictive features from this complex data has posed challenges for survival analysis due to the high dimensionality and heterogeneity of histopathology images and genomic data.

View Article and Find Full Text PDF

Predicting drug combination side effects based on a metapath-based heterogeneous graph neural network.

BMC Bioinformatics

January 2025

Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China.

In recent years, combined drug screening has played a very important role in modern drug discovery. Generally, synergistic drug combinations are crucial in treatment for many diseases. However, the toxic side effects of drug combinations are probably increased with the increase of drugs numbers, so the accurate prediction of toxic side effects of drug combinations is equally important.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!