Computational methods have been widely applied to resolve various core issues in drug discovery, such as molecular property prediction. In recent years, a data-driven computational method-deep learning had achieved a number of impressive successes in various domains. In drug discovery, graph neural networks (GNNs) take molecular graph data as input and learn graph-level representations in non-Euclidean space. An enormous amount of well-performed GNNs have been proposed for molecular graph learning. Meanwhile, efficient use of molecular data during training process, however, has not been paid enough attention. Curriculum learning (CL) is proposed as a training strategy by rearranging training queue based on calculated samples' difficulties, yet the effectiveness of CL method has not been determined in molecular graph learning. In this study, inspired by chemical domain knowledge and task prior information, we proposed a novel CL-based training strategy to improve the training efficiency of molecular graph learning, called CurrMG. Consisting of a difficulty measurer and a training scheduler, CurrMG is designed as a plug-and-play module, which is model-independent and easy-to-use on molecular data. Extensive experiments demonstrated that molecular graph learning models could benefit from CurrMG and gain noticeable improvement on five GNN models and eight molecular property prediction tasks (overall improvement is 4.08%). We further observed CurrMG's encouraging potential in resource-constrained molecular property prediction. These results indicate that CurrMG can be used as a reliable and efficient training strategy for molecular graph learning. Availability: The source code is available in https://github.com/gu-yaowen/CurrMG.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bbac099DOI Listing

Publication Analysis

Top Keywords

molecular graph
28
graph learning
24
molecular
12
molecular property
12
property prediction
12
training strategy
12
strategy molecular
8
graph
8
learning
8
drug discovery
8

Similar Publications

Prediction of Thermodynamic Properties of C-Based Fullerenols Using Machine Learning.

J Chem Theory Comput

January 2025

Guizhou Provincial Engineering Technology Research Center for Chemical Drug R&D, School of Pharmacy, Guizhou Medical University, Guiyang, Guizhou 550025, P. R. China.

Traditional machine learning methods face significant challenges in predicting the properties of highly symmetric molecules. In this study, we developed a machine learning model based on graph neural networks (GNNs) to accurately and swiftly predict the thermodynamic and photochemical properties of fullerenols, such as C(OH) ( = 1 to 30). First, we established a global method for generating fullerenol isomers through isomer fingerprinting, which can generate all possible isomers or produce diverse structural types on demand.

View Article and Find Full Text PDF

GraphkmerDTA: integrating local sequence patterns and topological information for drug-target binding affinity prediction and applications in multi-target anti-Alzheimer's drug discovery.

Mol Divers

January 2025

Key Laboratory of Prevention and Treatment of Cardiovascular and Cerebrovascular Diseases Ministry of Education, Jiangxi Province Key Laboratory of Biomaterials and Biofabrication for Tissue Engineering, Gannan Medical University, Ganzhou, 341000, Jiangxi, China.

Identifying drug-target binding affinity (DTA) plays a critical role in early-stage drug discovery. Despite the availability of various existing methods, there are still two limitations. Firstly, sequence-based methods often extract features from fixed length protein sequences, requiring truncation or padding, which can result in information loss or the introduction of unwanted noise.

View Article and Find Full Text PDF

FlowPacker: Protein side-chain packing with torsional flow matching.

Bioinformatics

January 2025

Department of Molecular Genetics, University of Toronto, Ontario, M5S 3K3, Canada.

Motivation: Accurate prediction of protein side-chain conformations is necessary to understand protein folding, protein-protein interactions and facilitate de novo protein design.

Results: Here we apply torsional flow matching and equivariant graph attention to develop FlowPacker, a fast and performant model to predict protein side-chain conformations conditioned on the protein sequence and backbone. We show that FlowPacker outperforms previous state-of-the-art baselines across most metrics with improved runtime.

View Article and Find Full Text PDF

Motivation: Accurately predicting the degradation capabilities of proteolysis-targeting chimeras (PROTACs) for given target proteins and E3 ligases is important for PROTAC design. The distinctive ternary structure of PROTACs presents a challenge to traditional drug-target interaction prediction methods, necessitating more innovative approaches. While current state-of-the-art (SOTA) methods using graph neural networks (GNNs) can discern the molecular structure of PROTACs and proteins, thus enabling the efficient prediction of PROTACs' degradation capabilities, they rely heavily on limited crystal structure data of the POI-PROTAC-E3 ternary complex.

View Article and Find Full Text PDF

A Neural-Network-Based Mapping and Optimization Framework for High-Precision Coarse-Grained Simulation.

J Chem Theory Comput

January 2025

Beijing National Laboratory for Molecular Sciences, State Key Laboratory of Polymer Physics and Chemistry, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, P. R. China.

The accuracy and efficiency of a coarse-grained (CG) force field are pivotal for high-precision molecular simulations of large systems with complex molecules. We present an automated mapping and optimization framework for molecular simulation (AMOFMS), which is designed to streamline and improve the force field optimization process. It features a neural-network-based mapping function, DSGPM-TP (deep supervised graph partitioning model with type prediction).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!