PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications.

Divya B Korlepara C S Vasavi Shruti Jeurkar Pradeep Kumar Pal Subhajit Roy Sarvesh Mehta Shubham Sharma Vishal Kumar Charuvaka Muvva Bhuvanesh Sridharan Akshit Garg Rohit Modee Agastya P Bhati Divya Nayar U Deva Priyakumar

Sci Data

Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, 500032, India.

Published: September 2022

Computational methods and recently modern machine learning methods have played a key role in structure-based drug design. Though several benchmarking datasets are available for machine learning applications in virtual screening, accurate prediction of binding affinity for a protein-ligand complex remains a major challenge. New datasets that allow for the development of models for predicting binding affinities better than the state-of-the-art scoring functions are important. For the first time, we have developed a dataset, PLAS-5k comprised of 5000 protein-ligand complexes chosen from PDB database. The dataset consists of binding affinities along with energy components like electrostatic, van der Waals, polar and non-polar solvation energy calculated from molecular dynamics simulations using MMPBSA (Molecular Mechanics Poisson-Boltzmann Surface Area) method. The calculated binding affinities outperformed docking scores and showed a good correlation with the available experimental values. The availability of energy components may enable optimization of desired components during machine learning-based drug design. Further, OnionNet model has been retrained on PLAS-5k dataset and is provided as a baseline for the prediction of binding affinities.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9451116	PMC
http://dx.doi.org/10.1038/s41597-022-01631-9	DOI Listing

Publication Analysis

Top Keywords

binding affinities

machine learning

plas-5k dataset

molecular dynamics

learning applications

drug design

prediction binding

energy components

affinities

binding

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!