High-quality protein-ligand complex structures provide the basis for understanding noncovalent binding interactions at the atomic level and enable structure-based drug design. However, experimentally determined complex structures are scarce compared with the vast chemical space. In this study, we addressed this issue by constructing the BindingNet data set via comparative complex structure modeling. The data set contains 69,816 modeled high-quality protein-ligand complex structures with experimental binding affinity data. BindingNet provides valuable insights into protein-ligand interactions, allowing visual inspection and interpretation of the structure-activity relationships of structural analogues. It can also be used for evaluating machine-learning-based scoring functions. Our results indicate that machine learning models trained on BindingNet could reduce the bias caused by buried solvent-accessible surface area that we previously found in models trained on the PDBbind data set. We also discussed strategies to improve BindingNet and its potential use for benchmarking molecular docking methods and ligand binding free energy calculation approaches. BindingNet complements PDBbind in constructing a sufficient and unbiased protein-ligand binding data set and is freely available at http://bindingnet.huanglab.org.cn.

DOI: http://dx.doi.org/10.1021/acs.jcim.3c01170
