A High-Quality Data Set of Protein-Ligand Binding Interactions Via Comparative Complex Structure Modeling.

Xuelian Li Cheng Shen Hui Zhu Yujian Yang Qing Wang Jincai Yang Niu Huang

J Chem Inf Model

National Institute of Biological Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, China.

Published: April 2024

High-quality protein-ligand complex structures provide the basis for understanding the nature of noncovalent binding interactions at the atomic level and enable structure-based drug design. However, experimentally determined complex structures are scarce compared with the vast chemical space. In this study, we addressed this issue by constructing the BindingNet data set via comparative complex structure modeling, which contains 69,816 modeled high-quality protein-ligand complex structures with experimental binding affinity data. BindingNet provides valuable insights into investigating protein-ligand interactions, allowing visual inspection and interpretation of structural analogues' structure-activity relationships. It can also be used for evaluating machine-learning-based scoring functions. Our results indicate that machine learning models trained on BindingNet could reduce the bias caused by buried solvent-accessible surface area, as we previously found for models trained on the PDBbind data set. We also discussed strategies to improve BindingNet and its potential utilization for benchmarking the molecular docking methods and ligand binding free energy calculation approaches. The BindingNet complements PDBbind in constructing a sufficient and unbiased protein-ligand binding data set and is freely available at http://bindingnet.huanglab.org.cn.

Download full-text PDF	Source
http://dx.doi.org/10.1021/acs.jcim.3c01170	DOI Listing

Publication Analysis

Top Keywords

data set

complex structures

protein-ligand binding

binding interactions

comparative complex

complex structure

structure modeling

high-quality protein-ligand

protein-ligand complex

models trained

Similar Publications

RETRACTION: Circular RNA circFBXO11 Modulates Hepatocellular Carcinoma Progress and Oxaliplatin Resistance Through miR-605/FOXO3/ABCB1 Axis.

J Cell Mol Med

January 2025

J. Li, X. Qin, R.

View Article and Find Full Text PDF

Similar Publications

Compressive electron backscatter diffraction imaging.

J Microsc

January 2025

Department of Mechanical, Materials and Aerospace Engineering, University of Liverpool, Liverpool, UK.

Zoë Broad Alex W Robinson Jack Wells Daniel Nicholls Amirafshar Moshtaghpour

Electron backscatter diffraction (EBSD) has developed over the last few decades into a valuable crystallographic characterisation method for a wide range of sample types. Despite these advances, issues such as the complexity of sample preparation, relatively slow acquisition, and damage in beam-sensitive samples, still limit the quantity and quality of interpretable data that can be obtained. To mitigate these issues, here we propose a method based on the subsampling of probe positions and subsequent reconstruction of an incomplete data set.

View Article and Find Full Text PDF

Similar Publications

HoloFood Data Portal: holo-omic datasets for analysing host-microbiota interactions in animal production.

Database (Oxford)

January 2025

European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, CB10 1SD, UK.

Alexander B Rogers Varsha Kale Germana Baldi Antton Alberdi M Thomas P Gilbert

The HoloFood project used a hologenomic approach to understand the impact of host-microbiota interactions on salmon and chicken production by analysing multiomic data, phenotypic characteristics, and associated metadata in response to novel feeds. The project's raw data, derived analyses, and metadata are deposited in public, open archives (BioSamples, European Nucleotide Archive, MetaboLights, and MGnify), so making use of these diverse data types may require access to multiple resources. This is especially complex where analysis pipelines produce derived outputs such as functional profiles or genome catalogues.

View Article and Find Full Text PDF

Similar Publications

Identifying Distinct Developmental Patterns of Brain Complexity in Autism: A Cross-Sectional Cohort Analysis Using the Autism Brain Imaging Data Exchange.

Psychiatry Clin Neurosci

January 2025

Institute of Brain Science, National Yang Ming Chiao Tung University, Taipei, Taiwan.

I-Jou Chi Shih-Jen Tsai Chun-Houh Chen Albert C Yang

Aim: Autistic traits exhibit neurodiversity with varying behaviors across developmental stages. Brain complexity theory, illustrating the dynamics of neural activity, may elucidate the evolution of autistic traits over time. Our study explored the patterns of brain complexity in autistic individuals from childhood to adulthood.

View Article and Find Full Text PDF

Similar Publications

A Multianalyte Machine Learning Model to Detect Wrong Blood in Complete Blood Count Tube Errors in a Pediatric Setting.

Clin Chem

January 2025

Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, United States.

Brendan V Graham Stephen R Master Amrom E Obstfeld Robert B Wilson

Background: Multianalyte machine learning (ML) models can potentially identify previously undetectable wrong blood in tube (WBIT) errors, improving upon current single-analyte delta check methodology. However, WBIT detection model performance has not been assessed in a real-world, low-prevalence context. To estimate real-world positive predictive values, we propose a methodology to assess WBIT detection models by evaluating the impact of missing data and by using a "low prevalence" validation data set.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!