High-quality protein-ligand complex structures provide the basis for understanding the nature of noncovalent binding interactions at the atomic level and enable structure-based drug design. However, experimentally determined complex structures are scarce compared with the vast chemical space. In this study, we addressed this issue by constructing the BindingNet data set via comparative complex structure modeling, which contains 69,816 modeled high-quality protein-ligand complex structures with experimental binding affinity data. BindingNet provides valuable insights into investigating protein-ligand interactions, allowing visual inspection and interpretation of structural analogues' structure-activity relationships. It can also be used for evaluating machine-learning-based scoring functions. Our results indicate that machine learning models trained on BindingNet could reduce the bias caused by buried solvent-accessible surface area, as we previously found for models trained on the PDBbind data set. We also discussed strategies to improve BindingNet and its potential utilization for benchmarking the molecular docking methods and ligand binding free energy calculation approaches. The BindingNet complements PDBbind in constructing a sufficient and unbiased protein-ligand binding data set and is freely available at http://bindingnet.huanglab.org.cn.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1021/acs.jcim.3c01170 | DOI Listing |
J Microsc
January 2025
Department of Mechanical, Materials and Aerospace Engineering, University of Liverpool, Liverpool, UK.
Electron backscatter diffraction (EBSD) has developed over the last few decades into a valuable crystallographic characterisation method for a wide range of sample types. Despite these advances, issues such as the complexity of sample preparation, relatively slow acquisition, and damage in beam-sensitive samples, still limit the quantity and quality of interpretable data that can be obtained. To mitigate these issues, here we propose a method based on the subsampling of probe positions and subsequent reconstruction of an incomplete data set.
View Article and Find Full Text PDFDatabase (Oxford)
January 2025
European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
The HoloFood project used a hologenomic approach to understand the impact of host-microbiota interactions on salmon and chicken production by analysing multiomic data, phenotypic characteristics, and associated metadata in response to novel feeds. The project's raw data, derived analyses, and metadata are deposited in public, open archives (BioSamples, European Nucleotide Archive, MetaboLights, and MGnify), so making use of these diverse data types may require access to multiple resources. This is especially complex where analysis pipelines produce derived outputs such as functional profiles or genome catalogues.
View Article and Find Full Text PDFPsychiatry Clin Neurosci
January 2025
Institute of Brain Science, National Yang Ming Chiao Tung University, Taipei, Taiwan.
Aim: Autistic traits exhibit neurodiversity with varying behaviors across developmental stages. Brain complexity theory, illustrating the dynamics of neural activity, may elucidate the evolution of autistic traits over time. Our study explored the patterns of brain complexity in autistic individuals from childhood to adulthood.
View Article and Find Full Text PDFClin Chem
January 2025
Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, United States.
Background: Multianalyte machine learning (ML) models can potentially identify previously undetectable wrong blood in tube (WBIT) errors, improving upon current single-analyte delta check methodology. However, WBIT detection model performance has not been assessed in a real-world, low-prevalence context. To estimate real-world positive predictive values, we propose a methodology to assess WBIT detection models by evaluating the impact of missing data and by using a "low prevalence" validation data set.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!