Motivation: Determination of the binding affinity of a protein-ligand complex is important to quantitatively specify whether a particular small molecule will bind to the target protein. Besides, collection of comprehensive datasets for protein-ligand complexes and their corresponding binding affinities is crucial in developing accurate scoring functions for the prediction of the binding affinities of previously unknown protein-ligand complexes. In the past decades, several databases of protein-ligand-binding affinities have been created via visual extraction from literature. However, such approaches are time-consuming and most of these databases are updated only a few times per year. Hence, there is an immediate demand for an automatic extraction method with high precision for binding affinity collection.
Result: We have created a new database of protein-ligand-binding affinity data, AutoBind, based on automatic information retrieval. We first compiled a collection of 1586 articles where the binding affinities have been marked manually. Based on this annotated collection, we designed four sentence patterns that are used to scan full-text articles as well as a scoring function to rank the sentences that match our patterns. The proposed sentence patterns can effectively identify the binding affinities in full-text articles. Our assessment shows that AutoBind achieved 84.22% precision and 79.07% recall on the testing corpus. Currently, 13 616 protein-ligand complexes and the corresponding binding affinities have been deposited in AutoBind from 17 221 articles.
Availability: AutoBind is automatically updated on a monthly basis, and it is freely available at http://autobind.csie.ncku.edu.tw/ and http://autobind.mc.ntu.edu.tw/. All of the deposited binding affinities have been refined and approved manually before being released.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/bioinformatics/bts367 | DOI Listing |
Acta Physiol (Oxf)
February 2025
Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, UK.
Aim: Long QT syndrome (LQTS) and catecholaminergic polymorphism ventricular tachycardia (CPVT) are inherited cardiac disorders often caused by mutations in ion channels. These arrhythmia syndromes have recently been associated with calmodulin (CaM) variants. Here, we investigate the impact of the arrhythmogenic variants D131E and Q135P on CaM's structure-function relationship.
View Article and Find Full Text PDFBreast Cancer Res
January 2025
Department of Cancer Biology, Loyola University Chicago Stritch School of Medicine, Maywood, IL, 50153, USA.
Resistance to endocrine therapies remains a major clinical hurdle in breast cancer. Mutations to estrogen receptor alpha (ERα) arise after continued therapeutic pressure. Next generation selective estrogen receptor modulators and degraders/downregulators (SERMs and SERDs) show clinical efficacy, but responses are often non-durable.
View Article and Find Full Text PDFJ Membr Biol
January 2025
School of Chemistry, Sambalpur University, Jyoti Vihar, Burla, Odisha, 768 109, India.
Membrane fusion is the first step in the infection process of the enveloped viruses. Enveloped viruses fuse either at the cell surface or enter the cell through endocytosis and transfer their internal genetic materials by fusing with the endosomal membrane at acidic pH. In this work, we have evaluated the effect of the Dengue virus fusion peptide (DENV FP) on the polyethylene glycol (PEG)-mediated lipid mixing of vesicles (hemifusion formation) at pH 5 and pH 7.
View Article and Find Full Text PDFSci Rep
January 2025
Industrial Biotechnology, Atta-ur-Rahman School of Applied Biosciences, National University of Sciences and Technology, Islamabad, Pakistan.
This study investigates a nanoparticle-based doxycycline (DOX) delivery system targeting cervical cancer cells via the CD44 receptor. Molecular docking revealed a strong binding affinity between hyaluronic acid (HA) and CD44 (binding energy: -7.2 kJ/mol).
View Article and Find Full Text PDFAnal Chem
January 2025
School of Chemistry and Chemical Engineering, Yangzhou University, Yangzhou 225002, P. R. China.
Tumor-derived extracellular vesicles (T-EVs) PD-L1 are an important biomarker for predicting immunotherapy response and can help us understand the mechanism of resistance to immunotherapy. However, this is due to the interference from a large proportion of nontumor-derived EVs. It is still challenging to accurately analyze T-EVs PD-L1 in complex human fluids.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!