AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature.

Bioinformatics

Department of Electrical Engineering, Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, Taiwan.

Published: August 2012

Motivation: Determination of the binding affinity of a protein-ligand complex is important to quantitatively specify whether a particular small molecule will bind to the target protein. Besides, collection of comprehensive datasets for protein-ligand complexes and their corresponding binding affinities is crucial in developing accurate scoring functions for the prediction of the binding affinities of previously unknown protein-ligand complexes. In the past decades, several databases of protein-ligand-binding affinities have been created via visual extraction from literature. However, such approaches are time-consuming and most of these databases are updated only a few times per year. Hence, there is an immediate demand for an automatic extraction method with high precision for binding affinity collection.

Result: We have created a new database of protein-ligand-binding affinity data, AutoBind, based on automatic information retrieval. We first compiled a collection of 1586 articles where the binding affinities have been marked manually. Based on this annotated collection, we designed four sentence patterns that are used to scan full-text articles as well as a scoring function to rank the sentences that match our patterns. The proposed sentence patterns can effectively identify the binding affinities in full-text articles. Our assessment shows that AutoBind achieved 84.22% precision and 79.07% recall on the testing corpus. Currently, 13 616 protein-ligand complexes and the corresponding binding affinities have been deposited in AutoBind from 17 221 articles.

Availability: AutoBind is automatically updated on a monthly basis, and it is freely available at http://autobind.csie.ncku.edu.tw/ and http://autobind.mc.ntu.edu.tw/. All of the deposited binding affinities have been refined and approved manually before being released.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bts367DOI Listing

Publication Analysis

Top Keywords

binding affinities
24
protein-ligand complexes
12
automatic extraction
8
protein-ligand-binding affinity
8
affinity data
8
binding
8
binding affinity
8
complexes corresponding
8
corresponding binding
8
sentence patterns
8

Similar Publications

Arrhythmogenic calmodulin variants D131E and Q135P disrupt interaction with the L-type voltage-gated Ca channel (Ca1.2) and reduce Ca-dependent inactivation.

Acta Physiol (Oxf)

February 2025

Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, UK.

Aim: Long QT syndrome (LQTS) and catecholaminergic polymorphism ventricular tachycardia (CPVT) are inherited cardiac disorders often caused by mutations in ion channels. These arrhythmia syndromes have recently been associated with calmodulin (CaM) variants. Here, we investigate the impact of the arrhythmogenic variants D131E and Q135P on CaM's structure-function relationship.

View Article and Find Full Text PDF

Resistance to endocrine therapies remains a major clinical hurdle in breast cancer. Mutations to estrogen receptor alpha (ERα) arise after continued therapeutic pressure. Next generation selective estrogen receptor modulators and degraders/downregulators (SERMs and SERDs) show clinical efficacy, but responses are often non-durable.

View Article and Find Full Text PDF

Dengue Virus Fusion Peptide Promotes Hemifusion Formation by Disordering the Interfacial Region of the Membrane.

J Membr Biol

January 2025

School of Chemistry, Sambalpur University, Jyoti Vihar, Burla, Odisha, 768 109, India.

Membrane fusion is the first step in the infection process of the enveloped viruses. Enveloped viruses fuse either at the cell surface or enter the cell through endocytosis and transfer their internal genetic materials by fusing with the endosomal membrane at acidic pH. In this work, we have evaluated the effect of the Dengue virus fusion peptide (DENV FP) on the polyethylene glycol (PEG)-mediated lipid mixing of vesicles (hemifusion formation) at pH 5 and pH 7.

View Article and Find Full Text PDF

This study investigates a nanoparticle-based doxycycline (DOX) delivery system targeting cervical cancer cells via the CD44 receptor. Molecular docking revealed a strong binding affinity between hyaluronic acid (HA) and CD44 (binding energy: -7.2 kJ/mol).

View Article and Find Full Text PDF

Tumor-derived extracellular vesicles (T-EVs) PD-L1 are an important biomarker for predicting immunotherapy response and can help us understand the mechanism of resistance to immunotherapy. However, this is due to the interference from a large proportion of nontumor-derived EVs. It is still challenging to accurately analyze T-EVs PD-L1 in complex human fluids.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!