Discovering rules for protein-ligand specificity using support vector inductive logic programming.

Protein Eng Des Sel

Structural Bioinformatics Group, Division of Molecular Biosciences, Imperial College London, London, UK.

Published: September 2009

Structural genomics initiatives are rapidly generating vast numbers of protein structures. Comparative modelling is also capable of producing accurate structural models for many protein sequences. However, for many of the known structures, functions are not yet determined, and in many modelling tasks, an accurate structural model does not necessarily tell us about function. Thus, there is a pressing need for high-throughput methods for determining function from structure. The spatial arrangement of key amino acids in a folded protein, on the surface or buried in clefts, is often the determinant of its biological function. A central aim of molecular biology is to understand the relationship between such substructures or surfaces and biological function, leading both to function prediction and to function design. We present a new general method for discovering the features of binding pockets that confer specificity for particular ligands. Using a recently developed machine-learning technique that couples the rule-discovery approach of inductive logic programming with the statistical learning power of support vector machines, we are able to discriminate, with high precision (90%) and recall (86%), between pockets that bind FAD and those that bind NAD on a large benchmark set, given only the geometry and composition of the backbone of the binding pocket and without the use of docking. In addition, we learn rules governing this specificity, which can feed into protein functional design protocols. An analysis of the rules found suggests that key features of the binding pocket may be tied to conformational freedom in the ligand. The representation is sufficiently general to be applicable to any discriminatory binding problem. All programs and data sets are freely available to non-commercial users at http://www.sbg.bio.ic.ac.uk/svilp_ligand/.
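The general idea of coupling learned rules with a kernel-based classifier can be illustrated with a small sketch. This is not the authors' implementation: the pocket descriptors (`n_glycine`, `pocket_length`, `n_charged`), the three rules, and the function names are all hypothetical, chosen only to show how logical rules might be turned into a similarity measure between pockets and how precision and recall are computed for a two-class FAD/NAD problem.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical pocket description: a few numeric backbone descriptors.
Pocket = Dict[str, float]
Rule = Callable[[Pocket], bool]

# Hypothetical rules standing in for clauses an ILP system might learn.
RULES: List[Rule] = [
    lambda p: p.get("n_glycine", 0) >= 3,
    lambda p: p.get("pocket_length", 0.0) > 15.0,
    lambda p: p.get("n_charged", 0) >= 2,
]


def rule_vector(pocket: Pocket) -> List[int]:
    """Return 1 for each rule that covers the pocket, 0 otherwise."""
    return [1 if rule(pocket) else 0 for rule in RULES]


def shared_rule_kernel(a: Pocket, b: Pocket) -> int:
    """Similarity of two pockets: the number of rules covering both."""
    return sum(x * y for x, y in zip(rule_vector(a), rule_vector(b)))


def precision_recall(
    y_true: Sequence[str], y_pred: Sequence[str], positive: str
) -> Tuple[float, float]:
    """Precision and recall for the class named by `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Toy pockets with made-up descriptor values.
pocket_a: Pocket = {"n_glycine": 3, "pocket_length": 16.0, "n_charged": 0}
pocket_b: Pocket = {"n_glycine": 4, "pocket_length": 10.0, "n_charged": 2}
```

In a sketch like this, the matrix of `shared_rule_kernel` values over all training pockets could be handed to any support vector machine implementation that accepts a precomputed kernel, while the rules themselves remain readable as candidate explanations of specificity.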

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3913550
DOI: http://dx.doi.org/10.1093/protein/gzp035
