Comparing global and local likelihood score thresholds in multiclass laplacian-modified Naive Bayes protein target prediction.

Comb Chem High Throughput Screen

Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.

Published: October 2015

The increase of publicly available bioactivity data has led to the extensive development and usage of in silico bioactivity prediction algorithms. A particularly popular approach for such analyses is the multiclass Naïve Bayes, whose output is commonly processed by applying empirically-derived likelihood score thresholds. In this work, we describe a systematic way for deriving score cut-offs on a per-protein target basis and compare their performance with global thresholds on a large scale using both 5-fold cross-validation (ChEMBL 14, 189k ligand-protein pairs over 477 protein targets) and external validation (WOMBAT, 63k pairs, 421 targets). The individual protein target cut-offs derived were compared to global cut-offs ranging from -10 to 40 in score bouts of 2.5. The results indicate that individual thresholds had equal or better performance in all comparisons with global thresholds, ranging from 95% of protein targets to 57.96%. It is shown that local thresholds behave differently for particular families of targets (CYPs, GPCRs, Kinases and TFs). Furthermore, we demonstrate the discrepancy in performance when we move away from the training dataset chemical space, using Tanimoto similarity as a metric (from 0 to 1 in steps of 0.2). Finally, the individual protein score cut-offs derived for the in silico bioactivity application used in this work are released, as well as the reproducible and transferable KNIME workflows used to carry out the analysis.

Download full-text PDF

Source
http://dx.doi.org/10.2174/1386207318666150305145012DOI Listing

Publication Analysis

Top Keywords

likelihood score
8
score thresholds
8
protein target
8
silico bioactivity
8
score cut-offs
8
global thresholds
8
protein targets
8
individual protein
8
cut-offs derived
8
thresholds
6

Similar Publications

Validity of the MED4CHILD tool for assessing adherence to the Mediterranean diet in preschool children.

Eur J Pediatr

January 2025

Growth, Exercise, Nutrition and Development (GENUD) Research Group, Instituto Agroalimentario de Aragón (IA2), Faculty of Health Sciences, Universidad de Zaragoza, Instituto de Investigación Sanitaria de Aragón (IIS Aragón), 50009, Saragossa, Spain.

Unlabelled: Most of the available tools to assess adherence to Mediterranean diet (MedDiet) were constructed for adults, having limited applicability to children and adolescents. The aim of this study is to validate a specific questionnaire to assess adherence to MedDiet in children aged 3 to 6 years (MED4CHILD questionnaire). The validation was performed in a baseline examination of a cohort of children who were recruited in schools in seven cities.

View Article and Find Full Text PDF

Background: The aim was to assess whether the postoperative Oxford Hip Score (OHS) demonstrated a ceiling effect at 1 or 2 years after total hip arthroplasty (THA) and to identify which patients are more likely to achieve a ceiling score and whether this limits assessment of their outcome.

Methods: A retrospective cohort of 7871 patients undergoing primary THA was identified from an established arthroplasty database. Patient demographics, ASA grade, socioeconomic status, OHS and EuroQol questionnaire were collected preoperatively and at 1 and 2 years postoperatively.

View Article and Find Full Text PDF

Background: Although evidence suggests that dental floss contains perfluoroalkyl and polyfluoroalkyl substances (PFASs), it is still uncertain whether the use of dental floss contributes to an increased risk of PFAS exposure.

Methods: We analysed data on serum PFAS concentrations and dental floss usage in a cohort of 6750 adults who participated in the National Health and Nutrition Examination Survey (NHANES) from 2009 to 2020. In our study, we used logistic regression, a survey-weighted linear model, item response theory (IRT) scores, inverse probability weights (IPWs) and sensitivity analysis to assess the potential impact of dental floss usage on human serum PFAS levels.

View Article and Find Full Text PDF

Introduction: This analysis aimed to investigate diabetes-specific psychological outcomes among adults with type 1 diabetes (T1D) using hybrid closed-loop (HCL) versus standard therapy.

Research Design And Methods: In this multicenter, open-label, randomized, controlled, parallel-group clinical trial, adults with T1D were allocated to 26 weeks of HCL (MiniMed™ 670G) or standard therapy (insulin pump or multiple daily injections without real-time continuous glucose monitoring). Psychological outcomes (awareness and fear of hypoglycemia; and diabetes-specific positive well-being, diabetes distress, diabetes treatment satisfaction, and diabetes-specific quality of life (QoL)) were measured at enrollment, mid-trial and end-trial.

View Article and Find Full Text PDF

Objective: Preventing return to alcohol is of critical importance for patients with alcohol-related cirrhosis and/or alcohol-associated hepatitis. Acamprosate is a widely used treatment for alcohol use disorder (AUD). We assessed the impact of acamprosate prescription in patients with advanced liver disease on abstinence rates and clinical outcomes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!