Three machine learning models for the 2019 Solubility Challenge.

ADMET DMPK

EaStCHEM School of Chemistry and Biomedical Sciences Research Complex, University of St Andrews, North Haugh, St Andrews, Scotland, KY16 9ST, UK.

Published: June 2020

We describe three machine learning models submitted to the 2019 Solubility Challenge. All are founded on tree-like classifiers, with one model being based on Random Forest and another on the related Extra Trees algorithm. The third model is a consensus predictor combining the former two with a Bagging classifier. We call this consensus classifier Vox Machinarum, and here discuss how it benefits from the Wisdom of Crowds. On the first 2019 Solubility Challenge test set of 100 low-variance intrinsic aqueous solubilities, Extra Trees is our best classifier. One the other, a high-variance set of 32 molecules, we find that Vox Machinarum and Random Forest both perform a little better than Extra Trees, and almost equally to one another. We also compare the gold standard solubilities from the 2019 Solubility Challenge with a set of literature-based solubilities for most of the same compounds.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915607PMC
http://dx.doi.org/10.5599/admet.835DOI Listing

Publication Analysis

Top Keywords

2019 solubility
16
solubility challenge
16
extra trees
12
three machine
8
machine learning
8
learning models
8
random forest
8
vox machinarum
8
0
4
models 2019
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!