Simultaneous regression and classification for drug sensitivity prediction using an advanced random forest method.

Sci Rep

Center for Bioinformatics, Saarland University, Saarland Informatics Campus (E2.1), 66123, Saarbrücken, Saarland, Germany.

Published: August 2022

Machine learning methods trained on cancer cell line panels are intensively studied for the prediction of optimal anti-cancer therapies. While classification approaches distinguish effective from ineffective drugs, regression approaches aim to quantify the degree of drug effectiveness. However, the high specificity of most anti-cancer drugs induces a skewed distribution of drug response values in favor of the more drug-resistant cell lines, negatively affecting the classification performance (class imbalance) and regression performance (regression imbalance) for the sensitive cell lines. Here, we present a novel approach called SimultAneoUs Regression and classificatiON Random Forests (SAURON-RF) based on the idea of performing a joint regression and classification analysis. We demonstrate that SAURON-RF improves the classification and regression performance for the sensitive cell lines at the expense of a moderate loss for the resistant ones. Furthermore, our results show that simultaneous classification and regression can be superior to regression or classification alone.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356072PMC
http://dx.doi.org/10.1038/s41598-022-17609-xDOI Listing

Publication Analysis

Top Keywords

regression classification
16
cell lines
12
simultaneous regression
8
classification
8
regression
8
regression performance
8
sensitive cell
8
classification regression
8
classification drug
4
drug sensitivity
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!