Unknown

Dataset Information

0

Simultaneous regression and classification for drug sensitivity prediction using an advanced random forest method.


ABSTRACT: Machine learning methods trained on cancer cell line panels are intensively studied for the prediction of optimal anti-cancer therapies. While classification approaches distinguish effective from ineffective drugs, regression approaches aim to quantify the degree of drug effectiveness. However, the high specificity of most anti-cancer drugs induces a skewed distribution of drug response values in favor of the more drug-resistant cell lines, negatively affecting the classification performance (class imbalance) and regression performance (regression imbalance) for the sensitive cell lines. Here, we present a novel approach called SimultAneoUs Regression and classificatiON Random Forests (SAURON-RF) based on the idea of performing a joint regression and classification analysis. We demonstrate that SAURON-RF improves the classification and regression performance for the sensitive cell lines at the expense of a moderate loss for the resistant ones. Furthermore, our results show that simultaneous classification and regression can be superior to regression or classification alone.

SUBMITTER: Lenhof K 

PROVIDER: S-EPMC9356072 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Simultaneous regression and classification for drug sensitivity prediction using an advanced random forest method.

Lenhof Kerstin K   Eckhart Lea L   Gerstner Nico N   Kehl Tim T   Lenhof Hans-Peter HP  

Scientific reports 20220805 1


Machine learning methods trained on cancer cell line panels are intensively studied for the prediction of optimal anti-cancer therapies. While classification approaches distinguish effective from ineffective drugs, regression approaches aim to quantify the degree of drug effectiveness. However, the high specificity of most anti-cancer drugs induces a skewed distribution of drug response values in favor of the more drug-resistant cell lines, negatively affecting the classification performance (cl  ...[more]

Similar Datasets

| S-EPMC5595802 | biostudies-literature
| S-EPMC6173405 | biostudies-literature
| S-EPMC9780130 | biostudies-literature
| S-EPMC9817173 | biostudies-literature
| S-EPMC7508310 | biostudies-literature
| S-EPMC6392252 | biostudies-literature
| S-EPMC5423585 | biostudies-literature
2005-07-30 | E-GEOD-3034 | biostudies-arrayexpress
| S-EPMC11192156 | biostudies-literature
| S-EPMC7188832 | biostudies-literature