Dataset Information

Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia.

ABSTRACT: Predicting relapse in childhood acute lymphoblastic leukemia (ALL) is critical for successful treatment and follow-up planning. Our goal was to construct an ALL relapse prediction model based on machine learning algorithms. Monte Carlo cross-validation nested with 10-fold cross-validation was used to rank clinical variables on the randomly split training sets of 336 children with newly diagnosed ALL, and a forward feature selection algorithm was employed to find the shortest list of the most discriminatory variables. To obtain an unbiased estimate of the model's performance on new patients, we evaluated it not only on the split test sets of 150 patients but also on an independent data set of 84 patients. The Random Forest model with 14 features achieved a cross-validation accuracy of 0.827 ± 0.031 on one set and an accuracy of 0.798 on the other, with areas under the curve of 0.902 ± 0.027 and 0.904, respectively. The model performed well across different risk-level groups, with the best accuracy of 0.829 in the standard-risk group. To our knowledge, this is the first study to use machine learning models to predict childhood ALL relapse from electronic medical record data, which will further facilitate stratified treatment.
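The resampling and selection steps named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' code. The total of 486 patients is inferred from the 336/150 split sizes given above; the number of repeats, the seed, the feature names (`f1`, `f2`, `f3`) and `toy_score` are hypothetical placeholders. In a real analysis the score function would be something like the 10-fold cross-validated accuracy of a Random Forest on the candidate feature subset.

```python
import random
import statistics


def monte_carlo_splits(n_samples, n_train, n_repeats, seed=0):
    """Yield (train_idx, test_idx) pairs from repeated random splits."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    for _ in range(n_repeats):
        shuffled = indices[:]
        rng.shuffle(shuffled)
        yield shuffled[:n_train], shuffled[n_train:]


def forward_select(ranked_features, score_fn):
    """Greedy forward selection over a pre-ranked feature list.

    A feature is kept only if adding it strictly improves the score,
    which favours a short list of discriminatory variables.
    """
    selected, best = [], float("-inf")
    for feature in ranked_features:
        score = score_fn(selected + [feature])
        if score > best:
            selected.append(feature)
            best = score
    return selected, best


def mean_sd(scores):
    """Summarise per-split scores as (mean, sample standard deviation)."""
    return statistics.mean(scores), statistics.stdev(scores)


def toy_score(features):
    # Hypothetical stand-in for a cross-validated accuracy: rewards the
    # two "informative" placeholder features, with a small size penalty.
    return len(set(features) & {"f1", "f3"}) - 0.01 * len(features)


# 336 training + 150 test patients per split (sizes from the abstract);
# 5 repeats is an arbitrary choice for illustration.
splits = list(monte_carlo_splits(n_samples=486, n_train=336, n_repeats=5))
selected, _ = forward_select(["f1", "f2", "f3"], toy_score)
```

Running the selection inside each Monte Carlo split, rather than once on all data, keeps the held-out patients out of the feature-ranking step; `mean_sd` would then be applied to the per-split test scores to report a value in the "mean ± SD" form used above.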

SUBMITTER: Pan L

PROVIDER: S-EPMC5547099 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

Publications

Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia.

Pan Liyan, Liu Guangjian, Lin Fangqin, Zhong Shuling, Xia Huimin, Sun Xin, Liang Huiying

Scientific Reports, 2017 Aug 7 (issue 1)

PMID: 28784991

Similar Datasets

Prediction of tumor lysis syndrome in childhood acute lymphoblastic leukemia based on machine learning models: a retrospective study.
S-EPMC10955075 | biostudies-literature

FPGS relapse-specific mutations in relapsed childhood acute lymphoblastic leukemia.
S-EPMC7374087 | biostudies-literature

Relapse-specific mutations in NT5C2 in childhood acute lymphoblastic leukemia.
S-EPMC3681285 | biostudies-literature

Can Machine Learning Models Predict Asparaginase-associated Pancreatitis in Childhood Acute Lymphoblastic Leukemia.
S-EPMC8946594 | biostudies-literature

Down-Regulated FOXO1 in Refractory/Relapse Childhood B-Cell Acute Lymphoblastic Leukemia.
S-EPMC7686545 | biostudies-literature

A pilot study of implication of machine learning for relapse prediction after allogeneic stem cell transplantation in adults with Ph-positive acute lymphoblastic leukemia.
S-EPMC10556079 | biostudies-literature

Drug Resistance Biomarkers and Their Clinical Applications in Childhood Acute Lymphoblastic Leukemia.
S-EPMC6978753 | biostudies-literature

Proteogenomics of High hyperdiploid childhood acute lymphoblastic leukemia
2019-02-25 | PXD010175 | Pride

The Role of miRNAs in Childhood Acute Lymphoblastic Leukemia Relapse and the Associated Molecular Mechanisms.
S-EPMC10779195 | biostudies-literature

CREBBP HAT domain mutations prevail in relapse cases of high hyperdiploid childhood acute lymphoblastic leukemia.
S-EPMC4194312 | biostudies-literature