Unknown

Dataset Information

0

Comparing Logistic Regression Models with Alternative Machine Learning Methods to Predict the Risk of Drug Intoxication Mortality.


ABSTRACT: (1) Medical research has shown an increasing interest in machine learning, permitting massive multivariate data analysis. Thus, we developed drug intoxication mortality prediction models, and compared machine learning models and traditional logistic regression. (2) Categorized as drug intoxication, 8,937 samples were extracted from the Korea Centers for Disease Control and Prevention (2008-2017). We trained, validated, and tested each model through data and compared their performance using three measures: Brier score, calibration slope, and calibration-in-the-large. (3) A chi-square test demonstrated that mortality risk statistically significantly differed according to severity, intent, toxic substance, age, and sex. The multilayer perceptron model (MLP) had the highest area under the curve (AUC), and lowest Brier score in training and validation phases, while the logistic regression model (LR) showed the highest AUC (0.827) and lowest Brier score (0.0307) in the testing phase. MLP also had the second-highest AUC (0.816) and second-lowest Brier score (0.003258) in the testing phase, demonstrating better performance than the decision-making tree model. (4) Given the complexity of choosing tuning parameters, LR proved competitive when using medical datasets, which require strict accuracy.

SUBMITTER: Choi Y 

PROVIDER: S-EPMC7037603 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Comparing Logistic Regression Models with Alternative Machine Learning Methods to Predict the Risk of Drug Intoxication Mortality.

Choi YoungJin Y   Boo YooKyung Y  

International journal of environmental research and public health 20200131 3


(1) Medical research has shown an increasing interest in machine learning, permitting massive multivariate data analysis. Thus, we developed drug intoxication mortality prediction models, and compared machine learning models and traditional logistic regression. (2) Categorized as drug intoxication, 8,937 samples were extracted from the Korea Centers for Disease Control and Prevention (2008-2017). We trained, validated, and tested each model through data and compared their performance using three  ...[more]

Similar Datasets

| S-EPMC6786577 | biostudies-literature
| S-EPMC4143639 | biostudies-literature
| S-EPMC7092376 | biostudies-literature
| S-EPMC3505214 | biostudies-other
| S-EPMC5091918 | biostudies-literature
| S-EPMC6874355 | biostudies-literature
| S-EPMC8096078 | biostudies-literature
| S-EPMC6094446 | biostudies-literature
| S-EPMC5568368 | biostudies-literature
| S-EPMC8688959 | biostudies-literature