
Dataset Information


Comprehensible Predictive Modeling Using Regularized Logistic Regression and Comorbidity Based Features.


ABSTRACT: Different studies have demonstrated the importance of comorbidities to better understand the origin and evolution of medical complications. This study focuses on improvement of the predictive model interpretability based on simple logical features representing comorbidities. We use group lasso based feature interaction discovery followed by a post-processing step, where simple logic terms are added. In the final step, we reduce the feature set by applying lasso logistic regression to obtain a compact set of non-zero coefficients that represent a more comprehensible predictive model. The effectiveness of the proposed approach was demonstrated on a pediatric hospital discharge dataset that was used to build a readmission risk estimation model. The evaluation of the proposed method demonstrates a reduction of the initial set of features in a regression model by 72%, with a slight improvement in the Area Under the ROC Curve metric from 0.763 (95% CI: 0.755-0.771) to 0.769 (95% CI: 0.761-0.777). Additionally, our results show improvement in comprehensibility of the final predictive model using simple comorbidity based terms for logistic regression.
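The final step described in the abstract (lasso logistic regression over comorbidity features extended with simple logic terms) can be illustrated with a minimal sketch. It is not the authors' pipeline: the group lasso interaction discovery step is replaced by simply enumerating all pairwise AND terms, the data are synthetic, and the feature names (dx0 ... dx5), the penalty strength, and the proximal-gradient solver are assumptions chosen for illustration only.

```python
import math
import random
from itertools import combinations


def soft_threshold(x, t):
    """Proximal operator of the L1 penalty: shrink x toward zero by t."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def add_and_terms(X, names):
    """Append one logical AND column for every pair of binary features."""
    pairs = list(combinations(range(len(names)), 2))
    new_names = names + [f"{names[i]} AND {names[j]}" for i, j in pairs]
    new_X = [row + [row[i] * row[j] for i, j in pairs] for row in X]
    return new_X, new_names


def fit_lasso_logistic(X, y, lam=0.02, lr=0.5, n_iter=200):
    """L1-penalized logistic regression via proximal gradient descent.

    The intercept b is not penalized; lam and lr are illustrative values.
    """
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            grad_b += err
            for j, xj in enumerate(xi):
                if xj:
                    grad_w[j] += err * xj
        b -= lr * grad_b / n
        # Gradient step followed by soft-thresholding sets weak weights to 0.
        w = [soft_threshold(wj - lr * gj / n, lr * lam)
             for wj, gj in zip(w, grad_w)]
    return w, b


random.seed(0)
names = [f"dx{k}" for k in range(6)]  # hypothetical comorbidity flags
X = [[int(random.random() < 0.3) for _ in names] for _ in range(400)]
# Synthetic outcome: risk is elevated only when dx0 and dx1 co-occur.
y = [int(random.random() < (0.8 if r[0] and r[1] else 0.1)) for r in X]

X_ext, names_ext = add_and_terms(X, names)
w, b = fit_lasso_logistic(X_ext, y)
selected = {n: round(c, 3) for n, c in zip(names_ext, w) if c != 0.0}
print(f"{len(selected)} of {len(names_ext)} features kept")
print(selected)
```

In this toy setting the L1 penalty drives many coefficients to exactly zero, which is the mechanism behind the compact, more readable model the abstract refers to. The 72% reduction and the AUC values reported above come from the study's own data and are not reproduced by this sketch.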

SUBMITTER: Stiglic G 

PROVIDER: S-EPMC4672891 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature


Publications

Comprehensible Predictive Modeling Using Regularized Logistic Regression and Comorbidity Based Features.

Stiglic G, Povalej Brzan P, Fijacko N, Wang F, Delibasic B, Kalousis A, Obradovic Z

PLoS One, 2015-12-08, issue 12



Similar Datasets

| S-EPMC8596493 | biostudies-literature
| S-EPMC6305753 | biostudies-literature
| S-EPMC6399553 | biostudies-literature
| S-EPMC4769543 | biostudies-literature
| S-EPMC8377384 | biostudies-literature
| S-EPMC3875235 | biostudies-literature
| S-EPMC7439205 | biostudies-literature
| S-EPMC10471899 | biostudies-literature
| S-EPMC4046566 | biostudies-literature
| S-EPMC6030386 | biostudies-literature