Dataset Information

Variable selection for zero-inflated and overdispersed data with application to health care demand in Germany.


ABSTRACT: In health services and outcome research, count outcomes are frequently encountered and often have a large proportion of zeros. The zero-inflated negative binomial (ZINB) regression model has important applications for this type of data. With many possible candidate risk factors, this paper proposes new variable selection methods for the ZINB model. We consider the maximum likelihood function plus a penalty, where the penalty is the least absolute shrinkage and selection operator (LASSO), the smoothly clipped absolute deviation (SCAD), or the minimax concave penalty (MCP). An EM (expectation-maximization) algorithm is proposed for estimating the model parameters and conducting variable selection simultaneously. This algorithm consists of estimating penalized weighted negative binomial models and penalized logistic models via the coordinate descent algorithm. Furthermore, statistical properties including the standard error formulae are provided. A simulation study shows that the new algorithm yields estimates that are more accurate than, or at least comparable to, those from traditional stepwise variable selection, and that it is also more robust. The proposed methods are applied to analyze the health care demand in Germany using the open-source R package mpath.
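
The three penalties named in the abstract have standard closed forms for a single coefficient. The following is a minimal Python sketch of those textbook definitions, included only to make the shapes of the penalties concrete. It is not the mpath implementation; the function names and the default tuning values (a = 3.7 for SCAD, gamma = 3.0 for MCP) are illustrative assumptions.

```python
def lasso_penalty(beta, lam):
    """LASSO: penalty grows linearly in |beta| without bound."""
    return lam * abs(beta)


def scad_penalty(beta, lam, a=3.7):
    """SCAD: linear near zero, quadratic transition, then constant.

    Requires a > 2. The default a = 3.7 is a commonly used value,
    assumed here for illustration.
    """
    b = abs(beta)
    if b <= lam:
        return lam * b
    if b <= a * lam:
        return (2 * a * lam * b - b * b - lam * lam) / (2 * (a - 1))
    return lam * lam * (a + 1) / 2


def mcp_penalty(beta, lam, gamma=3.0):
    """MCP: concave quadratic up to gamma * lam, then constant.

    Requires gamma > 1. The default gamma = 3.0 is an assumption
    made for illustration.
    """
    b = abs(beta)
    if b <= gamma * lam:
        return lam * b - b * b / (2 * gamma)
    return gamma * lam * lam / 2


if __name__ == "__main__":
    # Compare the penalties on a small and a large coefficient.
    for beta in (0.5, 10.0):
        print(beta,
              lasso_penalty(beta, 1.0),
              scad_penalty(beta, 1.0),
              mcp_penalty(beta, 1.0))
```

For small coefficients all three penalties behave similarly; for large coefficients SCAD and MCP level off at a constant while LASSO keeps growing, which is why the two nonconvex penalties shrink large effects less.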

SUBMITTER: Wang Z 

PROVIDER: S-EPMC5525141 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature

Publications

Variable selection for zero-inflated and overdispersed data with application to health care demand in Germany.

Wang Z, Ma S, Wang CY

Biometrical journal. Biometrische Zeitschrift, 2015-06-08, issue 5


Similar Datasets

| S-EPMC7308073 | biostudies-literature
| S-EPMC10077942 | biostudies-literature
| S-EPMC4493133 | biostudies-literature
| S-EPMC7495888 | biostudies-literature
| S-EPMC10087693 | biostudies-literature
| S-EPMC5885979 | biostudies-literature
| S-EPMC7924499 | biostudies-literature
| S-EPMC4988952 | biostudies-literature
| S-EPMC7768662 | biostudies-literature
| S-EPMC10523642 | biostudies-literature