Unknown

Dataset Information

0

Mitigating Bias in Generalized Linear Mixed Models: The Case for Bayesian Nonparametrics.


ABSTRACT: Generalized linear mixed models are a common statistical tool for the analysis of clustered or longitudinal data where correlation is accounted for through cluster-specific random effects. In practice, the distribution of the random effects is typically taken to be a Normal distribution, although if this does not hold then the model is misspecified and standard estimation/inference may be invalid. An alternative is to perform a so-called nonparametric Bayesian analyses in which one assigns a Dirichlet process (DP) prior to the unknown distribution of the random effects. In this paper we examine operating characteristics for estimation of fixed effects and random effects based on such an analysis under a range of "true" random effects distributions. As part of this we investigate various approaches for selection of the precision parameter of the DP prior. In addition, we illustrate the use of the methods with an analysis of post-operative complications among n = 18, 643 female Medicare beneficiaries who underwent a hysterectomy procedure at N = 503 hospitals in the US. Overall, we conclude that using the DP priori n modeling the random effect distribution results in large reductions of bias with little loss of efficiency. While no single choice for the precision parameter will be optimal in all settings, certain strategies such as importance sampling or empirical Bayes can be used to obtain reasonable results in a broad range of data scenarios.

SUBMITTER: Antonelli J 

PROVIDER: S-EPMC5624537 | biostudies-literature | 2016 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mitigating Bias in Generalized Linear Mixed Models: The Case for Bayesian Nonparametrics.

Antonelli Joseph J   Trippa Lorenzo L   Haneuse Sebastien S  

Statistical science : a review journal of the Institute of Mathematical Statistics 20160210 1


Generalized linear mixed models are a common statistical tool for the analysis of clustered or longitudinal data where correlation is accounted for through cluster-specific random effects. In practice, the distribution of the random effects is typically taken to be a Normal distribution, although if this does not hold then the model is misspecified and standard estimation/inference may be invalid. An alternative is to perform a so-called nonparametric Bayesian analyses in which one assigns a Dir  ...[more]

Similar Datasets

| S-EPMC2883299 | biostudies-literature
| S-EPMC5505735 | biostudies-literature
| S-EPMC9358725 | biostudies-literature
| S-EPMC10769057 | biostudies-literature
| S-EPMC7398826 | biostudies-literature
| S-EPMC3567190 | biostudies-literature
| S-EPMC3709111 | biostudies-literature
| S-EPMC6397884 | biostudies-literature
| S-EPMC3399744 | biostudies-literature
| S-EPMC3375403 | biostudies-literature