Unknown

Dataset Information

0

Too many covariates and too few cases? - a comparative study.


ABSTRACT: Prior research indicates that 10-15 cases or controls, whichever fewer, are required per parameter to reliably estimate regression coefficients in multivariable logistic regression models. This condition may be difficult to meet even in a well-designed study when the number of potential confounders is large, the outcome is rare, and/or interactions are of interest. Various propensity score approaches have been implemented when the exposure is binary. Recent work on shrinkage approaches like lasso were motivated by the critical need to develop methods for the p >> n situation, where p is the number of parameters and n is the sample size. Those methods, however, have been less frequently used when p?n, and in this situation, there is no guidance on choosing among regular logistic regression models, propensity score methods, and shrinkage approaches. To fill this gap, we conducted extensive simulations mimicking our motivating clinical data, estimating vaccine effectiveness for preventing influenza hospitalizations in the 2011-2012 influenza season. Ridge regression and penalized logistic regression models that penalize all but the coefficient of the exposure may be considered in these types of studies. Copyright © 2016 John Wiley & Sons, Ltd.

SUBMITTER: Chen Q 

PROVIDER: S-EPMC5050102 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Too many covariates and too few cases? - a comparative study.

Chen Qingxia Q   Nian Hui H   Zhu Yuwei Y   Talbot H Keipp HK   Griffin Marie R MR   Harrell Frank E FE  

Statistics in medicine 20160630 25


Prior research indicates that 10-15 cases or controls, whichever fewer, are required per parameter to reliably estimate regression coefficients in multivariable logistic regression models. This condition may be difficult to meet even in a well-designed study when the number of potential confounders is large, the outcome is rare, and/or interactions are of interest. Various propensity score approaches have been implemented when the exposure is binary. Recent work on shrinkage approaches like lass  ...[more]

Similar Datasets

| S-EPMC6456380 | biostudies-literature
| S-EPMC6736290 | biostudies-literature
| S-EPMC4693066 | biostudies-literature
| S-EPMC2657741 | biostudies-literature
| S-EPMC10030646 | biostudies-literature
| S-EPMC8239176 | biostudies-literature
2023-08-04 | GSE223483 | GEO
| S-EPMC6279308 | biostudies-literature
| S-EPMC3922615 | biostudies-literature
| S-EPMC4578923 | biostudies-literature