Unknown

Dataset Information

0

Penalized logistic regression with low prevalence exposures beyond high dimensional settings.


ABSTRACT: Estimating and selecting risk factors with extremely low prevalences of exposure for a binary outcome is a challenge because classical standard techniques, markedly logistic regression, often fail to provide meaningful results in such settings. While penalized regression methods are widely used in high-dimensional settings, we were able to show their usefulness in low-dimensional settings as well. Specifically, we demonstrate that Firth correction, ridge, the lasso and boosting all improve the estimation for low-prevalence risk factors. While the methods themselves are well-established, comparison studies are needed to assess their potential benefits in this context. This is done here using the dataset of a large unmatched case-control study from France (2005-2008) about the relationship between prescription medicines and road traffic accidents and an accompanying simulation study. Results show that the estimation of risk factors with prevalences below 0.1% can be drastically improved by using Firth correction and boosting in particular, especially for ultra-low prevalences. When a moderate number of low prevalence exposures is available, we recommend the use of penalized techniques.

SUBMITTER: Doerken S 

PROVIDER: S-EPMC6527211 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Penalized logistic regression with low prevalence exposures beyond high dimensional settings.

Doerken Sam S   Avalos Marta M   Lagarde Emmanuel E   Schumacher Martin M  

PloS one 20190520 5


Estimating and selecting risk factors with extremely low prevalences of exposure for a binary outcome is a challenge because classical standard techniques, markedly logistic regression, often fail to provide meaningful results in such settings. While penalized regression methods are widely used in high-dimensional settings, we were able to show their usefulness in low-dimensional settings as well. Specifically, we demonstrate that Firth correction, ridge, the lasso and boosting all improve the e  ...[more]

Similar Datasets

| S-EPMC3348559 | biostudies-literature
| S-EPMC2732298 | biostudies-literature
| S-EPMC7799181 | biostudies-literature
| S-EPMC2567351 | biostudies-literature
| S-EPMC6642380 | biostudies-literature
| S-EPMC3842118 | biostudies-other
| S-EPMC7654973 | biostudies-literature
| S-EPMC8375316 | biostudies-literature
| S-EPMC7331150 | biostudies-literature
| S-EPMC4266195 | biostudies-literature