Unknown

Dataset Information

0

Targeted maximum likelihood estimation for a binary treatment: A tutorial.


ABSTRACT: When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G-formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In contrast propensity score methods require the correct specification of an exposure model. Double-robust methods only require correct specification of either the outcome or the exposure model. Targeted maximum likelihood estimation is a semiparametric double-robust method that improves the chances of correct model specification by allowing for flexible estimation using (nonparametric) machine-learning methods. It therefore requires weaker assumptions than its competitors. We provide a step-by-step guided implementation of TMLE and illustrate it in a realistic scenario based on cancer epidemiology where assumptions about correct model specification and positivity (ie, when a study participant had 0 probability of receiving the treatment) are nearly violated. This article provides a concise and reproducible educational introduction to TMLE for a binary outcome and exposure. The reader should gain sufficient understanding of TMLE from this introductory tutorial to be able to apply the method in practice. Extensive R-code is provided in easy-to-read boxes throughout the article for replicability. Stata users will find a testing implementation of TMLE and additional material in the Appendix S1 and at the following GitHub repository: https://github.com/migariane/SIM-TMLE-tutorial.

SUBMITTER: Luque-Fernandez MA 

PROVIDER: S-EPMC6032875 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Targeted maximum likelihood estimation for a binary treatment: A tutorial.

Luque-Fernandez Miguel Angel MA   Schomaker Michael M   Rachet Bernard B   Schnitzer Mireille E ME  

Statistics in medicine 20180423 16


When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G-formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In contrast propensity score methods require the correct specification of an exposure model. Double-robust methods only require correct specification of either the outcome or the exposure  ...[more]

Similar Datasets

| S-EPMC3818128 | biostudies-literature
| S-EPMC6053284 | biostudies-literature
| S-EPMC6800798 | biostudies-literature
| S-EPMC9489667 | biostudies-literature
| S-EPMC11228874 | biostudies-literature
| S-EPMC5051604 | biostudies-literature
| S-EPMC8577774 | biostudies-literature
| S-EPMC4405134 | biostudies-literature
| S-EPMC5008986 | biostudies-literature
| S-EPMC10748807 | biostudies-literature