Unknown

Dataset Information

0

Matching and Regression to the Mean in Difference-in-Differences Analysis.


ABSTRACT:

Objective

To demonstrate regression to the mean bias introduced by matching on preperiod variables in difference-in-differences studies.

Data sources

Simulated data.

Study design

We performed a Monte Carlo simulation to estimate the effect of a placebo intervention on simulated longitudinal data for units in treatment and control groups using unmatched and matched difference-in-differences analyses. We varied the preperiod level and trend differences between the treatment and control groups, and the serial correlation of the matching variables. We assessed estimator bias as the mean absolute deviation of estimated program effects from the true value of zero.

Principal findings

When preperiod outcome level is correlated with treatment assignment, an unmatched analysis is unbiased, but matching units on preperiod outcome levels produces biased estimates. The bias increases with greater preperiod level differences and weaker serial correlation in the outcome. This problem extends to matching on preperiod level of a time-varying covariate. When treatment assignment is correlated with preperiod trend only, the unmatched analysis is biased, and matching units on preperiod level or trend does not introduce additional bias.

Conclusions

Researchers should be aware of the threat of regression to the mean when constructing matched samples for difference-in-differences. We provide guidance on when to incorporate matching in this study design.

SUBMITTER: Daw JR 

PROVIDER: S-EPMC6232412 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Matching and Regression to the Mean in Difference-in-Differences Analysis.

Daw Jamie R JR   Hatfield Laura A LA  

Health services research 20180629 6


<h4>Objective</h4>To demonstrate regression to the mean bias introduced by matching on preperiod variables in difference-in-differences studies.<h4>Data sources</h4>Simulated data.<h4>Study design</h4>We performed a Monte Carlo simulation to estimate the effect of a placebo intervention on simulated longitudinal data for units in treatment and control groups using unmatched and matched difference-in-differences analyses. We varied the preperiod level and trend differences between the treatment a  ...[more]

Similar Datasets

| S-EPMC6916299 | biostudies-literature
| S-EPMC7770064 | biostudies-literature
| S-EPMC7846118 | biostudies-literature
| S-EPMC6377539 | biostudies-literature
| S-EPMC3888642 | biostudies-literature
| S-EPMC7106802 | biostudies-literature
| S-EPMC4838740 | biostudies-other
| S-EPMC7541515 | biostudies-literature
| S-EPMC7374798 | biostudies-literature
| S-EPMC7240769 | biostudies-literature