Revisiting Event Study Designs
That is a working paper by Borusyak and Jaravel on SSRN. The paper looks at an underidentification problem in event study designs, where all units in the data eventually receive the treatment at the same time (i.e. there is no control group).
Since there is no control group, one way to check the validity of such an approach rests is to check that there is not pre-trend before the treatment occurs, much like the parallel trends assumption we use for diff-in-diff designs.
A natural way to do this is to estimate the following specification for an individual-year panel:
is an individual fixed effect, is a calendar year effect, and is a time period relative to that individual’s treatment period, which I’m assuming goes from -5 to 6 (0 is treatment year). Note that
where is a calendar year and is the year of treatment for individual .
Two things here:
-
is an time-invariant individual level variable so it is accounted for by the individual fixed effect.
-
, , and are linearly related, and the authors show that in the model, are only identified up to a linear trend. That is, for any set of estimate of , you can add a linear trend in and adjust the individual or year fixed effects to get the same predicted value.1
More practically, this means that you cannot identify linear trends in . If you actually try to estimate the model in R or Stata, the program will drop two of the categories rather than just one. The interpretation of the coefficients that are returned are deviations from the line between the two categories.
To illustrate, consider an example where the true ’s are indeed linear before treatment.
If we drop and from the estimation, then what we’re estimating is the deviation from the straight line that joins and .
Just to be sure, let’s try actually estimating a very simple example data set with the same “true” effects. 2
Borusyak and Jaravel propose a couple of ways to approach this problem. In brief,
-
Estimate two models: the full model, and a “semi-dynamic model” where for all , and run an F-test on both models.
-
Use individual random effects (they also propose a Hausman test using the result that random effects is more efficient than fixed effects).
I’ll probably do a later post on these when I’ve (a) thought more about how they would fit into a project and (b) figured out how to implement them in R.
There’s more in the paper besides the pre-trend discussion that should be of interest to applied micro researchers, such as estimating time-varying effects versus a single effect for all post-treatment periods.
Leave a Comment