Reducing bias in difference-in-differences models using entropy balancing
Authors:
Matthew Cefalu,
Brian G. Vegetabile,
Michael Dworsky,
Christine Eibner,
Federico Girosi
Abstract:
This paper illustrates the use of entropy balancing in difference-in-differences analyses when pre-intervention outcome trends suggest a possible violation of the parallel trends assumption. We describe a set of assumptions under which weighting to balance intervention and comparison groups on pre-intervention outcome trends leads to consistent difference-in-differences estimates even when pre-int…
▽ More
This paper illustrates the use of entropy balancing in difference-in-differences analyses when pre-intervention outcome trends suggest a possible violation of the parallel trends assumption. We describe a set of assumptions under which weighting to balance intervention and comparison groups on pre-intervention outcome trends leads to consistent difference-in-differences estimates even when pre-intervention outcome trends are not parallel. Simulated results verify that entropy balancing of pre-intervention outcomes trends can remove bias when the parallel trends assumption is not directly satisfied, and thus may enable researchers to use difference-in-differences designs in a wider range of observational settings than previously acknowledged.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
Deep Generative Modeling in Network Science with Applications to Public Policy Research
Authors:
Gavin S. Hartnett,
Raffaele Vardavas,
Lawrence Baker,
Michael Chaykowsky,
C. Ben Gibson,
Federico Girosi,
David P. Kennedy,
Osonde A. Osoba
Abstract:
Network data is increasingly being used in quantitative, data-driven public policy research. These are typically very rich datasets that contain complex correlations and inter-dependencies. This richness both promises to be quite useful for policy research, while at the same time posing a challenge for the useful extraction of information from these datasets - a challenge which calls for new data…
▽ More
Network data is increasingly being used in quantitative, data-driven public policy research. These are typically very rich datasets that contain complex correlations and inter-dependencies. This richness both promises to be quite useful for policy research, while at the same time posing a challenge for the useful extraction of information from these datasets - a challenge which calls for new data analysis methods. In this report, we formulate a research agenda of key methodological problems whose solutions would enable new advances across many areas of policy research. We then review recent advances in applying deep learning to network data, and show how these methods may be used to address many of the methodological problems we identified. We particularly emphasize deep generative methods, which can be used to generate realistic synthetic networks useful for microsimulation and agent-based models capable of informing key public policy questions. We extend these recent advances by developing a new generative framework which applies to large social contact networks commonly used in epidemiological modeling. For context, we also compare and contrast these recent neural network-based approaches with the more traditional Exponential Random Graph Models. Lastly, we discuss some open problems where more progress is needed.
△ Less
Submitted 16 October, 2020; v1 submitted 15 October, 2020;
originally announced October 2020.