-
Fast Learning of Optimal Policy Trees
Authors:
James Cussens,
Julia Hatamyar,
Vishalie Shah,
Noemi Kreif
Abstract:
We develop and implement a version of the popular "policytree" method (Athey and Wager, 2021) using discrete optimisation techniques. We test the performance of our algorithm in finite samples and find an improvement in the runtime of optimal policy tree learning by a factor of nearly 50 compared to the original version. We provide an R package, "fastpolicytree", for public use.
We develop and implement a version of the popular "policytree" method (Athey and Wager, 2021) using discrete optimisation techniques. We test the performance of our algorithm in finite samples and find an improvement in the runtime of optimal policy tree learning by a factor of nearly 50 compared to the original version. We provide an R package, "fastpolicytree", for public use.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Exploring the heterogeneous impacts of Indonesia's conditional cash transfer scheme (PKH) on maternal health care utilisation using instrumental causal forests
Authors:
Vishalie Shah,
Julia Hatamyar,
Taufik Hidayat,
Noemi Kreif
Abstract:
This paper uses instrumental causal forests, a novel machine learning method, to explore the treatment effect heterogeneity of Indonesia's conditional cash transfer scheme on maternal health care utilisation. Using randomised programme assignment as an instrument for enrollment in the scheme, we estimate conditional local average treatment effects for four key outcomes: good assisted delivery, del…
▽ More
This paper uses instrumental causal forests, a novel machine learning method, to explore the treatment effect heterogeneity of Indonesia's conditional cash transfer scheme on maternal health care utilisation. Using randomised programme assignment as an instrument for enrollment in the scheme, we estimate conditional local average treatment effects for four key outcomes: good assisted delivery, delivery in a health care facility, pre-natal visits, and post-natal visits. We find significant treatment effect heterogeneity by supply-side characteristics, even though supply-side readiness was taken into account during programme development. Mothers in areas with more doctors, nurses, and delivery assistants were more likely to benefit from the programme, in terms of increased rates of good assisted delivery outcome. We also find large differences in benefits according to indicators of household poverty and survey wave, reflecting the possible impact of changes in programme design in its later years. The impact on post-natal visits in 2013 displayed the largest heterogeneity among all outcomes, with some women less likely to attend post-natal check ups after receiving the cash transfer in the long term.
△ Less
Submitted 22 January, 2025;
originally announced January 2025.
-
Learning control variables and instruments for causal analysis in observational data
Authors:
Nicolas Apfel,
Julia Hatamyar,
Martin Huber,
Jannis Kueck
Abstract:
This study introduces a data-driven, machine learning-based method to detect suitable control variables and instruments for assessing the causal effect of a treatment on an outcome in observational data, if they exist. Our approach tests the joint existence of instruments, which are associated with the treatment but not directly with the outcome (at least conditional on observables), and suitable…
▽ More
This study introduces a data-driven, machine learning-based method to detect suitable control variables and instruments for assessing the causal effect of a treatment on an outcome in observational data, if they exist. Our approach tests the joint existence of instruments, which are associated with the treatment but not directly with the outcome (at least conditional on observables), and suitable control variables, conditional on which the treatment is exogenous, and learns the partition of instruments and control variables from the observed data. The detection of sets of instruments and control variables relies on the condition that proper instruments are conditionally independent of the outcome given the treatment and suitable control variables. We establish the consistency of our method for detecting control variables and instruments under certain regularity conditions, investigate the finite sample performance through a simulation study, and provide an empirical application to labor market data from the Job Corps study.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Machine Learning for Staggered Difference-in-Differences and Dynamic Treatment Effect Heterogeneity
Authors:
Julia Hatamyar,
Noemi Kreif,
Rudi Rocha,
Martin Huber
Abstract:
We combine two recently proposed nonparametric difference-in-differences methods, extending them to enable the examination of treatment effect heterogeneity in the staggered adoption setting using machine learning. The proposed method, machine learning difference-in-differences (MLDID), allows for estimation of time-varying conditional average treatment effects on the treated, which can be used to…
▽ More
We combine two recently proposed nonparametric difference-in-differences methods, extending them to enable the examination of treatment effect heterogeneity in the staggered adoption setting using machine learning. The proposed method, machine learning difference-in-differences (MLDID), allows for estimation of time-varying conditional average treatment effects on the treated, which can be used to conduct detailed inference on drivers of treatment effect heterogeneity. We perform simulations to evaluate the performance of MLDID and find that it accurately identifies the true predictors of treatment effect heterogeneity. We then use MLDID to evaluate the heterogeneous impacts of Brazil's Family Health Program on infant mortality, and find those in poverty and urban locations experienced the impact of the policy more quickly than other subgroups.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
Local Eviction Moratoria and the Spread of COVID-19
Authors:
Julia Hatamyar,
Christopher F. Parmeter
Abstract:
At various stages during the initial onset of the COVID-19 pandemic, various US states and local municipalities enacted eviction moratoria. One of the main aims of these moratoria was to slow the spread of COVID-19 infections. We deploy a semiparametric difference-in-differences approach with an event study specification to test whether the lifting of these local moratoria led to an increase in CO…
▽ More
At various stages during the initial onset of the COVID-19 pandemic, various US states and local municipalities enacted eviction moratoria. One of the main aims of these moratoria was to slow the spread of COVID-19 infections. We deploy a semiparametric difference-in-differences approach with an event study specification to test whether the lifting of these local moratoria led to an increase in COVID-19 cases and deaths. Our main findings, across a range of specifications, are inconclusive regarding the impact of the moratoria - especially after accounting for the number of actual evictions and conducting the analysis at the county level. We argue that recently developed augmented synthetic control (ASCM) methods are more appropriate in this setting. Our ASCM results also suggest that the lifting of eviction moratoria had little to no impact on COVID-19 cases and deaths. Thus, it seems that eviction moratoria had little to no robust effect on reducing the spread of COVID-19 throwing into question its use as a non-pharmaceutical intervention.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
Policy Learning with Rare Outcomes
Authors:
Julia Hatamyar,
Noemi Kreif
Abstract:
Machine learning (ML) estimates of conditional average treatment effects (CATE) can guide policy decisions, either by allowing targeting of individuals with beneficial CATE estimates, or as inputs to decision trees that optimise overall outcomes. There is limited information available regarding how well these algorithms perform in real-world policy evaluation scenarios. Using synthetic data, we co…
▽ More
Machine learning (ML) estimates of conditional average treatment effects (CATE) can guide policy decisions, either by allowing targeting of individuals with beneficial CATE estimates, or as inputs to decision trees that optimise overall outcomes. There is limited information available regarding how well these algorithms perform in real-world policy evaluation scenarios. Using synthetic data, we compare the finite sample performance of different policy learning algorithms, machine learning techniques employed during their learning phases, and methods for presenting estimated policy values. For each algorithm, we assess the resulting treatment allocation by measuring deviation from the ideal ("oracle") policy. Our main finding is that policy trees based on estimated CATEs outperform trees learned from doubly-robust scores. Across settings, Causal Forests and the Normalised Double-Robust Learner perform consistently well, while Bayesian Additive Regression Trees perform poorly. These methods are then applied to a case study targeting optimal allocation of subsidised health insurance, with the goal of reducing infant mortality in Indonesia.
△ Less
Submitted 3 October, 2023; v1 submitted 10 February, 2023;
originally announced February 2023.
-
Workplace Breastfeeding Legislation and Female Labor Force Participation in the United States
Authors:
Julia Hatamyar
Abstract:
This paper studies the effects of legislation mandating the provision of workplace breastfeeding amenities on the labor force participation of women in the United States. Using both the American Community Survey and the Panel Study of Income Dynamics, in a staggered difference-in-differences framework, I find evidence that workplace breastfeeding legislation significantly increases the likelihood…
▽ More
This paper studies the effects of legislation mandating the provision of workplace breastfeeding amenities on the labor force participation of women in the United States. Using both the American Community Survey and the Panel Study of Income Dynamics, in a staggered difference-in-differences framework, I find evidence that workplace breastfeeding legislation significantly increases the likelihood of female labor force participation (FLFP) across both datasets and multiple specifications, by at least 1.5 percentage points. The timing and magnitude of the post-law increases in FLFP differ across the two datasets. I bolster the analyses using the CDC's Infant Feeding Practices Survey and the Childhood and Adoption Supplement to the PSID, which further suggest an influence of the laws on breastfeeding women. Heterogeneity analysis indicates the presence of substantial treatment effect heterogeneity across subgroups, but the findings are specific to the separate datasets. Across both datasets, the legislation appears to be more effective in states where average pre-law FLFP was comparatively low. I also find evidence of a negative spillover effect, whereby women without children and women with older children may have reduced their LFP in response to the legislation.
△ Less
Submitted 22 October, 2024; v1 submitted 13 September, 2022;
originally announced September 2022.