-
Inference with few treated units
Authors:
Luis Alvarez,
Bruno Ferman,
Kaspar Wüthrich
Abstract:
In many causal inference applications, only one or a few units (or clusters of units) are treated. An important challenge in such settings is that standard inference methods that rely on asymptotic theory may be unreliable, even when the total number of units is large. This survey reviews and categorizes inference methods that are designed to accommodate few treated units, considering both cross-s…
▽ More
In many causal inference applications, only one or a few units (or clusters of units) are treated. An important challenge in such settings is that standard inference methods that rely on asymptotic theory may be unreliable, even when the total number of units is large. This survey reviews and categorizes inference methods that are designed to accommodate few treated units, considering both cross-sectional and panel data methods. We discuss trade-offs and connections between different approaches. In doing so, we propose slight modifications to improve the finite-sample validity of some methods, and we also provide theoretical justifications for existing heuristic approaches that have been proposed in the literature.
△ Less
Submitted 25 June, 2025; v1 submitted 28 April, 2025;
originally announced April 2025.
-
Estimating Causal Effects of Discrete and Continuous Treatments with Binary Instruments
Authors:
Victor Chernozhukov,
Iván Fernández-Val,
Sukjin Han,
Kaspar Wüthrich
Abstract:
We propose an instrumental variable framework for identifying and estimating causal effects of discrete and continuous treatments with binary instruments. The basis of our approach is a local copula representation of the joint distribution of the potential outcomes and unobservables determining treatment assignment. This representation allows us to introduce an identifying assumption, so-called co…
▽ More
We propose an instrumental variable framework for identifying and estimating causal effects of discrete and continuous treatments with binary instruments. The basis of our approach is a local copula representation of the joint distribution of the potential outcomes and unobservables determining treatment assignment. This representation allows us to introduce an identifying assumption, so-called copula invariance, that restricts the local dependence of the copula with respect to the treatment propensity. We show that copula invariance identifies treatment effects for the entire population and other subpopulations such as the treated. The identification results are constructive and lead to practical estimation and inference procedures based on distribution regression. An application to estimating the effect of sleep on well-being uncovers interesting patterns of heterogeneity.
△ Less
Submitted 13 December, 2024; v1 submitted 9 March, 2024;
originally announced March 2024.
-
The Power of Tests for Detecting $p$-Hacking
Authors:
Graham Elliott,
Nikolay Kudrin,
Kaspar Wüthrich
Abstract:
$p$-Hacking undermines the validity of empirical studies. A flourishing empirical literature investigates the prevalence of $p$-hacking based on the distribution of $p$-values across studies. Interpreting results in this literature requires a careful understanding of the power of methods for detecting $p$-hacking. We theoretically study the implications of likely forms of $p…
▽ More
$p$-Hacking undermines the validity of empirical studies. A flourishing empirical literature investigates the prevalence of $p$-hacking based on the distribution of $p$-values across studies. Interpreting results in this literature requires a careful understanding of the power of methods for detecting $p$-hacking. We theoretically study the implications of likely forms of $p$-hacking on the distribution of $p$-values to understand the power of tests for detecting it. Power depends crucially on the $p$-hacking strategy and the distribution of true effects. Publication bias can enhance the power for testing the joint null of no $p$-hacking and no publication bias.
△ Less
Submitted 22 April, 2024; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Selection and parallel trends
Authors:
Dalia Ghanem,
Pedro H. C. Sant'Anna,
Kaspar Wüthrich
Abstract:
We study the role of selection into treatment in difference-in-differences (DiD) designs. We derive necessary and sufficient conditions for parallel trends assumptions under general classes of selection mechanisms. These conditions characterize the empirical content of parallel trends. We use the necessary conditions to provide a selection-based decomposition of the bias of DiD and provide easy-to…
▽ More
We study the role of selection into treatment in difference-in-differences (DiD) designs. We derive necessary and sufficient conditions for parallel trends assumptions under general classes of selection mechanisms. These conditions characterize the empirical content of parallel trends. We use the necessary conditions to provide a selection-based decomposition of the bias of DiD and provide easy-to-implement strategies for benchmarking its components. We also provide templates for justifying DiD in applications with and without covariates. A reanalysis of the causal effect of NSW training programs demonstrates the usefulness of our selection-based approach to benchmarking the bias of DiD.
△ Less
Submitted 30 May, 2025; v1 submitted 16 March, 2022;
originally announced March 2022.
-
Pairwise Valid Instruments
Authors:
Zhenting Sun,
Kaspar Wüthrich
Abstract:
Finding valid instruments is difficult. We propose Validity Set Instrumental Variable (VSIV) estimation, a method for estimating local average treatment effects (LATEs) in heterogeneous causal effect models when the instruments are partially invalid. We consider settings with pairwise valid instruments, that is, instruments that are valid for a subset of instrument value pairs. VSIV estimation exp…
▽ More
Finding valid instruments is difficult. We propose Validity Set Instrumental Variable (VSIV) estimation, a method for estimating local average treatment effects (LATEs) in heterogeneous causal effect models when the instruments are partially invalid. We consider settings with pairwise valid instruments, that is, instruments that are valid for a subset of instrument value pairs. VSIV estimation exploits testable implications of instrument validity to remove invalid pairs and provides estimates of the LATEs for all remaining pairs, which can be aggregated into a single parameter of interest using researcher-specified weights. We show that the proposed VSIV estimators are asymptotically normal under weak conditions and remove or reduce the asymptotic bias relative to standard LATE estimators (that is, LATE estimators that do not use testable implications to remove invalid variation). We evaluate the finite sample properties of VSIV estimation in application-based simulations and apply our method to estimate the returns to college education using parental education as an instrument.
△ Less
Submitted 8 February, 2025; v1 submitted 15 March, 2022;
originally announced March 2022.
-
A model of multiple hypothesis testing
Authors:
Davide Viviano,
Kaspar Wuthrich,
Paul Niehaus
Abstract:
Multiple hypothesis testing practices vary widely, without consensus on which are appropriate when. This paper provides an economic foundation for these practices designed to capture leading examples, such as regulatory approval on the basis of clinical trials. In studies of multiple treatments or sub-populations, adjustments may be appropriate depending on scale economies in the research producti…
▽ More
Multiple hypothesis testing practices vary widely, without consensus on which are appropriate when. This paper provides an economic foundation for these practices designed to capture leading examples, such as regulatory approval on the basis of clinical trials. In studies of multiple treatments or sub-populations, adjustments may be appropriate depending on scale economies in the research production function, with control of classical notions of compound errors emerging in some but not all cases. In studies with multiple outcomes, indexing is appropriate and adjustments to test levels may be appropriate if the intended audience is heterogeneous. Data on actual costs in the drug approval process suggest both that some adjustment is warranted in that setting and that standard procedures may be overly conservative.
△ Less
Submitted 31 January, 2025; v1 submitted 27 April, 2021;
originally announced April 2021.
-
Green governments
Authors:
Niklas Potrafke,
Kaspar Wuthrich
Abstract:
We examine how Green governments influence environmental, macroeconomic, and education outcomes. We exploit that the Fukushima nuclear disaster in Japan gave rise to an unanticipated change in government in the German state Baden-Wuerttemberg in 2011. Using the synthetic control method, we find no evidence that the Green government influenced CO2 emissions or increased renewable energy usage overa…
▽ More
We examine how Green governments influence environmental, macroeconomic, and education outcomes. We exploit that the Fukushima nuclear disaster in Japan gave rise to an unanticipated change in government in the German state Baden-Wuerttemberg in 2011. Using the synthetic control method, we find no evidence that the Green government influenced CO2 emissions or increased renewable energy usage overall. The share of wind power usage even decreased. Intra-ecological conflicts prevented the Green government from implementing drastic changes in environmental policies. The results do not suggest that the Green government influenced macroeconomic outcomes. Inclusive education policies caused comprehensive schools to become larger.
△ Less
Submitted 1 March, 2022; v1 submitted 17 December, 2020;
originally announced December 2020.
-
Bias correction for quantile regression estimators
Authors:
Grigory Franguridi,
Bulat Gafarov,
Kaspar Wuthrich
Abstract:
We study the bias of classical quantile regression and instrumental variable quantile regression estimators. While being asymptotically first-order unbiased, these estimators can have non-negligible second-order biases. We derive a higher-order stochastic expansion of these estimators using empirical process theory. Based on this expansion, we derive an explicit formula for the second-order bias a…
▽ More
We study the bias of classical quantile regression and instrumental variable quantile regression estimators. While being asymptotically first-order unbiased, these estimators can have non-negligible second-order biases. We derive a higher-order stochastic expansion of these estimators using empirical process theory. Based on this expansion, we derive an explicit formula for the second-order bias and propose a feasible bias correction procedure that uses finite-difference estimators of the bias components. The proposed bias correction method performs well in simulations. We provide an empirical illustration using Engel's classical data on household food expenditure.
△ Less
Submitted 12 February, 2025; v1 submitted 5 November, 2020;
originally announced November 2020.
-
Protectionism and economic growth: Causal evidence from the first era of globalization
Authors:
Niklas Potrafke,
Fabian Ruthardt,
Kaspar Wüthrich
Abstract:
We investigate how protectionist policies influence economic growth. Our empirical strategy exploits an extraordinary tax scandal that gave rise to an unexpected change of government in Sweden. A free-trade majority in parliament was overturned by a protectionist majority in 1887. The protectionist government increased tariffs. We employ the synthetic control method to select control countries aga…
▽ More
We investigate how protectionist policies influence economic growth. Our empirical strategy exploits an extraordinary tax scandal that gave rise to an unexpected change of government in Sweden. A free-trade majority in parliament was overturned by a protectionist majority in 1887. The protectionist government increased tariffs. We employ the synthetic control method to select control countries against which economic growth in Sweden can be compared. We do not find evidence suggesting that protectionist policies influenced economic growth and examine channels why. The new tariff laws increased government revenue. However, the results do not suggest that the protectionist government stimulated the economy by increasing government expenditure.
△ Less
Submitted 3 March, 2022; v1 submitted 5 October, 2020;
originally announced October 2020.
-
Instrumental Variable Quantile Regression
Authors:
Victor Chernozhukov,
Christian Hansen,
Kaspar Wuthrich
Abstract:
This chapter reviews the instrumental variable quantile regression model of Chernozhukov and Hansen (2005). We discuss the key conditions used for identification of structural quantile effects within this model which include the availability of instruments and a restriction on the ranks of structural disturbances. We outline several approaches to obtaining point estimates and performing statistica…
▽ More
This chapter reviews the instrumental variable quantile regression model of Chernozhukov and Hansen (2005). We discuss the key conditions used for identification of structural quantile effects within this model which include the availability of instruments and a restriction on the ranks of structural disturbances. We outline several approaches to obtaining point estimates and performing statistical inference for model parameters. Finally, we point to possible directions for future research.
△ Less
Submitted 28 August, 2020;
originally announced September 2020.
-
Distributional conformal prediction
Authors:
Victor Chernozhukov,
Kaspar Wüthrich,
Yinchu Zhu
Abstract:
We propose a robust method for constructing conditionally valid prediction intervals based on models for conditional distributions such as quantile and distribution regression. Our approach can be applied to important prediction problems including cross-sectional prediction, k-step-ahead forecasts, synthetic controls and counterfactual prediction, and individual treatment effects prediction. Our m…
▽ More
We propose a robust method for constructing conditionally valid prediction intervals based on models for conditional distributions such as quantile and distribution regression. Our approach can be applied to important prediction problems including cross-sectional prediction, k-step-ahead forecasts, synthetic controls and counterfactual prediction, and individual treatment effects prediction. Our method exploits the probability integral transform and relies on permuting estimated ranks. Unlike regression residuals, ranks are independent of the predictors, allowing us to construct conditionally valid prediction intervals under heteroskedasticity. We establish approximate conditional validity under consistent estimation and provide approximate unconditional validity under model misspecification, overfitting, and with time series data. We also propose a simple "shape" adjustment of our baseline method that yields optimal prediction intervals.
△ Less
Submitted 21 August, 2021; v1 submitted 17 September, 2019;
originally announced September 2019.
-
Detecting p-hacking
Authors:
Graham Elliott,
Nikolay Kudrin,
Kaspar Wuthrich
Abstract:
We theoretically analyze the problem of testing for $p$-hacking based on distributions of $p$-values across multiple studies. We provide general results for when such distributions have testable restrictions (are non-increasing) under the null of no $p$-hacking. We find novel additional testable restrictions for $p$-values based on $t$-tests. Specifically, the shape of the power functions results…
▽ More
We theoretically analyze the problem of testing for $p$-hacking based on distributions of $p$-values across multiple studies. We provide general results for when such distributions have testable restrictions (are non-increasing) under the null of no $p$-hacking. We find novel additional testable restrictions for $p$-values based on $t$-tests. Specifically, the shape of the power functions results in both complete monotonicity as well as bounds on the distribution of $p$-values. These testable restrictions result in more powerful tests for the null hypothesis of no $p$-hacking. When there is also publication bias, our tests are joint tests for $p$-hacking and publication bias. A reanalysis of two prominent datasets shows the usefulness of our new tests.
△ Less
Submitted 25 May, 2021; v1 submitted 16 June, 2019;
originally announced June 2019.
-
Omitted variable bias of Lasso-based inference methods: A finite sample analysis
Authors:
Kaspar Wuthrich,
Ying Zhu
Abstract:
We study the finite sample behavior of Lasso-based inference methods such as post double Lasso and debiased Lasso. We show that these methods can exhibit substantial omitted variable biases (OVBs) due to Lasso not selecting relevant controls. This phenomenon can occur even when the coefficients are sparse and the sample size is large and larger than the number of controls. Therefore, relying on th…
▽ More
We study the finite sample behavior of Lasso-based inference methods such as post double Lasso and debiased Lasso. We show that these methods can exhibit substantial omitted variable biases (OVBs) due to Lasso not selecting relevant controls. This phenomenon can occur even when the coefficients are sparse and the sample size is large and larger than the number of controls. Therefore, relying on the existing asymptotic inference theory can be problematic in empirical applications. We compare the Lasso-based inference methods to modern high-dimensional OLS-based methods and provide practical guidance.
△ Less
Submitted 12 September, 2021; v1 submitted 20 March, 2019;
originally announced March 2019.
-
Decentralization Estimators for Instrumental Variable Quantile Regression Models
Authors:
Hiroaki Kaido,
Kaspar Wuthrich
Abstract:
The instrumental variable quantile regression (IVQR) model (Chernozhukov and Hansen, 2005) is a popular tool for estimating causal quantile effects with endogenous covariates. However, estimation is complicated by the non-smoothness and non-convexity of the IVQR GMM objective function. This paper shows that the IVQR estimation problem can be decomposed into a set of conventional quantile regressio…
▽ More
The instrumental variable quantile regression (IVQR) model (Chernozhukov and Hansen, 2005) is a popular tool for estimating causal quantile effects with endogenous covariates. However, estimation is complicated by the non-smoothness and non-convexity of the IVQR GMM objective function. This paper shows that the IVQR estimation problem can be decomposed into a set of conventional quantile regression sub-problems which are convex and can be solved efficiently. This reformulation leads to new identification results and to fast, easy to implement, and tuning-free estimators that do not require the availability of high-level "black box" optimization routines.
△ Less
Submitted 16 September, 2020; v1 submitted 28 December, 2018;
originally announced December 2018.
-
Debiasing and $t$-tests for synthetic control inference on average causal effects
Authors:
Victor Chernozhukov,
Kaspar Wuthrich,
Yinchu Zhu
Abstract:
We propose a practical and robust method for making inferences on average treatment effects estimated by synthetic controls. We develop a $K$-fold cross-fitting procedure for bias correction. To avoid the difficult estimation of the long-run variance, inference is based on a self-normalized $t$-statistic, which has an asymptotically pivotal $t$-distribution. Our $t$-test is easy to implement, prov…
▽ More
We propose a practical and robust method for making inferences on average treatment effects estimated by synthetic controls. We develop a $K$-fold cross-fitting procedure for bias correction. To avoid the difficult estimation of the long-run variance, inference is based on a self-normalized $t$-statistic, which has an asymptotically pivotal $t$-distribution. Our $t$-test is easy to implement, provably robust against misspecification, and valid with stationary and non-stationary data. It demonstrates an excellent small sample performance in application-based simulations and performs well relative to other methods. We illustrate the usefulness of the $t$-test by revisiting the effect of carbon taxes on emissions.
△ Less
Submitted 23 May, 2025; v1 submitted 27 December, 2018;
originally announced December 2018.
-
An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls
Authors:
Victor Chernozhukov,
Kaspar Wüthrich,
Yinchu Zhu
Abstract:
We introduce new inference procedures for counterfactual and synthetic control methods for policy evaluation. We recast the causal inference problem as a counterfactual prediction and a structural breaks testing problem. This allows us to exploit insights from conformal prediction and structural breaks testing to develop permutation inference procedures that accommodate modern high-dimensional est…
▽ More
We introduce new inference procedures for counterfactual and synthetic control methods for policy evaluation. We recast the causal inference problem as a counterfactual prediction and a structural breaks testing problem. This allows us to exploit insights from conformal prediction and structural breaks testing to develop permutation inference procedures that accommodate modern high-dimensional estimators, are valid under weak and easy-to-verify conditions, and are provably robust against misspecification. Our methods work in conjunction with many different approaches for predicting counterfactual mean outcomes in the absence of the policy intervention. Examples include synthetic controls, difference-in-differences, factor and matrix completion models, and (fused) time series panel data models. Our approach demonstrates an excellent small-sample performance in simulations and is taken to a data application where we re-evaluate the consequences of decriminalizing indoor prostitution. Open-source software for implementing our conformal inference methods is available.
△ Less
Submitted 20 May, 2021; v1 submitted 25 December, 2017;
originally announced December 2017.
-
Generic Inference on Quantile and Quantile Effect Functions for Discrete Outcomes
Authors:
Victor Chernozhukov,
Iván Fernández-Val,
Blaise Melly,
Kaspar Wüthrich
Abstract:
Quantile and quantile effect functions are important tools for descriptive and causal analyses due to their natural and intuitive interpretation. Existing inference methods for these functions do not apply to discrete random variables. This paper offers a simple, practical construction of simultaneous confidence bands for quantile and quantile effect functions of possibly discrete random variables…
▽ More
Quantile and quantile effect functions are important tools for descriptive and causal analyses due to their natural and intuitive interpretation. Existing inference methods for these functions do not apply to discrete random variables. This paper offers a simple, practical construction of simultaneous confidence bands for quantile and quantile effect functions of possibly discrete random variables. It is based on a natural transformation of simultaneous confidence bands for distribution functions, which are readily available for many problems. The construction is generic and does not depend on the nature of the underlying problem. It works in conjunction with parametric, semiparametric, and nonparametric modeling methods for observed and counterfactual distributions, and does not depend on the sampling scheme. We apply our method to characterize the distributional impact of insurance coverage on health care utilization and obtain the distributional decomposition of the racial test score gap. We find that universal insurance coverage increases the number of doctor visits across the entire distribution, and that the racial test score gap is small at early ages but grows with age due to socio economic factors affecting child development especially at the top of the distribution. These are new, interesting empirical findings that complement previous analyses that focused on mean effects only. In both applications, the outcomes of interest are discrete rendering existing inference methods invalid for obtaining uniform confidence bands for observed and counterfactual quantile functions and for their difference -- the quantile effects functions.
△ Less
Submitted 30 August, 2018; v1 submitted 17 August, 2016;
originally announced August 2016.