-
Sharp and Robust Estimation of Partially Identified Discrete Response Models
Authors:
Shakeeb Khan,
Tatiana Komarova,
Denis Nekipelov
Abstract:
Semiparametric discrete choice models are widely used in a variety of practical applications. While these models are point identified in the presence of continuous covariates, they can become partially identified when covariates are discrete. In this paper we find that classical estimators, including the maximum score estimator, (Manski (1975)), loose their attractive statistical properties withou…
▽ More
Semiparametric discrete choice models are widely used in a variety of practical applications. While these models are point identified in the presence of continuous covariates, they can become partially identified when covariates are discrete. In this paper we find that classical estimators, including the maximum score estimator, (Manski (1975)), loose their attractive statistical properties without point identification. First of all, they are not sharp with the estimator converging to an outer region of the identified set, (Komarova (2013)), and in many discrete designs it weakly converges to a random set. Second, they are not robust, with their distribution limit discontinuously changing with respect to the parameters of the model. We propose a novel class of estimators based on the concept of a quantile of a random set, which we show to be both sharp and robust. We demonstrate that our approach extends from cross-sectional settings to classical static and dynamic discrete panel data models.
△ Less
Submitted 28 May, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Multivariate ordered discrete response models
Authors:
Tatiana Komarova,
William Matcham
Abstract:
We introduce multivariate ordered discrete response models with general rectangular structures. From the perspective of behavioral economics, these non-lattice models correspond to broad bracketing in decision making, whereas lattice models, which researchers typically estimate in practice, correspond to narrow bracketing. In these models, we specify latent processes as a sum of an index of covari…
▽ More
We introduce multivariate ordered discrete response models with general rectangular structures. From the perspective of behavioral economics, these non-lattice models correspond to broad bracketing in decision making, whereas lattice models, which researchers typically estimate in practice, correspond to narrow bracketing. In these models, we specify latent processes as a sum of an index of covariates and an unobserved error, with unobservables for different latent processes potentially correlated. We provide conditions that are sufficient for identification under the independence of errors and covariates and outline an estimation approach. We present simulations and empirical examples, with a particular focus on probit specifications.
△ Less
Submitted 13 March, 2023; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Incorporating Social Welfare in Program-Evaluation and Treatment Choice
Authors:
Debopam Bhattacharya,
Tatiana Komarova
Abstract:
The econometric literature on treatment-effects typically takes functionals of outcome-distributions as `social welfare' and ignores program-impacts on unobserved utilities. We show how to incorporate aggregate utility within econometric program-evaluation and optimal treatment-targeting for a heterogenous population. In the practically important setting of discrete-choice, under unrestricted pref…
▽ More
The econometric literature on treatment-effects typically takes functionals of outcome-distributions as `social welfare' and ignores program-impacts on unobserved utilities. We show how to incorporate aggregate utility within econometric program-evaluation and optimal treatment-targeting for a heterogenous population. In the practically important setting of discrete-choice, under unrestricted preference-heterogeneity and income-effects, the indirect-utility distribution becomes a closed-form functional of average demand. This enables nonparametric cost-benefit analysis of policy-interventions and their optimal targeting based on planners' redistributional preferences. For ordered/continuous choice, utility-distributions can be bounded. Our methods are illustrated with Indian survey-data on private-tuition, where income-paths of usage-maximizing subsidies differ significantly from welfare-maximizing ones.
△ Less
Submitted 18 November, 2022; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Identification and Formal Privacy Guarantees
Authors:
Tatiana Komarova,
Denis Nekipelov
Abstract:
Empirical economic research crucially relies on highly sensitive individual datasets. At the same time, increasing availability of public individual-level data makes it possible for adversaries to potentially de-identify anonymized records in sensitive research datasets. Most commonly accepted formal definition of an individual non-disclosure guarantee is referred to as differential privacy. It re…
▽ More
Empirical economic research crucially relies on highly sensitive individual datasets. At the same time, increasing availability of public individual-level data makes it possible for adversaries to potentially de-identify anonymized records in sensitive research datasets. Most commonly accepted formal definition of an individual non-disclosure guarantee is referred to as differential privacy. It restricts the interaction of researchers with the data by allowing them to issue queries to the data. The differential privacy mechanism then replaces the actual outcome of the query with a randomised outcome.
The impact of differential privacy on the identification of empirical economic models and on the performance of estimators in nonlinear empirical Econometric models has not been sufficiently studied. Since privacy protection mechanisms are inherently finite-sample procedures, we define the notion of identifiability of the parameter of interest under differential privacy as a property of the limit of experiments. It is naturally characterized by the concepts from the random sets theory.
We show that particular instances of regression discontinuity design may be problematic for inference with differential privacy as parameters turn out to be neither point nor partially identified. The set of differentially private estimators converges weakly to a random set. Our analysis suggests that many other estimators that rely on nuisance parameters may have similar properties with the requirement of differential privacy. We show that identification becomes possible if the target parameter can be deterministically located within the random set. In that case, a full exploration of the random set of the weak limits of differentially private estimators can allow the data curator to select a sequence of instances of differentially private estimators converging to the target parameter in probability.
△ Less
Submitted 3 May, 2021; v1 submitted 25 June, 2020;
originally announced June 2020.
-
Testing nonparametric shape restrictions
Authors:
Tatiana Komarova,
Javier Hidalgo
Abstract:
We describe and examine a test for a general class of shape constraints, such as constraints on the signs of derivatives, U-(S-)shape, symmetry, quasi-convexity, log-convexity, $r$-convexity, among others, in a nonparametric framework using partial sums empirical processes. We show that, after a suitable transformation, its asymptotic distribution is a functional of the standard Brownian motion, s…
▽ More
We describe and examine a test for a general class of shape constraints, such as constraints on the signs of derivatives, U-(S-)shape, symmetry, quasi-convexity, log-convexity, $r$-convexity, among others, in a nonparametric framework using partial sums empirical processes. We show that, after a suitable transformation, its asymptotic distribution is a functional of the standard Brownian motion, so that critical values are available. However, due to the possible poor approximation of the asymptotic critical values to the finite sample ones, we also describe a valid bootstrap algorithm.
△ Less
Submitted 8 June, 2020; v1 submitted 4 September, 2019;
originally announced September 2019.