-
Coarse Personalization
Authors:
Walter W. Zhang,
Sanjog Misra
Abstract:
With advances in estimating heterogeneous treatment effects, firms can personalize and target individuals at a granular level. However, feasibility constraints limit full personalization. In practice, firms choose segments of individuals and assign a treatment to each segment to maximize profits: We call this the coarse personalization problem. We propose a two-step solution that simultaneously makes segmentation and targeting decisions. First, the firm personalizes by estimating conditional average treatment effects. Second, the firm discretizes using treatment effects to choose which treatments to offer and their segments. We show that a combination of available machine learning tools for estimating heterogeneous treatment effects and a novel application of optimal transport methods provides a viable and efficient solution. With data from a large-scale field experiment in promotions management, we find our methodology outperforms extant approaches that segment on consumer characteristics, consumer preferences, or those that only search over a prespecified grid. Using our procedure, the firm recoups over $99.5\%$ of its expected incremental profits under full personalization while offering only five segments. We conclude by discussing how coarse personalization arises in other domains.
Submitted 30 June, 2025; v1 submitted 12 April, 2022;
originally announced April 2022.
-
Deep Learning for Individual Heterogeneity
Authors:
Max H. Farrell,
Tengyuan Liang,
Sanjog Misra
Abstract:
This paper integrates deep neural networks (DNNs) into structural economic models to increase flexibility and capture rich heterogeneity while preserving interpretability. Economic structure and machine learning are complements in empirical modeling, not substitutes: DNNs provide the capacity to learn complex, non-linear heterogeneity patterns, while the structural model ensures the estimates remain interpretable and suitable for decision making and policy analysis. We start with a standard parametric structural model and then enrich its parameters into fully flexible functions of observables, which are estimated using a particular DNN architecture whose structure reflects the economic model. We illustrate our framework by studying demand estimation in consumer choice. We show that by enriching a standard demand model we can capture rich heterogeneity, and further, exploit this heterogeneity to create a personalized pricing strategy. This type of optimization is not possible without economic structure, but cannot be heterogeneous without machine learning. Finally, we provide theoretical justification of each step in our proposed methodology. We first establish non-asymptotic bounds and convergence rates of our structural deep learning approach. Next, a novel and quite general influence function calculation allows for feasible inference via double machine learning in a wide variety of contexts. These results may be of interest in many other contexts, as they generalize prior work.
Submitted 24 April, 2025; v1 submitted 27 October, 2020;
originally announced October 2020.
-
The Identity Fragmentation Bias
Authors:
Tesary Lin,
Sanjog Misra
Abstract:
Consumers interact with firms across multiple devices, browsers, and machines; these interactions are often recorded with different identifiers for the same consumer. The failure to correctly match different identities leads to a fragmented view of exposures and behaviors. This paper studies the identity fragmentation bias, the estimation bias that results from using fragmented data. Using a formal framework, we decompose the contributing factors of the estimation bias caused by data fragmentation and discuss the direction of bias. Contrary to conventional wisdom, this bias cannot be signed or bounded under standard assumptions. Instead, upward biases and sign reversals can occur even in experimental settings. We then compare several corrective measures and discuss their respective advantages and caveats.
Submitted 7 February, 2021; v1 submitted 28 August, 2020;
originally announced August 2020.
-
Deep Neural Networks for Estimation and Inference
Authors:
Max H. Farrell,
Tengyuan Liang,
Sanjog Misra
Abstract:
We study deep neural networks and their use in semiparametric inference. We establish novel rates of convergence for deep feedforward neural nets. Our new rates are sufficiently fast (in some cases minimax optimal) to allow us to establish valid second-step inference after first-step estimation with deep learning, a result also new to the literature. Our estimation rates and semiparametric inference results handle the current standard architecture: fully connected feedforward neural networks (multi-layer perceptrons), with the now-common rectified linear unit activation function and a depth explicitly diverging with the sample size. We discuss other architectures as well, including fixed-width, very deep networks. We establish nonasymptotic bounds for these deep nets for a general class of nonparametric regression-type loss functions, which includes as special cases least squares, logistic regression, and other generalized linear models. We then apply our theory to develop semiparametric inference, focusing on causal parameters for concreteness, such as treatment effects, expected welfare, and decomposition effects. Inference in many other semiparametric contexts can be readily obtained. We demonstrate the effectiveness of deep learning with a Monte Carlo analysis and an empirical application to direct mail marketing.
Submitted 18 September, 2019; v1 submitted 26 September, 2018;
originally announced September 2018.