-
Consumption Stimulus with Digital Coupons
Authors:
Ying Chen,
Mingyi Li,
Jiaming Mao,
Jingyi Zhou
Abstract:
We study consumption stimulus with digital coupons, which provide time-limited subsidies contingent on minimum spending. We analyze a large-scale program in China and present five main findings: (1) the program generates large short-term effects, with each $\yen$1 of government subsidy inducing $\yen$3.4 in consumer spending; (2) consumption responses vary substantially, driven by both demand-side…
▽ More
We study consumption stimulus with digital coupons, which provide time-limited subsidies contingent on minimum spending. We analyze a large-scale program in China and present five main findings: (1) the program generates large short-term effects, with each $\yen$1 of government subsidy inducing $\yen$3.4 in consumer spending; (2) consumption responses vary substantially, driven by both demand-side factors (e.g., wealth) and supply-side factors (e.g., local consumption amenities); (3) The largest spending increases occur among consumers whose baseline spending already exceeds coupon thresholds and for whom coupon subsidies should be equivalent to cash, suggesting behavioral motivations; (4) high-response consumers disproportionately direct their spending toward large businesses, leading to a regressive allocation of stimulus benefits; and (5) targeting the most responsive consumers can double total stimulus effects. A hybrid design combining targeted distribution with direct support to small businesses improves both the efficiency and equity of the program.
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Steering Prosocial AI Agents: Computational Basis of LLM's Decision Making in Social Simulation
Authors:
Ji Ma
Abstract:
Large language models (LLMs) increasingly serve as human-like decision-making agents in social science and applied settings. These LLM-agents are typically assigned human-like characters and placed in real-life contexts. However, how these characters and contexts shape an LLM's behavior remains underexplored. This study proposes and tests methods for probing, quantifying, and modifying an LLM's in…
▽ More
Large language models (LLMs) increasingly serve as human-like decision-making agents in social science and applied settings. These LLM-agents are typically assigned human-like characters and placed in real-life contexts. However, how these characters and contexts shape an LLM's behavior remains underexplored. This study proposes and tests methods for probing, quantifying, and modifying an LLM's internal representations in a Dictator Game -- a classic behavioral experiment on fairness and prosocial behavior. We extract ``vectors of variable variations'' (e.g., ``male'' to ``female'') from the LLM's internal state. Manipulating these vectors during the model's inference can substantially alter how those variables relate to the model's decision-making. This approach offers a principled way to study and regulate how social concepts can be encoded and engineered within transformer-based models, with implications for alignment, debiasing, and designing AI agents for social simulations in both academic and commercial applications.
△ Less
Submitted 15 April, 2025;
originally announced April 2025.
-
Managing Procurement Auction Failure: Bid Requirements or Reserve Prices
Authors:
Jun Ma,
Vadim Marmer,
Pai Xu
Abstract:
This paper examines bid requirements, where the government may cancel a procurement contract unless two or more bids are received. Using a first-price auction model with endogenous entry, we compare the bid requirement and reserve price mechanisms in terms of auction failure and procurement costs. We find that reserve prices result in lower procurement costs and substantially lower failure probabi…
▽ More
This paper examines bid requirements, where the government may cancel a procurement contract unless two or more bids are received. Using a first-price auction model with endogenous entry, we compare the bid requirement and reserve price mechanisms in terms of auction failure and procurement costs. We find that reserve prices result in lower procurement costs and substantially lower failure probabilities, especially when entry costs are high, or signals are sufficiently informative. Bid requirements are more likely to result in zero entry, while reserve prices can sustain positive entry under broader conditions.
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
Assortative Marriage and Geographic Sorting
Authors:
Jiaming Mao,
Jiayi Wen
Abstract:
Between 1980 and 2000, the U.S. experienced a significant rise in geographic sorting and educational homogamy, with college graduates increasingly concentrating in high-skill cities and marrying similarly educated spouses. We develop and estimate a spatial equilibrium model with local labor, housing, and marriage markets, incorporating a marriage matching framework with transferable utility. Using…
▽ More
Between 1980 and 2000, the U.S. experienced a significant rise in geographic sorting and educational homogamy, with college graduates increasingly concentrating in high-skill cities and marrying similarly educated spouses. We develop and estimate a spatial equilibrium model with local labor, housing, and marriage markets, incorporating a marriage matching framework with transferable utility. Using the model, we estimate trends in assortative preferences, quantify the interplay between marital and geographic sorting, and assess their combined impact on household inequality. Welfare analyses show that after accounting for marriage, the college well-being gap grew substantially more than the college wage gap.
△ Less
Submitted 18 February, 2025;
originally announced February 2025.
-
Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games
Authors:
Ji Ma
Abstract:
As Large Language Model (LLM)-based agents increasingly undertake real-world tasks and engage with human society, how well do we understand their behaviors? We (1) investigate how LLM agents' prosocial behaviors -- a fundamental social norm -- can be induced by different personas and benchmarked against human behaviors; and (2) introduce a behavioral and social science approach to evaluate LLM age…
▽ More
As Large Language Model (LLM)-based agents increasingly undertake real-world tasks and engage with human society, how well do we understand their behaviors? We (1) investigate how LLM agents' prosocial behaviors -- a fundamental social norm -- can be induced by different personas and benchmarked against human behaviors; and (2) introduce a behavioral and social science approach to evaluate LLM agents' decision-making. We explored how different personas and experimental framings affect these AI agents' altruistic behavior in dictator games and compared their behaviors within the same LLM family, across various families, and with human behaviors. The findings reveal substantial variations and inconsistencies among LLMs and notable differences compared to human behaviors. Merely assigning a human-like identity to LLMs does not produce human-like behaviors. Despite being trained on extensive human-generated data, these AI agents are unable to capture the internal processes of human decision-making. Their alignment with human is highly variable and dependent on specific model architectures and prompt formulations; even worse, such dependence does not follow a clear pattern. LLMs can be useful task-specific tools but are not yet intelligent human-like agents.
△ Less
Submitted 16 December, 2024; v1 submitted 28 October, 2024;
originally announced October 2024.
-
Debiased Inference for Dynamic Nonlinear Panels with Multi-dimensional Heterogeneities
Authors:
Xuan Leng,
Jiaming Mao,
Yutao Sun
Abstract:
We introduce a generic class of dynamic nonlinear heterogeneous parameter models that incorporate individual and time fixed effects in both the intercept and slope. These models are subject to the incidental parameter problem, in that the limiting distribution of the point estimator is not centered at zero, and that test statistics do not follow their standard asymptotic distributions as in the ab…
▽ More
We introduce a generic class of dynamic nonlinear heterogeneous parameter models that incorporate individual and time fixed effects in both the intercept and slope. These models are subject to the incidental parameter problem, in that the limiting distribution of the point estimator is not centered at zero, and that test statistics do not follow their standard asymptotic distributions as in the absence of the fixed effects. To address the problem, we develop an analytical bias correction procedure to construct a bias-corrected likelihood. The resulting estimator follows an asymptotic normal distribution with mean zero. Moreover, likelihood-based tests statistics -- including likelihood-ratio, Lagrange-multiplier, and Wald tests -- follow the limiting chi-squared distribution under the null hypothesis. Simulations demonstrate the effectiveness of the proposed correction method, and an empirical application on the labor force participation of single mothers underscores its practical importance.
△ Less
Submitted 14 May, 2025; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Causal Estimation of Position Bias in Recommender Systems Using Marketplace Instruments
Authors:
Rina Friedberg,
Karthik Rajkumar,
Jialiang Mao,
Qian Yao,
YinYin Yu,
Min Liu
Abstract:
Information retrieval systems, such as online marketplaces, news feeds, and search engines, are ubiquitous in today's digital society. They facilitate information discovery by ranking retrieved items on predicted relevance, i.e. likelihood of interaction (click, share) between users and items. Typically modeled using past interactions, such rankings have a major drawback: interaction depends on th…
▽ More
Information retrieval systems, such as online marketplaces, news feeds, and search engines, are ubiquitous in today's digital society. They facilitate information discovery by ranking retrieved items on predicted relevance, i.e. likelihood of interaction (click, share) between users and items. Typically modeled using past interactions, such rankings have a major drawback: interaction depends on the attention items receive. A highly-relevant item placed outside a user's attention could receive little interaction. This discrepancy between observed interaction and true relevance is termed the position bias. Position bias degrades relevance estimation and when it compounds over time, it can silo users into false relevant items, causing marketplace inefficiencies. Position bias may be identified with randomized experiments, but such an approach can be prohibitive in cost and feasibility. Past research has also suggested propensity score methods, which do not adequately address unobserved confounding; and regression discontinuity designs, which have poor external validity. In this work, we address these concerns by leveraging the abundance of A/B tests in ranking evaluations as instrumental variables. Historical A/B tests allow us to access exogenous variation in rankings without manually introducing them, harming user experience and platform revenue. We demonstrate our methodology in two distinct applications at LinkedIn - feed ads and the People-You-May-Know (PYMK) recommender. The marketplaces comprise users and campaigns on the ads side, and invite senders and recipients on PYMK. By leveraging prior experimentation, we obtain quasi-experimental variation in item rankings that is orthogonal to user relevance. Our method provides robust position effect estimates that handle unobserved confounding well, greater generalizability, and easily extends to other information retrieval systems.
△ Less
Submitted 12 May, 2022;
originally announced May 2022.
-
Approximately Efficient Bilateral Trade
Authors:
Yuan Deng,
Jieming Mao,
Balasubramanian Sivan,
Kangning Wang
Abstract:
We study bilateral trade between two strategic agents. The celebrated result of Myerson and Satterthwaite states that in general, no incentive-compatible, individually rational and weakly budget balanced mechanism can be efficient. I.e., no mechanism with these properties can guarantee a trade whenever buyer value exceeds seller cost. Given this, a natural question is whether there exists a mechan…
▽ More
We study bilateral trade between two strategic agents. The celebrated result of Myerson and Satterthwaite states that in general, no incentive-compatible, individually rational and weakly budget balanced mechanism can be efficient. I.e., no mechanism with these properties can guarantee a trade whenever buyer value exceeds seller cost. Given this, a natural question is whether there exists a mechanism with these properties that guarantees a constant fraction of the first-best gains-from-trade, namely a constant fraction of the gains-from-trade attainable whenever buyer's value weakly exceeds seller's cost. In this work, we positively resolve this long-standing open question on constant-factor approximation, mentioned in several previous works, using a simple mechanism.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
Inference on Individual Treatment Effects in Nonseparable Triangular Models
Authors:
Jun Ma,
Vadim Marmer,
Zhengfei Yu
Abstract:
In nonseparable triangular models with a binary endogenous treatment and a binary instrumental variable, Vuong and Xu (2017) established identification results for individual treatment effects (ITEs) under the rank invariance assumption. Using their approach, Feng, Vuong, and Xu (2019) proposed a uniformly consistent kernel estimator for the density of the ITE that utilizes estimated ITEs. In this…
▽ More
In nonseparable triangular models with a binary endogenous treatment and a binary instrumental variable, Vuong and Xu (2017) established identification results for individual treatment effects (ITEs) under the rank invariance assumption. Using their approach, Feng, Vuong, and Xu (2019) proposed a uniformly consistent kernel estimator for the density of the ITE that utilizes estimated ITEs. In this paper, we establish the asymptotic normality of the density estimator of Feng, Vuong, and Xu (2019) and show that the ITE estimation errors have a non-negligible effect on the asymptotic distribution of the estimator. We propose asymptotically valid standard errors that account for ITEs estimation, as well as a bias correction. Furthermore, we develop uniform confidence bands for the density of the ITE using the jackknife multiplier or nonparametric bootstrap critical values.
△ Less
Submitted 15 February, 2023; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Interactive Communication in Bilateral Trade
Authors:
Jieming Mao,
Renato Paes Leme,
Kangning Wang
Abstract:
We define a model of interactive communication where two agents with private types can exchange information before a game is played. The model contains Bayesian persuasion as a special case of a one-round communication protocol. We define message complexity corresponding to the minimum number of interactive rounds necessary to achieve the best possible outcome. Our main result is that for bilatera…
▽ More
We define a model of interactive communication where two agents with private types can exchange information before a game is played. The model contains Bayesian persuasion as a special case of a one-round communication protocol. We define message complexity corresponding to the minimum number of interactive rounds necessary to achieve the best possible outcome. Our main result is that for bilateral trade, agents don't stop talking until they reach an efficient outcome: Either agents achieve an efficient allocation in finitely many rounds of communication; or the optimal communication protocol has infinite number of rounds. We show an important class of bilateral trade settings where efficient allocation is achievable with a small number of rounds of communication.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
Optimal Pricing Schemes for an Impatient Buyer
Authors:
Yuan Deng,
Jieming Mao,
Balasubramanian Sivan,
Kangning Wang
Abstract:
A patient seller aims to sell a good to an impatient buyer (i.e., one who discounts utility over time). The buyer will remain in the market for a period of time $T$, and her private value is drawn from a publicly known distribution. What is the revenue-optimal pricing-curve (sequence of (price, time) pairs) for the seller? Is randomization of help here? Is the revenue-optimal pricing curve computa…
▽ More
A patient seller aims to sell a good to an impatient buyer (i.e., one who discounts utility over time). The buyer will remain in the market for a period of time $T$, and her private value is drawn from a publicly known distribution. What is the revenue-optimal pricing-curve (sequence of (price, time) pairs) for the seller? Is randomization of help here? Is the revenue-optimal pricing curve computable in polynomial time? We answer these questions in this paper. We give an efficient algorithm for computing the revenue-optimal pricing curve. We show that pricing curves, that post a price at each point of time and let the buyer pick her utility maximizing time to buy, are revenue-optimal among a much broader class of sequential lottery mechanisms. I.e., mechanisms that allow the seller to post a menu of lotteries at each point of time cannot get any higher revenue than pricing curves. We also show that the even broader class of mechanisms that allow the menu of lotteries to be adaptively set, can earn strictly higher revenue than that of pricing curves, and the revenue gap can be as big as the support size of the buyer's value distribution.
△ Less
Submitted 11 February, 2023; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Empirical Likelihood Covariate Adjustment for Regression Discontinuity Designs
Authors:
Jun Ma,
Zhengfei Yu
Abstract:
This paper proposes a versatile covariate adjustment method that directly incorporates covariate balance in regression discontinuity (RD) designs. The new empirical entropy balancing method reweights the standard local polynomial RD estimator by using the entropy balancing weights that minimize the Kullback--Leibler divergence from the uniform weights while satisfying the covariate balance constra…
▽ More
This paper proposes a versatile covariate adjustment method that directly incorporates covariate balance in regression discontinuity (RD) designs. The new empirical entropy balancing method reweights the standard local polynomial RD estimator by using the entropy balancing weights that minimize the Kullback--Leibler divergence from the uniform weights while satisfying the covariate balance constraints. Our estimator can be formulated as an empirical likelihood estimator that efficiently incorporates the information from the covariate balance condition as correctly specified over-identifying moment restrictions, and thus has an asymptotic variance no larger than that of the standard estimator without covariates. We demystify the asymptotic efficiency gain of Calonico, Cattaneo, Farrell, and Titiunik (2019)'s regression-based covariate-adjusted estimator, as their estimator has the same asymptotic variance as ours. Further efficiency improvement from balancing over sieve spaces is possible if our entropy balancing weights are computed using stronger covariate balance constraints that are imposed on functions of covariates. We then show that our method enjoys favorable second-order properties from empirical likelihood estimation and inference: the estimator has a small (bounded) nonlinearity bias, and the likelihood ratio based confidence set admits a simple analytical correction that can be used to improve coverage accuracy. The coverage accuracy of our confidence set is robust against slight perturbation to the covariate balance condition, which may happen in cases such as data contamination and misspecified "unaffected" outcomes used as covariates. The proposed entropy balancing approach for covariate adjustment is applicable to other RD-related settings.
△ Less
Submitted 28 May, 2024; v1 submitted 20 August, 2020;
originally announced August 2020.
-
Ensemble Learning with Statistical and Structural Models
Authors:
Jiaming Mao,
Jingzhi Xu
Abstract:
Statistical and structural modeling represent two distinct approaches to data analysis. In this paper, we propose a set of novel methods for combining statistical and structural models for improved prediction and causal inference. Our first proposed estimator has the doubly robustness property in that it only requires the correct specification of either the statistical or the structural model. Our…
▽ More
Statistical and structural modeling represent two distinct approaches to data analysis. In this paper, we propose a set of novel methods for combining statistical and structural models for improved prediction and causal inference. Our first proposed estimator has the doubly robustness property in that it only requires the correct specification of either the statistical or the structural model. Our second proposed estimator is a weighted ensemble that has the ability to outperform both models when they are both misspecified. Experiments demonstrate the potential of our estimators in various settings, including fist-price auctions, dynamic models of entry and exit, and demand estimation with instrumental variables.
△ Less
Submitted 7 June, 2020;
originally announced June 2020.
-
Structural Regularization
Authors:
Jiaming Mao,
Zhesheng Zheng
Abstract:
We propose a novel method for modeling data by using structural models based on economic theory as regularizers for statistical models. We show that even if a structural model is misspecified, as long as it is informative about the data-generating mechanism, our method can outperform both the (misspecified) structural model and un-structural-regularized statistical models. Our method permits a Bay…
▽ More
We propose a novel method for modeling data by using structural models based on economic theory as regularizers for statistical models. We show that even if a structural model is misspecified, as long as it is informative about the data-generating mechanism, our method can outperform both the (misspecified) structural model and un-structural-regularized statistical models. Our method permits a Bayesian interpretation of theory as prior knowledge and can be used both for statistical prediction and causal inference. It contributes to transfer learning by showing how incorporating theory into statistical modeling can significantly improve out-of-domain predictions and offers a way to synthesize reduced-form and structural approaches for causal effect estimation. Simulation experiments demonstrate the potential of our method in various settings, including first-price auctions, dynamic models of entry and exit, and demand estimation with instrumental variables. Our method has potential applications not only in economics, but in other scientific disciplines whose theoretical models offer important insight but are subject to significant misspecification concerns.
△ Less
Submitted 12 June, 2020; v1 submitted 27 April, 2020;
originally announced April 2020.
-
Stable and Efficient Structures in Multigroup Network Formation
Authors:
Shadi Mohagheghi,
Jingying Ma,
Francesco Bullo
Abstract:
In this work we present a strategic network formation model predicting the emergence of multigroup structures. Individuals decide to form or remove links based on the benefits and costs those connections carry; we focus on bilateral consent for link formation. An exogenous system specifies the frequency of coordination issues arising among the groups. We are interested in structures that arise to…
▽ More
In this work we present a strategic network formation model predicting the emergence of multigroup structures. Individuals decide to form or remove links based on the benefits and costs those connections carry; we focus on bilateral consent for link formation. An exogenous system specifies the frequency of coordination issues arising among the groups. We are interested in structures that arise to resolve coordination issues and, specifically, structures in which groups are linked through bridging, redundant, and co-membership interconnections. We characterize the conditions under which certain structures are stable and study their efficiency as well as the convergence of formation dynamics.
△ Less
Submitted 28 January, 2020;
originally announced January 2020.
-
Monotonicity-Constrained Nonparametric Estimation and Inference for First-Price Auctions
Authors:
Jun Ma,
Vadim Marmer,
Artyom Shneyerov,
Pai Xu
Abstract:
We propose a new nonparametric estimator for first-price auctions with independent private values that imposes the monotonicity constraint on the estimated inverse bidding strategy. We show that our estimator has a smaller asymptotic variance than that of Guerre, Perrigne and Vuong's (2000) estimator. In addition to establishing pointwise asymptotic normality of our estimator, we provide a bootstr…
▽ More
We propose a new nonparametric estimator for first-price auctions with independent private values that imposes the monotonicity constraint on the estimated inverse bidding strategy. We show that our estimator has a smaller asymptotic variance than that of Guerre, Perrigne and Vuong's (2000) estimator. In addition to establishing pointwise asymptotic normality of our estimator, we provide a bootstrap-based approach to constructing uniform confidence bands for the density function of latent valuations.
△ Less
Submitted 27 September, 2019;
originally announced September 2019.
-
Inference for First-Price Auctions with Guerre, Perrigne, and Vuong's Estimator
Authors:
Jun Ma,
Vadim Marmer,
Artyom Shneyerov
Abstract:
We consider inference on the probability density of valuations in the first-price sealed-bid auctions model within the independent private value paradigm. We show the asymptotic normality of the two-step nonparametric estimator of Guerre, Perrigne, and Vuong (2000) (GPV), and propose an easily implementable and consistent estimator of the asymptotic variance. We prove the validity of the pointwise…
▽ More
We consider inference on the probability density of valuations in the first-price sealed-bid auctions model within the independent private value paradigm. We show the asymptotic normality of the two-step nonparametric estimator of Guerre, Perrigne, and Vuong (2000) (GPV), and propose an easily implementable and consistent estimator of the asymptotic variance. We prove the validity of the pointwise percentile bootstrap confidence intervals based on the GPV estimator. Lastly, we use the intermediate Gaussian approximation approach to construct bootstrap-based asymptotically valid uniform confidence bands for the density of the valuations.
△ Less
Submitted 15 March, 2019;
originally announced March 2019.