-
Production Function Estimation without Invertibility: Imperfectly Competitive Environments and Demand Shocks
Authors:
Ulrich Doraszelski,
Lixiong Li
Abstract:
We advance the proxy variable approach to production function estimation. We show that the invertibility assumption at its heart is testable. We characterize what goes wrong if invertibility fails and what can still be done. We show that rethinking how the estimation procedure is implemented either eliminates or mitigates the bias that arises if invertibility fails. Furthermore, we show how a modi…
▽ More
We advance the proxy variable approach to production function estimation. We show that the invertibility assumption at its heart is testable. We characterize what goes wrong if invertibility fails and what can still be done. We show that rethinking how the estimation procedure is implemented either eliminates or mitigates the bias that arises if invertibility fails. Furthermore, we show how a modification of the procedure ensures Neyman orthogonality, enhancing efficiency and robustness by rendering the asymptotic distribution of the GMM estimator in the second step of the estimation procedure invariant to estimation noise from the first step.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
Comment on "Generic machine learning inference on heterogeneous treatment effects in randomized experiments."
Authors:
Kosuke Imai,
Michael Lingzhi Li
Abstract:
We analyze the split-sample robust inference (SSRI) methodology proposed by Chernozhukov, Demirer, Duflo, and Fernandez-Val (CDDF) for quantifying uncertainty in heterogeneous treatment effect estimation. While SSRI effectively accounts for randomness in data splitting, its computational cost can be prohibitive when combined with complex machine learning (ML) models. We present an alternative rand…
▽ More
We analyze the split-sample robust inference (SSRI) methodology proposed by Chernozhukov, Demirer, Duflo, and Fernandez-Val (CDDF) for quantifying uncertainty in heterogeneous treatment effect estimation. While SSRI effectively accounts for randomness in data splitting, its computational cost can be prohibitive when combined with complex machine learning (ML) models. We present an alternative randomization inference (RI) approach that maintains SSRI's generality without requiring repeated data splitting. By leveraging cross-fitting and design-based inference, RI achieves valid confidence intervals while significantly reducing computational burden. We compare the two methods through simulation, demonstrating that RI retains statistical efficiency while being more practical for large-scale applications.
△ Less
Submitted 10 February, 2025;
originally announced February 2025.
-
Trade, Trees, and Lives
Authors:
Xinming Du,
Lei Li,
Eric Zou
Abstract:
This paper shows a cascading mechanism through which international trade-induced deforestation results in a decline of health outcomes in cities distant from where trade activities occur. We examine Brazil, which has ramped up agricultural export over the last two decades to meet rising global demand. Using a shift-share research design, we first show that export shocks cause substantial local agr…
▽ More
This paper shows a cascading mechanism through which international trade-induced deforestation results in a decline of health outcomes in cities distant from where trade activities occur. We examine Brazil, which has ramped up agricultural export over the last two decades to meet rising global demand. Using a shift-share research design, we first show that export shocks cause substantial local agricultural expansion and a virtual one-for-one decline in forest cover. We then construct a dynamic area-of-effect model that predicts where atmospheric changes should be felt - due to loss of forests that would otherwise serve to filter out and absorb air pollutants as they travel - downwind of the deforestation areas. Leveraging quasi-random variation in these atmospheric connections, we establish a causal link between deforestation upstream and subsequent rises in air pollution and premature deaths downstream, with the mortality effects predominantly driven by cardiovascular and respiratory causes. Our estimates reveal a large telecoupled health externality of trade deforestation: over 700,000 premature deaths in Brazil over the past two decades. This equates to $0.18 loss in statistical life value per $1 agricultural exports over the study period.
△ Less
Submitted 20 November, 2024;
originally announced November 2024.
-
Co-benefits of Agricultural Diversification and Technology for Food and Nutrition Security in China
Authors:
Thomas Cherico Wanger,
Estelle Raveloaritiana,
Siyan Zeng,
Haixiu Gao,
Xueqing He,
Yiwen Shao,
Panlong Wu,
Kris A. G. Wyckhuys,
Wenwu Zhou,
Yi Zou,
Zengrong Zhu,
Ling Li,
Haiyan Cen,
Yunhui Liu,
Shenggen Fan
Abstract:
China is the leading crop producer and has successfully implemented sustainable development programs related to agriculture. Sustainable agriculture has been promoted to achieve national food security targets such as food self-sufficiency through the well-facilitated farmland construction (WFFC) approach. The WFFC is introduced in Chinas current national 10-year plan to consolidate farmlands into…
▽ More
China is the leading crop producer and has successfully implemented sustainable development programs related to agriculture. Sustainable agriculture has been promoted to achieve national food security targets such as food self-sufficiency through the well-facilitated farmland construction (WFFC) approach. The WFFC is introduced in Chinas current national 10-year plan to consolidate farmlands into large and simplified production areas to maximise automation, and improve soil fertility and productivity. However, research suggests that diversified and smaller farms faciliate ecosystem services, can improve yield resilience, defuse human health threats, and increase farm profitability. Currently, WFFC has not considered ecological farmland improvements and it may miss long-term environmental benefits including ecosystem service preservation conducive to yields. Moreover, the nutritional status in China has changed in recent decades with undernutrition being dramatically reduced, but the prevalence of overweight, obesity, and chronic diseases being increased. While a strategic choice and management of crop and livestock species can improve nutrition, the environmental and production benefits of agricultural diversification are currently not well interlinked with Chinas food and nutrition security discussions. Lastly, the role of agricultural technology for socioeconomic benefits and the link with diversified agricultural production may provide vast benefits for food security. Here, we focus on the opportunities and co-benefits of agricultural diversification and technology innovations to advance food and nutrition security in China through ecosystem service and yield benefits. Our applied five-point research agenda can provide evidence-based opportunities to support China in reaching its ambitious food security targets through agricultural diversification with global ramifications.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Modeling Link-level Road Traffic Resilience to Extreme Weather Events Using Crowdsourced Data
Authors:
Songhua Hu,
Kailai Wang,
Lingyao Li,
Yingrui Zhao,
Zhenbing He,
Yunpeng,
Zhang
Abstract:
Climate changes lead to more frequent and intense weather events, posing escalating risks to road traffic. Crowdsourced data offer new opportunities to monitor and investigate changes in road traffic flow during extreme weather. This study utilizes diverse crowdsourced data from mobile devices and the community-driven navigation app, Waze, to examine the impact of three weather events (i.e., flood…
▽ More
Climate changes lead to more frequent and intense weather events, posing escalating risks to road traffic. Crowdsourced data offer new opportunities to monitor and investigate changes in road traffic flow during extreme weather. This study utilizes diverse crowdsourced data from mobile devices and the community-driven navigation app, Waze, to examine the impact of three weather events (i.e., floods, winter storms, and fog) on road traffic. Three metrics, speed change, event duration, and area under the curve (AUC), are employed to assess link-level traffic change and recovery. In addition, a user's perceived severity is computed to evaluate link-level weather impact based on crowdsourced reports. This study evaluates a range of new data sources, and provides insights into the resilience of road traffic to extreme weather, which are crucial for disaster preparedness, response, and recovery in road transportation systems.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
Adjustment with Many Regressors Under Covariate-Adaptive Randomizations
Authors:
Liang Jiang,
Liyao Li,
Ke Miao,
Yichong Zhang
Abstract:
Our paper discovers a new trade-off of using regression adjustments (RAs) in causal inference under covariate-adaptive randomizations (CARs). On one hand, RAs can improve the efficiency of causal estimators by incorporating information from covariates that are not used in the randomization. On the other hand, RAs can degrade estimation efficiency due to their estimation errors, which are not asymp…
▽ More
Our paper discovers a new trade-off of using regression adjustments (RAs) in causal inference under covariate-adaptive randomizations (CARs). On one hand, RAs can improve the efficiency of causal estimators by incorporating information from covariates that are not used in the randomization. On the other hand, RAs can degrade estimation efficiency due to their estimation errors, which are not asymptotically negligible when the number of regressors is of the same order as the sample size. Ignoring the estimation errors of RAs may result in serious over-rejection of causal inference under the null hypothesis. To address the issue, we construct a new ATE estimator by optimally linearly combining the estimators with and without RAs. We then develop a unified inference theory for this estimator under CARs. It has two features: (1) the Wald test based on it achieves the exact asymptotic size under the null hypothesis, regardless of whether the number of covariates is fixed or diverges no faster than the sample size; and (2) it guarantees weak efficiency improvement over estimators both with and without RAs.
△ Less
Submitted 17 February, 2025; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Ambiguous Cheap Talk
Authors:
Longjian Li
Abstract:
This paper explores how ambiguity affects communication. We consider a cheap talk model in which the receiver evaluates the sender's message with respect to its worst-case expected payoff generated by multiplier preferences. We characterize the receiver's optimal strategy and show that the receiver's posterior action is consistent with his ex-ante action. We find that in some situations, ambiguity…
▽ More
This paper explores how ambiguity affects communication. We consider a cheap talk model in which the receiver evaluates the sender's message with respect to its worst-case expected payoff generated by multiplier preferences. We characterize the receiver's optimal strategy and show that the receiver's posterior action is consistent with his ex-ante action. We find that in some situations, ambiguity improves communication by shifting the receiver's optimal action upwards, and these situations are not rare.
△ Less
Submitted 18 September, 2022;
originally announced September 2022.
-
Incentivizing Hidden Types in Secretary Problem
Authors:
Longjian Li,
Alexis Akira Toda
Abstract:
We study a game between $N$ job applicants who incur a cost $c$ (relative to the job value) to reveal their type during interviews and an administrator who seeks to maximize the probability of hiring the best. We define a full learning equilibrium and prove its existence, uniqueness, and optimality. In equilibrium, the administrator accepts the current best applicant $n$ with probability $c$ if…
▽ More
We study a game between $N$ job applicants who incur a cost $c$ (relative to the job value) to reveal their type during interviews and an administrator who seeks to maximize the probability of hiring the best. We define a full learning equilibrium and prove its existence, uniqueness, and optimality. In equilibrium, the administrator accepts the current best applicant $n$ with probability $c$ if $n<n^*$ and with probability 1 if $n\ge n^*$ for a threshold $n^*$ independent of $c$. In contrast to the case without cost, where the success probability converges to $1/\mathrm{e}\approx 0.37$ as $N$ tends to infinity, with cost the success probability decays like $N^{-c}$.
△ Less
Submitted 22 July, 2024; v1 submitted 11 August, 2022;
originally announced August 2022.
-
Feature-based intermittent demand forecast combinations: bias, accuracy and inventory implications
Authors:
Li Li,
Yanfei Kang,
Fotios Petropoulos,
Feng Li
Abstract:
Intermittent demand forecasting is a ubiquitous and challenging problem in production systems and supply chain management. In recent years, there has been a growing focus on developing forecasting approaches for intermittent demand from academic and practical perspectives. However, limited attention has been given to forecast combination methods, which have achieved competitive performance in fore…
▽ More
Intermittent demand forecasting is a ubiquitous and challenging problem in production systems and supply chain management. In recent years, there has been a growing focus on developing forecasting approaches for intermittent demand from academic and practical perspectives. However, limited attention has been given to forecast combination methods, which have achieved competitive performance in forecasting fast-moving time series. The current study aims to examine the empirical outcomes of some existing forecast combination methods and propose a generalized feature-based framework for intermittent demand forecasting. The proposed framework has been shown to improve the accuracy of point and quantile forecasts based on two real data sets. Further, some analysis of features, forecasting pools and computational efficiency is also provided. The findings indicate the intelligibility and flexibility of the proposed approach in intermittent demand forecasting and offer insights regarding inventory decisions.
△ Less
Submitted 31 August, 2022; v1 submitted 18 April, 2022;
originally announced April 2022.
-
Finite Sample Inference in Incomplete Models
Authors:
Lixiong Li,
Marc Henry
Abstract:
We propose confidence regions for the parameters of incomplete models with exact coverage of the true parameter in finite samples. Our confidence region inverts a test, which generalizes Monte Carlo tests to incomplete models. The test statistic is a discrete analogue of a new optimal transport characterization of the sharp identified region. Both test statistic and critical values rely on simulat…
▽ More
We propose confidence regions for the parameters of incomplete models with exact coverage of the true parameter in finite samples. Our confidence region inverts a test, which generalizes Monte Carlo tests to incomplete models. The test statistic is a discrete analogue of a new optimal transport characterization of the sharp identified region. Both test statistic and critical values rely on simulation drawn from the distribution of latent variables and are computed using solutions to discrete optimal transport, hence linear programming problems. We also propose a fast preliminary search in the parameter space with an alternative, more conservative yet consistent test, based on a parameter free critical value.
△ Less
Submitted 30 April, 2024; v1 submitted 1 April, 2022;
originally announced April 2022.
-
Approximate Group Fairness for Clustering
Authors:
Bo Li,
Lijun Li,
Ankang Sun,
Chenhao Wang,
Yingfan Wang
Abstract:
We incorporate group fairness into the algorithmic centroid clustering problem, where $k$ centers are to be located to serve $n$ agents distributed in a metric space. We refine the notion of proportional fairness proposed in [Chen et al., ICML 2019] as {\em core fairness}, and $k$-clustering is in the core if no coalition containing at least $n/k$ agents can strictly decrease their total distance…
▽ More
We incorporate group fairness into the algorithmic centroid clustering problem, where $k$ centers are to be located to serve $n$ agents distributed in a metric space. We refine the notion of proportional fairness proposed in [Chen et al., ICML 2019] as {\em core fairness}, and $k$-clustering is in the core if no coalition containing at least $n/k$ agents can strictly decrease their total distance by deviating to a new center together. Our solution concept is motivated by the situation where agents are able to coordinate and utilities are transferable. A string of existence, hardness and approximability results is provided. Particularly, we propose two dimensions to relax core requirements: one is on the degree of distance improvement, and the other is on the size of deviating coalition. For both relaxations and their combination, we study the extent to which relaxed core fairness can be satisfied in metric spaces including line, tree and general metric space, and design approximation algorithms accordingly.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
An Integrated Vaccination Site Selection and Dose Allocation Problem with Fairness Concerns
Authors:
Mohammad Firouz,
Linda Li,
Daizy Ahmed,
Abdulaziz Ahmed
Abstract:
Fairness in vaccination is not only important from a social justice point of view, but experience has shown that a fair distribution of vaccine proves more effective in public immunization by preventing highly-concentrated infected areas to form among the population. In this paper, we address fairness from two simultaneous points of view: equity and accessibility. Equity in our setting means that…
▽ More
Fairness in vaccination is not only important from a social justice point of view, but experience has shown that a fair distribution of vaccine proves more effective in public immunization by preventing highly-concentrated infected areas to form among the population. In this paper, we address fairness from two simultaneous points of view: equity and accessibility. Equity in our setting means that as far as possible, each demand zone should receive a fair-share of the total doses available. On the other hand, accessibility means that as far as possible, each demand zone should have equal travel distance to access their assigned vaccination site.
△ Less
Submitted 10 November, 2021;
originally announced November 2021.
-
Bayesian forecast combination using time-varying features
Authors:
Li Li,
Yanfei Kang,
Feng Li
Abstract:
In this work, we propose a novel framework for density forecast combination by constructing time-varying weights based on time series features, which is called Feature-based Bayesian Forecasting Model Averaging (FEBAMA). Our framework estimates weights in the forecast combination via Bayesian log predictive scores, in which the optimal forecasting combination is determined by time series features…
▽ More
In this work, we propose a novel framework for density forecast combination by constructing time-varying weights based on time series features, which is called Feature-based Bayesian Forecasting Model Averaging (FEBAMA). Our framework estimates weights in the forecast combination via Bayesian log predictive scores, in which the optimal forecasting combination is determined by time series features from historical information. In particular, we use an automatic Bayesian variable selection method to add weight to the importance of different features. To this end, our approach has better interpretability compared to other black-box forecasting combination schemes. We apply our framework to stock market data and M3 competition data. Based on our structure, a simple maximum-a-posteriori scheme outperforms benchmark methods, and Bayesian variable selection can further enhance the accuracy for both point and density forecasts.
△ Less
Submitted 14 June, 2022; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Discordant Relaxations of Misspecified Models
Authors:
Lixiong Li,
Désiré Kédagni,
Ismaël Mourifié
Abstract:
In many set-identified models, it is difficult to obtain a tractable characterization of the identified set. Therefore, researchers often rely on non-sharp identification conditions, and empirical results are often based on an outer set of the identified set. This practice is often viewed as conservative yet valid because an outer set is always a superset of the identified set. However, this paper…
▽ More
In many set-identified models, it is difficult to obtain a tractable characterization of the identified set. Therefore, researchers often rely on non-sharp identification conditions, and empirical results are often based on an outer set of the identified set. This practice is often viewed as conservative yet valid because an outer set is always a superset of the identified set. However, this paper shows that when the model is refuted by the data, two sets of non-sharp identification conditions derived from the same model could lead to disjoint outer sets and conflicting empirical results. We provide a sufficient condition for the existence of such discordancy, which covers models characterized by conditional moment inequalities and the Artstein (1983) inequalities. We also derive sufficient conditions for the non-existence of discordant submodels, therefore providing a class of models for which constructing outer sets cannot lead to misleading interpretations. In the case of discordancy, we follow Masten and Poirier (2021) by developing a method to salvage misspecified models, but unlike them, we focus on discrete relaxations. We consider all minimum relaxations of a refuted model that restores data-consistency. We find that the union of the identified sets of these minimum relaxations is robust to detectable misspecifications and has an intuitive empirical interpretation.
△ Less
Submitted 26 April, 2024; v1 submitted 21 December, 2020;
originally announced December 2020.
-
Evolutionary dynamics of cryptocurrency transaction networks: An empirical study
Authors:
Jiaqi Liang,
Linjing Li,
Daniel Zeng
Abstract:
Cryptocurrency is a well-developed blockchain technology application that is currently a heated topic throughout the world. The public availability of transaction histories offers an opportunity to analyze and compare different cryptocurrencies. In this paper, we present a dynamic network analysis of three representative blockchain-based cryptocurrencies: Bitcoin, Ethereum, and Namecoin. By analyz…
▽ More
Cryptocurrency is a well-developed blockchain technology application that is currently a heated topic throughout the world. The public availability of transaction histories offers an opportunity to analyze and compare different cryptocurrencies. In this paper, we present a dynamic network analysis of three representative blockchain-based cryptocurrencies: Bitcoin, Ethereum, and Namecoin. By analyzing the accumulated network growth, we find that, unlike most other networks, these cryptocurrency networks do not always densify over time, and they are changing all the time with relatively low node and edge repetition ratios. Therefore, we then construct separate networks on a monthly basis, trace the changes of typical network characteristics (including degree distribution, degree assortativity, clustering coefficient, and the largest connected component) over time, and compare the three. We find that the degree distribution of these monthly transaction networks cannot be well fitted by the famous power-law distribution, at the same time, different currency still has different network properties, e.g., both Bitcoin and Ethereum networks are heavy-tailed with disassortative mixing, however, only the former can be treated as a small world. These network properties reflect the evolutionary characteristics and competitive power of these three cryptocurrencies and provide a foundation for future research.
△ Less
Submitted 26 August, 2018;
originally announced August 2018.
-
Stochastic Switching Games
Authors:
Liangchen Li,
Michael Ludkovski
Abstract:
We study nonzero-sum stochastic switching games. Two players compete for market dominance through controlling (via timing options) the discrete-state market regime $M$. Switching decisions are driven by a continuous stochastic factor $X$ that modulates instantaneous revenue rates and switching costs. This generates a competitive feedback between the short-term fluctuations due to $X$ and the mediu…
▽ More
We study nonzero-sum stochastic switching games. Two players compete for market dominance through controlling (via timing options) the discrete-state market regime $M$. Switching decisions are driven by a continuous stochastic factor $X$ that modulates instantaneous revenue rates and switching costs. This generates a competitive feedback between the short-term fluctuations due to $X$ and the medium-term advantages based on $M$. We construct threshold-type Feedback Nash Equilibria which characterize stationary strategies describing long-run dynamic equilibrium market organization. Two sequential approximation schemes link the switching equilibrium to (i) constrained optimal switching, (ii) multi-stage timing games. We provide illustrations using an Ornstein-Uhlenbeck $X$ that leads to a recurrent equilibrium $M^\ast$ and a Geometric Brownian Motion $X$ that makes $M^\ast$ eventually "absorbed" as one player eventually gains permanent advantage. Explicit computations and comparative statics regarding the emergent macroscopic market equilibrium are also provided.
△ Less
Submitted 10 July, 2018;
originally announced July 2018.
-
A General Method for Demand Inversion
Authors:
Lixiong Li
Abstract:
This paper describes a numerical method to solve for mean product qualities which equates the real market share to the market share predicted by a discrete choice model. The method covers a general class of discrete choice model, including the pure characteristics model in Berry and Pakes(2007) and the random coefficient logit model in Berry et al.(1995) (hereafter BLP). The method transforms the…
▽ More
This paper describes a numerical method to solve for mean product qualities which equates the real market share to the market share predicted by a discrete choice model. The method covers a general class of discrete choice model, including the pure characteristics model in Berry and Pakes(2007) and the random coefficient logit model in Berry et al.(1995) (hereafter BLP). The method transforms the original market share inversion problem to an unconstrained convex minimization problem, so that any convex programming algorithm can be used to solve the inversion. Moreover, such results also imply that the computational complexity of inverting a demand model should be no more than that of a convex programming problem. In simulation examples, I show the method outperforms the contraction mapping algorithm in BLP. I also find the method remains robust in pure characteristics models with near-zero market shares.
△ Less
Submitted 26 February, 2018; v1 submitted 12 February, 2018;
originally announced February 2018.
-
Diversification, economies of scope, and exports growth of Chinese firms
Authors:
Mercedes Campi,
Marco Dueñas,
Le Li,
Huabin Wu
Abstract:
In the 1990s, China started a process of structural reforms and of trade liberalization, which was followed by the accession to the World Trade Organization (WTO) in 2001. In this paper, we analyze trade patterns of Chinese firms for the period 2000-2006, characterized by a notable increase in exports volumes. Theoretically, in a more open economy, firms are expected to move from the production of…
▽ More
In the 1990s, China started a process of structural reforms and of trade liberalization, which was followed by the accession to the World Trade Organization (WTO) in 2001. In this paper, we analyze trade patterns of Chinese firms for the period 2000-2006, characterized by a notable increase in exports volumes. Theoretically, in a more open economy, firms are expected to move from the production of a set of less-competitive products towards more internationally competitive ones, which implies specialization. We study several stylized facts on the distribution of Chinese firms trade and growth rates, and we analyze whether firms have diversified or specialized their trade patterns between 2000 and 2006. We show that Chinese export patterns are very heterogeneous, that the volatility of growth rates depends on the level of exports, and that volatility is stronger after trade liberalization. Both, diversification in products and destinations have a positive impact on trade growth, but diversification of destinations has a stronger effect. We conclude that the success of Chinese exports is not only due to an increase in the intensive margin, related to the existence of economies of scale, but also due to an increase in the extensive margin, related to the existence of economies of scope.
△ Less
Submitted 23 January, 2018; v1 submitted 8 January, 2018;
originally announced January 2018.