Search | arXiv e-print repository

Comparing Misspecified Models with Big Data: A Variational Bayesian Perspective

Authors: Yong Li, Sushanta K. Mallick, Tao Zeng, Junxing Zhang

Abstract: Optimal data detection in massive multiple-input multiple-output (MIMO) systems often requires prohibitively high computational complexity. A variety of detection algorithms have been proposed in the literature, offering different trade-offs between complexity and detection performance. In recent years, Variational Bayes (VB) has emerged as a widely used method for addressing statistical inference in the context of massive data. This study focuses on misspecified models and examines the risk functions associated with predictive distributions derived from variational posterior distributions. These risk functions, defined as the expectation of the Kullback-Leibler (KL) divergence between the true data-generating density and the variational predictive distributions, provide a framework for assessing predictive performance. We propose two novel information criteria for predictive model comparison based on these risk functions. Under certain regularity conditions, we demonstrate that the proposed information criteria are asymptotically unbiased estimators of their respective risk functions. Through comprehensive numerical simulations and empirical applications in economics and finance, we demonstrate the effectiveness of these information criteria in comparing misspecified models in the context of massive data.

Submitted 1 July, 2025; originally announced July 2025.

arXiv:2506.09291 [pdf, ps, other]

Competition Complexity in Multi-Item Auctions: Beyond VCG and Regularity

Authors: Hedyeh Beyhaghi, Linda Cai, Yiding Feng, Yingkai Li, S. Matthew Weinberg

Abstract: We quantify the value of the monopoly's bargaining power in terms of competition complexity--that is, the number of additional bidders the monopoly must attract in simple auctions to match the expected revenue of the optimal mechanisms (cf. Bulow and Klemperer, 1996; Eden et al., 2017)--within the setting of multi-item auctions. We show that for simple auctions that sell items separately, the competition complexity is $Θ(\frac{n}α)$ in an environment with $n$ original bidders under the slightly stronger assumption of $α$-strong regularity, in contrast to the standard regularity assumption in the literature, which requires $Ω(n \cdot \ln \frac{m}{n})$ additional bidders (Feldman et al., 2018). This significantly reduces the value of learning the distribution to design the optimal mechanisms, especially in large markets with many items for sale. For simple auctions that sell items as a grand bundle, we establish a constant competition complexity bound in a single-bidder environment when the number of items is small or when the value distribution has a monotone hazard rate. Some of our competition complexity results also hold when we compete against the first best benchmark (i.e., optimal social welfare).

Submitted 10 June, 2025; originally announced June 2025.

arXiv:2505.04414 [pdf, other]

A Powerful Chi-Square Specification Test with Support Vectors

Authors: Yuhao Li, Xiaojun Song

Abstract: Specification tests, such as Integrated Conditional Moment (ICM) and Kernel Conditional Moment (KCM) tests, are crucial for model validation but often lack power in finite samples. This paper proposes a novel framework to enhance specification test performance using Support Vector Machines (SVMs) for direction learning. We introduce two alternative SVM-based approaches: one maximizes the discrepancy between nonparametric and parametric classes, while the other maximizes the separation between residuals and the origin. Both approaches lead to a $t$-type test statistic that converges to a standard chi-square distribution under the null hypothesis. Our method is computationally efficient and capable of detecting any arbitrary alternative. Simulation studies demonstrate its superior performance compared to existing methods, particularly in large-dimensional settings.

Submitted 7 May, 2025; originally announced May 2025.

arXiv:2505.01161 [pdf, other]

Model Checks in a Kernel Ridge Regression Framework

Authors: Yuhao Li

Abstract: We propose new reproducing kernel-based tests for model checking in conditional moment restriction models. By regressing estimated residuals on kernel functions via kernel ridge regression (KRR), we obtain a coefficient function in a reproducing kernel Hilbert space (RKHS) that is zero if and only if the model is correctly specified. We introduce two classes of test statistics: (i) projection-based tests, using RKHS inner products to capture global deviations, and (ii) random location tests, evaluating the KRR estimator at randomly chosen covariate points to detect local departures. The tests are consistent against fixed alternatives and sensitive to local alternatives at the $n^{-1/2}$ rate. When nuisance parameters are estimated, Neyman orthogonality projections ensure valid inference without repeated estimation in bootstrap samples. The random location tests are interpretable and can visualize model misspecification. Simulations show strong power and size control, especially in higher dimensions, outperforming existing methods.

Submitted 2 May, 2025; originally announced May 2025.

arXiv:2504.10389 [pdf, other]

Diversity-Fair Online Selection

Authors: Ming Hu, Yanzhi Li, Tongwen Wu

Abstract: Online selection problems frequently arise in applications such as crowdsourcing and employee recruitment. Existing research typically focuses on candidates with a single attribute. However, crowdsourcing tasks often require contributions from individuals across various demographics. Further motivated by the dynamic nature of crowdsourcing and hiring, we study the diversity-fair online selection problem, in which a recruiter must make real-time decisions to foster workforce diversity across many dimensions. We propose two scenarios for this problem. The fixed-capacity scenario, suited for short-term hiring for crowdsourced workers, provides the recruiter with a fixed capacity to fill temporary job vacancies. In contrast, in the unknown-capacity scenario, recruiters optimize diversity across recruitment seasons with increasing capacities, reflecting that the firm honors diversity consideration in a long-term employee acquisition strategy. By modeling the diversity over $d$ dimensions as a max-min fairness objective, we show that no policy can surpass a competitive ratio of $O(1/d^{1/3})$ for either scenario, indicating that any achievable result inevitably decays by some polynomial factor in $d$. To this end, we develop bilevel hierarchical randomized policies that ensure compliance with the capacity constraint. For the fixed-capacity scenario, leveraging marginal information about the arriving population allows us to achieve a competitive ratio of $1/(4\sqrt{d} \lceil \log_2 d \rceil)$. For the unknown-capacity scenario, we establish a competitive ratio of $Ω(1/d^{3/4})$ under mild boundedness conditions. In both bilevel hierarchical policies, the higher level determines ex-ante selection probabilities and then informs the lower level's randomized selection that ensures no loss in efficiency. Both policies prioritize core diversity and then adjust for underrepresented dimensions.

Submitted 14 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

arXiv:2503.17174 [pdf, other]

How to Promote Autonomous Driving with Evolving Technology: Business Strategy and Pricing Decision

Authors: Mingliang Li, Yanrong Li, Lai Wei, Wei Jiang, Zuo-Jun Max Shen

Abstract: Recently, autonomous driving system (ADS) has been widely adopted due to its potential to enhance travel convenience and alleviate traffic congestion, thereby improving the driving experience for consumers and creating lucrative opportunities for manufacturers. With the advancement of data sensing and control technologies, the reliability of ADS and the purchase intentions of consumers are continually evolving, presenting challenges for manufacturers in promotion and pricing decisions. To address this issue, we develop a two-stage game-theoretical model to characterize the decision-making processes of manufacturers and consumers before and after a technology upgrade. Considering the unique structural characteristics of ADS, which consists of driving software and its supporting hardware (SSH), we propose different business strategies for SSH (bundle or unbundle with the vehicle) and driving software (perpetual licensing or subscription) from the manufacturer's perspective. We find that, first, SSH strategies influence the optimal software strategies by changing the consumers' entry barriers to the ADS market. Specifically, for manufacturers with mature ADS technology, the bundle strategy provides consumers with a lower entry barrier by integrating SSH, making the flexible subscription model a dominant strategy; while perpetual licensing outperforms under the unbundle strategy. Second, the software strategies influence the optimal SSH strategy by altering consumers' exit barriers. Perpetual licensing imposes higher exit barriers; when combined with a bundle strategy that lowers entry barriers, it becomes a more advantageous choice for manufacturers with mature ADS technology. In contrast, the subscription strategy allows consumers to easily exit the market, making the bundle strategy advantageous only when a substantial proportion of consumers are compatible with ADS.

Submitted 21 March, 2025; originally announced March 2025.

arXiv:2502.15549 [pdf, other]

Blockchain innovation in promoting employment

Authors: David Lee Kuo Chuen, Yang Li

Abstract: Blockchain technology, though conceptualized in the early 1990s, only gained practical relevance with Bitcoin's launch in 2009. Recent advancements have demonstrated its transformative potential, particularly in the digital art and global payment sectors. Non-fungible tokens (NFTs) have redefined digital ownership, while financial institutions use blockchain to enhance cross-border transactions, reducing costs and settlement times. Using the Diamond-Mortensen-Pissarides (DMP) model, this paper examines blockchain's impact on labor markets by improving job-matching efficiency, thereby reducing unemployment. However, high research costs and competition with incumbent technologies hinder early-stage blockchain adoption. We extend the DMP model to analyze the role of government intervention through tax and wage policies in mitigating these barriers. Our findings suggest that lowering firm tax rates can accelerate blockchain innovation, enhance labor market efficiency, and promote employment growth, highlighting the critical balance between technological progress and economic policy in fostering blockchain-driven economic transformation.

Submitted 21 February, 2025; originally announced February 2025.

arXiv:2502.14261 [pdf]

SOE's ESG Performance on Financial Flexibility: The Evidence from the Hong Kong Stock Market

Authors: Yan Li

Abstract: As the global economic environment becomes increasingly unstable, enhancing financial flexibility to cope with risks has become the consensus of many companies. At the same time, environmental, social, and governance (ESG) performance may be one effective way to achieve this. We studied the impact of a firm's ESG performance on its financial flexibility with a sample of companies listed on the Hong Kong stock market from 2018 to 2022. The empirical results show that good environmental, social, and governance performance can significantly improve a firm's financial flexibility. In addition, this paper finds that the influence of ESG performance on financial flexibility is weak for state-owned enterprises due to the influence of governance structure and market characteristics. Finally, further analysis shows that financing constraints play a mediating role in this process. This study can provide background information for state-owned enterprises' governance, information disclosure, and corporate operations. It also has guiding significance for relevant investors, management and officials.

Submitted 19 February, 2025; originally announced February 2025.

Comments: 15 pages; 0 figures

arXiv:2502.14257 [pdf]

Community Bank Establishment and Consumption Growth: Evidence from Panel Study of Income Dynamics in USA

Authors: Yan Li

Abstract: Consumption is a primary source of economic growth and a key indicator of poverty. The establishment of community banks can provide credit resources, unlocking household consumption potential and playing a crucial role in economic development. This study explores the role of community banks in promoting consumption by using data from the Panel Study of Income Dynamics (PSID) for 11 waves from 1980 to 1990, and constructing a fixed-effects model using the time-varying difference-in-differences (DID) method. The findings indicate that the establishment of community banks effectively stimulates growth in local household consumption, primarily by increasing household income and reducing precautionary savings. Therefore, both government and financial institutions should further promote the development of regional financial institutions and credit tools to promote economic growth.

Submitted 19 February, 2025; originally announced February 2025.

Comments: 15 pages, 2 figures

arXiv:2502.13423 [pdf]

The Policy Paradox: Government Debt Servicing and Local Bank Risk Growth

Authors: Yan Li

Abstract: The issue of local government debt is widely recognized as one of the "gray rhinos" affecting the stable development of China's economy. Government debt can transmit risks to local banks, which are among the primary holders of local debt, thereby triggering systemic financial risks. Consequently, exploring debt resolution pathways and evaluating the systematic effects of debt servicing policies has become critically important. This study employs panel data from 348 local commercial banks across 29 provincial-level administrative regions in China from 2010 to 2023, and constructs a difference-in-differences (DID) model to investigate the impact of the State Council's special supervision of debt servicing on local bank risks. The findings indicate that the government's debt servicing policy essentially represents a shift of government debt from explicit to implicit forms, significantly increasing the risks faced by local banks and producing outcomes contrary to the policy's original intent. This effect is particularly pronounced for rural commercial banks and banks with high customer concentration and fewer branches. Mechanism analysis reveals two key insights. First, local banks are heavily influenced by local government control; the government's debt servicing requires banks to support the government by purchasing government bonds and other financial instruments, which leads to a deterioration in asset quality and an expansion of risk exposure. Second, government debt crowds out private credit from local banks, weakening the region's repayment capacity and ultimately increasing bank risk. Our research uncovers the counterintuitive effects of government debt servicing and offers corresponding policy recommendations.

Submitted 8 April, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

Comments: 16 pages, 3 figures

arXiv:2502.09095 [pdf]

Blockchain-based Ecommerce It's an Evolution NOT a Revolution-Experimental Evidence from Users' Perspective

Authors: David Lee Kuo Chuen, Yang Li, Weibiao Xu, Willy Zhao

Abstract: Proponents of blockchains believe that this technology will revolutionize e-commerce. To evaluate this belief, we invite several groups of students to transact on a decentralized peer-to-peer marketplace built on the platform provided by Origin Protocol Inc., and then we conduct a survey about their experience of usage. Based on our survey results, we find that 33% of respondents play tricks on others, which implies that this undesirable result may hinder the widespread adoption of blockchain technologies. We also attempt to propose a conceptual mechanism to mitigate fraudulent behaviors. In the event of a dispute, a trusted authority is entitled to downgrade the fraudulent side's credit record, which is stored by a permissioned blockchain accessed only by the authority. Such a punishment can effectively decrease agents' incentives to sell counterfeits and leave fake ratings. In sum, we must distinguish what we proposed blockchains will do and what blockchains can do before enabling this technology in e-commerce.

Submitted 13 February, 2025; originally announced February 2025.

arXiv:2412.20176 [pdf]

The impact of China's economic growth on poverty alleviation: From absolute to relative poverty

Authors: Yixun Kang, Ying Li

Abstract: This paper investigates the extent to which China's economic growth and development influence poverty levels, focusing on the dichotomy between absolute and relative poverty. Leveraging data from sources like the World Bank, Statista, and Macrotrends, and employing economic frameworks such as the Lewis Model, Poverty Headcount Ratio, and Gini Coefficient, the study examines China's transformation from combating absolute poverty to addressing relative poverty. The findings highlight that robust economic growth from 2011 to 2022, driven by urban development and rural infrastructure investments, successfully eradicated absolute poverty and elevated rural incomes. However, this progress also exacerbated income inequality, as evidenced by a rising Gini Coefficient, complicating efforts to alleviate relative poverty. Through multidimensional analyses encompassing regional disparities, migration patterns, educational access, and societal factors, the paper underscores the dual impact of economic development on poverty alleviation. It concludes by advocating for policies that balance economic growth with equitable resource distribution to tackle persistent relative poverty and foster sustainable development.

Submitted 28 December, 2024; originally announced December 2024.

arXiv:2412.12676 [pdf, other]

Raising Bidders' Awareness in Second-Price Auctions

Authors: Ying Xue Li, Burkhard C. Schipper

Abstract: When bidders bid on complex objects, they might be unaware of characteristics affecting their valuations. We assume that each buyer's valuation is a sum of independent random variables, one for each characteristic. When a bidder is unaware of a characteristic, he omits the random variable from the sum. We study the seller's decision to raise bidders' awareness of characteristics before a second-price auction with entry fees. Optimal entry fees capture an additional unawareness rent due to unaware bidders misperceiving their probability of winning and the price to be paid upon winning. When raising a bidder's individual awareness of a characteristic with positive expected value, the seller faces a trade-off between positive effects on the expected first order statistic and unawareness rents of remaining unaware bidders on one hand and the loss of the unawareness rent from the newly aware bidder on the other. We present characterization results on raising public awareness together with no versus full information. We discuss the winner's curse due to unawareness of characteristics.

Submitted 17 December, 2024; originally announced December 2024.

Comments: 29 pages

arXiv:2409.00843 [pdf, other]

Global Public Sentiment on Decentralized Finance: A Spatiotemporal Analysis of Geo-tagged Tweets from 150 Countries

Authors: Yuqi Chen, Yifan Li, Kyrie Zhixuan Zhou, Xiaokang Fu, Lingbo Liu, Shuming Bao, Daniel Sui, Luyao Zhang

Abstract: Blockchain technology and decentralized finance (DeFi) are reshaping global financial systems. Despite their impact, the spatial distribution of public sentiment and its economic and geopolitical determinants are often overlooked. This study analyzes over 150 million geo-tagged, DeFi-related tweets from 2012 to 2022, sourced from a larger dataset of 7.4 billion tweets. Using sentiment scores from a BERT-based multilingual classification model, we integrated these tweets with economic and geopolitical data to create a multimodal dataset. Employing techniques like sentiment analysis, spatial econometrics, clustering, and topic modeling, we uncovered significant global variations in DeFi engagement and sentiment. Our findings indicate that economic development significantly influences DeFi engagement, particularly after 2015. Geographically weighted regression analysis revealed GDP per capita as a key predictor of DeFi tweet proportions, with its impact growing following major increases in cryptocurrency values such as bitcoin. While wealthier nations are more actively engaged in DeFi discourse, the lowest-income countries often discuss DeFi in terms of financial security and sudden wealth. Conversely, middle-income countries relate DeFi to social and religious themes, whereas high-income countries view it mainly as a speculative instrument or entertainment. This research advances interdisciplinary studies in computational social science and finance and supports open science by making our dataset and code available on GitHub, and providing a non-code workflow on the KNIME platform. These contributions enable a broad range of scholars to explore DeFi adoption and sentiment, aiding policymakers, regulators, and developers in promoting financial inclusion and responsible DeFi engagement globally.

Submitted 3 February, 2025; v1 submitted 1 September, 2024; originally announced September 2024.

arXiv:2405.20423 [pdf, other]

Dynamics and Contracts for an Agent with Misspecified Beliefs

Authors: Yingkai Li, Argyris Oikonomou

Abstract: We study a single-agent contracting environment where the agent has misspecified beliefs about the outcome distributions for each chosen action. First, we show that for a myopic Bayesian learning agent with only two possible actions, the empirical frequency of the chosen actions converges to a Berk-Nash equilibrium. However, through a constructed example, we illustrate that this convergence in action frequencies fails when the agent has three or more actions. Furthermore, with multiple actions, even computing an $\varepsilon$-Berk-Nash equilibrium requires at least quasi-polynomial time under the Exponential Time Hypothesis (ETH) for the PPAD-class. This finding poses a significant challenge to the existence of simple learning dynamics that converge in action frequencies. Motivated by this challenge, we focus on the contract design problems for an agent with misspecified beliefs and two possible actions. We show that the revenue-optimal contract, under a Berk-Nash equilibrium, can be computed in polynomial time. Perhaps surprisingly, we show that even a minor degree of misspecification can result in a significant reduction in optimal revenue.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.18521 [pdf, other]

Information Acquisition Towards Unanimous Consent

Authors: Yingkai Li, Boli Xu

Abstract: A manager facing a task of unknown difficulty can propose a plan to let a worker undertake the task; the worker can either accept the proposal or reject it. The plan benefits the worker only when the task is sufficiently easy and benefits the manager only when it is sufficiently hard. The manager can conduct a test at no cost to acquire information about the difficulty of the task; however, she can misreport the test result to the worker. We find that it is optimal for the manager to conduct a threshold test and to propose the plan only when the difficulty of the task exceeds the threshold. Moreover, when the worker privately knows his capability, we find that the manager can benefit from screening the worker by offering up to two additional interval tests.

Submitted 29 March, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.12804 [pdf, ps, other]

The Machiavellian frontier of stable mechanisms

Authors: Qiufu Chen, Yuanmei Li, Xiaopeng Yin, Luosai Zhang, Siyi Zhou

Abstract: The impossibility theorem in Roth (1982) states that no stable mechanism satisfies strategy-proofness. This paper explores the Machiavellian frontier of stable mechanisms by weakening strategy-proofness. For a fixed mechanism $\varphi$ and a true preference profile $\succ$, a $(\varphi,\succ)$-boost misrepresentation of agent $i$ is a preference of $i$ that is obtained by (i) raising the ranking of the truth-telling assignment $\varphi_i(\succ)$, and (ii) keeping rankings unchanged above the new position of this truth-telling assignment. We require that a matching mechanism $\varphi$ neither punish nor reward any such misrepresentation, and define this axiom as $\varphi$-boost-invariance. This is strictly weaker than requiring strategy-proofness. We show that no stable mechanism $\varphi$ satisfies $\varphi$-boost-invariance. Our negative result strengthens the Roth Impossibility Theorem.

Submitted 12 July, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

arXiv:2403.08145 [pdf, other]

Algorithmic Information Disclosure in Optimal Auctions

Authors: Yang Cai, Yingkai Li, Jinzhao Wu

Abstract: This paper studies a joint design problem where a seller can design both the signal structures for the agents to learn their values, and the allocation and payment rules for selling the item. In his seminal work, Myerson (1981) shows how to design the optimal auction with exogenous signals. We show that the problem becomes NP-hard when the seller also has the ability to design the signal structures. Our main result is a polynomial-time approximation scheme (PTAS) for computing the optimal joint design with at most an $ε$ multiplicative loss in expected revenue. Moreover, we show that in our joint design problem, the seller can significantly reduce the information rent of the agents by providing partial information, which ensures a revenue that is at least $1 - \frac{1}{e}$ of the optimal welfare for all valuation distributions.

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2402.18392 [pdf, other]

Unveiling the Potential of Robustness in Selecting Conditional Average Treatment Effect Estimators

Authors: Yiyan Huang, Cheuk Hang Leung, Siyi Wang, Yijun Li, Qi Wu

Abstract: The growing demand for personalized decision-making has led to a surge of interest in estimating the Conditional Average Treatment Effect (CATE). Various types of CATE estimators have been developed with advancements in machine learning and causal inference. However, selecting the desirable CATE estimator through a conventional model validation procedure remains impractical due to the absence of counterfactual outcomes in observational data. Existing approaches for CATE estimator selection, such as plug-in and pseudo-outcome metrics, face two challenges. First, they must determine the metric form and the underlying machine learning models for fitting nuisance parameters (e.g., outcome function, propensity function, and plug-in learner). Second, they lack a specific focus on selecting a robust CATE estimator. To address these challenges, this paper introduces a Distributionally Robust Metric (DRM) for CATE estimator selection. The proposed DRM is nuisance-free, eliminating the need to fit models for nuisance parameters, and it effectively prioritizes the selection of a distributionally robust CATE estimator. The experimental results validate the effectiveness of the DRM method in selecting CATE estimators that are robust to the distribution shift incurred by covariate shift and hidden confounders.

Submitted 31 October, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: This paper was accepted by NeurIPS-2024

arXiv:2310.19147 [pdf, other]

Incentivizing Forecasters to Learn: Summarized vs. Unrestricted Advice

Authors: Yingkai Li, Jonathan Libgober

Abstract: How should forecasters be incentivized to acquire the most information when learning takes place over time? We address this question in the context of a novel dynamic mechanism design problem where a designer can incentivize learning by conditioning a reward on an event's outcome and expert reports. Eliciting summarized advice at a terminal date maximizes information acquisition if an informative signal fully reveals the outcome or has predictable content. Otherwise, richer reporting capabilities may be required. Our findings shed light on incentive design for consultation and forecasting by illustrating how learning dynamics shape qualitative properties of effort-maximizing contracts.

Submitted 11 April, 2025; v1 submitted 29 October, 2023; originally announced October 2023.

Comments: A preliminary version of this paper has been accepted in the Twenty-Fifth ACM Conference on Economics and Computation (EC'24) as a one-page abstract with the title "Optimal Scoring for Dynamic Information Acquisition."

arXiv:2310.10024 [pdf, other]

Managing Persuasion Robustly: The Optimality of Quota Rules

Authors: Dirk Bergemann, Tan Gan, Yingkai Li

Abstract: We study a sender-receiver model where the receiver can commit to a decision rule before the sender determines the information policy. The decision rule can depend on the signal structure and the signal realization that the sender adopts. This framework captures applications where a decision-maker (the receiver) solicits advice from an interested party (the sender). In these applications, the receiver faces uncertainty regarding the sender's preferences and the set of feasible signal structures. Consequently, we adopt a unified robust analysis framework that includes max-min utility, min-max regret, and min-max approximation ratio as special cases. We show that it is optimal for the receiver to sacrifice ex-post optimality to perfectly align the sender's incentive. The optimal decision rule is a quota rule, i.e., the decision rule maximizes the receiver's ex-ante payoff subject to the constraint that the marginal distribution over actions adheres to a consistent quota, regardless of the sender's chosen signal structure.

Submitted 15 October, 2023; originally announced October 2023.

arXiv:2308.05201 [pdf, ps, other]

"Generate" the Future of Work through AI: Empirical Evidence from Online Labor Markets

Authors: Jin Liu, Xingchen Xu, Xi Nan, Yongjun Li, Yong Tan

Abstract: Large Language Model (LLM)-based generative AI systems, such as ChatGPT, demonstrate zero-shot learning capabilities across a wide range of downstream tasks. Owing to their general-purpose nature and potential to augment or even automate job functions, these systems are poised to reshape labor market dynamics. However, predicting their precise impact a priori is challenging, given AI's simultaneous effects on both demand and supply, as well as the strategic responses of market participants. Leveraging an extensive dataset from a leading online labor platform, we document a pronounced displacement effect and an overall contraction in submarkets where required skills closely align with core LLM functionalities. Although demand and supply both decline, the reduction in supply is comparatively smaller, thereby intensifying competition among freelancers. Notably, further analysis shows that this heightened competition is especially pronounced in programming-intensive submarkets. This pattern is attributed to skill-transition effects: by lowering the human-capital barrier to programming, ChatGPT enables incumbent freelancers to enter programming tasks. Moreover, these transitions are not homogeneous, with high-skilled freelancers contributing disproportionately to the shift. Our findings illuminate the multifaceted impacts of general-purpose AI on labor markets, highlighting not only the displacement of certain occupations but also the inducement of skill transitions within the labor supply. These insights offer practical implications for policymakers, platform operators, and workers.

Submitted 18 June, 2025; v1 submitted 9 August, 2023; originally announced August 2023.

Comments: 92 pages, 16 figures, 34 tables

ACM Class: J.4

arXiv:2306.00869 [pdf, other]

Blockchain-based Decentralized Co-governance: Innovations and Solutions for Sustainable Crowdfunding

Authors: Bingyou Chen, Yu Luo, Jieni Li, Yujian Li, Ying Liu, Fan Yang, Junge Bo, Yanan Qiao

Abstract: This thesis provides an in-depth exploration of the Decentralized Co-governance Crowdfunding (DCC) Ecosystem, a novel solution addressing prevailing challenges in conventional crowdfunding methods faced by MSMEs and innovative projects. Among the problems it seeks to mitigate are high transaction costs, lack of transparency, fraud, and inefficient resource allocation. Leveraging a comprehensive review of the existing literature on crowdfunding economic activities and blockchain's impact on organizational governance, we propose a transformative socio-economic model based on digital tokens and decentralized co-governance. This ecosystem is marked by a tripartite community structure - the Labor, Capital, and Governance communities - each contributing uniquely to the ecosystem's operation. Our research unfolds the evolution of the DCC ecosystem through distinct phases, offering a novel understanding of socioeconomic dynamics in a decentralized digital world. It also delves into the intricate governance mechanism of the ecosystem, ensuring integrity, fairness, and a balanced distribution of value and wealth.

Submitted 2 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

arXiv:2302.11829 [pdf, other]

Learning to Manipulate a Commitment Optimizer

Authors: Yurong Chen, Xiaotie Deng, Jiarui Gan, Yuhao Li

Abstract: It is shown in recent studies that in a Stackelberg game the follower can manipulate the leader by deviating from their true best-response behavior. Such manipulations are computationally tractable and can be highly beneficial for the follower. Meanwhile, they may result in significant payoff losses for the leader, sometimes completely defeating their first-mover advantage. A warning to commitment optimizers, the risk these findings indicate appears to be alleviated to some extent by a strict information advantage the manipulations rely on. That is, the follower knows the full information about both players' payoffs whereas the leader only knows their own payoffs. In this paper, we study the manipulation problem with this information advantage relaxed. We consider the scenario where the follower is not given any information about the leader's payoffs to begin with but has to learn to manipulate by interacting with the leader. The follower can gather necessary information by querying the leader's optimal commitments against contrived best-response behaviors. Our results indicate that the information advantage is not entirely indispensable to the follower's manipulations: the follower can learn the optimal way to manipulate in polynomial time with polynomially many queries of the leader's optimal commitment.

Submitted 26 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

arXiv:2302.09168 [pdf, ps, other]

Screening Signal-Manipulating Agents via Contests

Authors: Yingkai Li, Xiaoyun Qiu

Abstract: We study the design of screening mechanisms subject to competition and manipulation. A social planner has limited resources to allocate to multiple agents using only signals manipulable through unproductive effort. We show that the welfare-maximizing mechanism takes the form of a contest and characterize the optimal contest. We apply our results to two settings: either the planner has one item or a number of items proportional to the number of agents. We show that in both settings, with sufficiently many agents, a winner-takes-all contest is never optimal. In particular, the planner always benefits from randomizing the allocation to some agents.

Submitted 3 February, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

arXiv:2302.02476 [pdf, other]

Estimating Time-Varying Networks for High-Dimensional Time Series

Authors: Jia Chen, Degui Li, Yuning Li, Oliver Linton

Abstract: We explore time-varying networks for high-dimensional locally stationary time series, using the large VAR model framework with both the transition and (error) precision matrices evolving smoothly over time. Two types of time-varying graphs are investigated: one containing directed edges of Granger causality linkages, and the other containing undirected edges of partial correlation linkages. Under the sparse structural assumption, we propose a penalised local linear method with time-varying weighted group LASSO to jointly estimate the transition matrices and identify their significant entries, and a time-varying CLIME method to estimate the precision matrices. The estimated transition and precision matrices are then used to determine the time-varying network structures. Under some mild conditions, we derive the theoretical properties of the proposed estimates including the consistency and oracle properties. In addition, we extend the methodology and theory to cover highly-correlated large-scale time series, for which the sparsity assumption becomes invalid and we allow for common factors before estimating the factor-adjusted time-varying networks. We provide extensive simulation studies and an empirical application to a large U.S. macroeconomic dataset to illustrate the finite-sample performance of our methods.

Submitted 5 February, 2023; originally announced February 2023.

arXiv:2211.06850 [pdf, other]

Approximate Optimality of Linear Contracts Under Uncertainty

Authors: Tal Alon, Paul Dütting, Yingkai Li, Inbal Talgam-Cohen

Abstract: We consider a hidden-action principal-agent model, in which actions require different amounts of effort, and the agent privately knows his ability that determines his cost of effort. We show that linear contracts admit approximation guarantees that improve with a natural metric that captures the degree of uncertainty in the contracting setting. We thus show that linear contracts are near-optimal whenever there is enough uncertainty. In contrast, other simple contract formats such as debt contracts may suffer from a loss linear in the number of possible actions, even when there is sufficient uncertainty.

Submitted 4 March, 2025; v1 submitted 13 November, 2022; originally announced November 2022.

Comments: A one-page abstract of this paper was accepted at EC 2023 under the title "Bayesian Analysis of Linear Contracts"

arXiv:2211.03302 [pdf, ps, other]

Optimal Scoring Rules for Multi-dimensional Effort

Authors: Jason D. Hartline, Liren Shan, Yingkai Li, Yifan Wu

Abstract: This paper develops a framework for the design of scoring rules to optimally incentivize an agent to exert a multi-dimensional effort. This framework is a generalization to strategic agents of the classical knapsack problem (cf. Briest, Krysta, and Vöcking, 2005, Singer, 2010) and it is foundational to applying algorithmic mechanism design to the classroom. The paper identifies two simple families of scoring rules that guarantee constant approximations to the optimal scoring rule. The truncated separate scoring rule is the sum of single dimensional scoring rules that is truncated to the bounded range of feasible scores. The threshold scoring rule gives the maximum score if reports exceed a threshold and zero otherwise. Approximate optimality of one or the other of these rules is similar to the bundling or selling separately result of Babaioff, Immorlica, Lucier, and Weinberg (2014). Finally, we show that the approximate optimality of the best of those two simple scoring rules is robust when the agent's choice of effort is made sequentially.

Submitted 29 June, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

arXiv:2210.16525 [pdf, other]

Spectral Representation Learning for Conditional Moment Models

Authors: Ziyu Wang, Yucen Luo, Yueru Li, Jun Zhu, Bernhard Schölkopf

Abstract: Many problems in causal inference and economics can be formulated in the framework of conditional moment models, which characterize the target function through a collection of conditional moment restrictions. For nonparametric conditional moment models, efficient estimation often relies on preimposed conditions on various measures of ill-posedness of the hypothesis space, which are hard to validate when flexible models are used. In this work, we address this issue by proposing a procedure that automatically learns representations with controlled measures of ill-posedness. Our method approximates a linear representation defined by the spectral decomposition of a conditional expectation operator, which can be used for kernelized estimators and is known to facilitate minimax optimal estimation in certain settings. We show this representation can be efficiently estimated from data, and establish L2 consistency for the resulting estimator. We evaluate the proposed method on proximal causal inference tasks, exhibiting promising performance on high-dimensional, semi-synthetic data.

Submitted 28 December, 2022; v1 submitted 29 October, 2022; originally announced October 2022.

arXiv:2209.08211 [pdf]

Local political control in educational policy: Evidence from decentralized teacher pay reform under England's local education authorities

Authors: Yiang Li, Xingzuo Zhou

Abstract: In 2012, the School Teachers' Review Body discontinued central guidance and allowed school discretion in determining teachers' pay in England. Meanwhile, local education authorities (LEAs) offer non-statutory teacher pay recommendations to LEA-controlled schools. This study examines how LEAs' political party control determines their guidance regarding whether to adopt flexible performance pay or continue seniority-based pay. A regression discontinuity design is used to address the endogeneity of political control and educational policy-making. We find that marginally Conservative-controlled LEAs are more inclined to recommend market-oriented flexible pay structures. The results remain robust to alternative specifications. This study reveals that politics matter in England's local educational policy-making, which has broad implications for future policy.

Submitted 16 September, 2022; originally announced September 2022.

Comments: submitted to Royal Statistical Society: Series A

arXiv:2206.13119 [pdf, ps, other]

Optimal Private Payoff Manipulation against Commitment in Extensive-form Games

Authors: Yurong Chen, Xiaotie Deng, Yuhao Li

Abstract: To take advantage of strategy commitment, a useful tactic of playing games, a leader must learn enough information about the follower's payoff function. However, this leaves the follower a chance to provide fake information and influence the final game outcome. Through a carefully contrived payoff function misreported to the learning leader, the follower may induce an outcome that benefits him more, compared to the ones when he truthfully behaves. We study the follower's optimal manipulation via such strategic behaviors in extensive-form games. Followers' different attitudes are taken into account. An optimistic follower maximizes his true utility among all game outcomes that can be induced by some payoff function. A pessimistic follower only considers misreporting payoff functions that induce a unique game outcome. For all the settings considered in this paper, we characterize all the possible game outcomes that can be induced successfully. We show that it is polynomial-time tractable for the follower to find the optimal way of misreporting his private payoff information. Our work completely resolves this follower's optimal manipulation problem on an extensive-form game tree.

Submitted 13 June, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

arXiv:2202.06191 [pdf, other]

Exploration and Incentivizing Participation in Randomized Trials

Authors: Yingkai Li, Aleksandrs Slivkins

Abstract: Participation incentives is a well-known issue inhibiting randomized controlled trials (RCTs) in medicine, as well as a potential cause of user dissatisfaction for RCTs in online platforms. We frame this issue as a non-standard exploration-exploitation tradeoff: an RCT would like to explore as uniformly as possible, whereas each "agent" (a patient or a user) prefers "exploitation", i.e., treatments that seem best. We incentivize participation by leveraging information asymmetry between the trial and the agents. We measure statistical performance via worst-case estimation error under adversarially generated outcomes, a standard objective for RCTs. We obtain a near-optimal solution in terms of this objective: an incentive-compatible mechanism with a particular guarantee, and a nearly matching impossibility result for any incentive-compatible mechanism. We consider three model variants: homogeneous agents (of the same "type" comprising beliefs and preferences), heterogeneous agents, and an extension with estimated type frequencies.

Submitted 9 March, 2025; v1 submitted 12 February, 2022; originally announced February 2022.

Comments: Previous versions focused on clinical RCTs as an application domain, and were titled "Exploration and Incentivizing Participation in Clinical Trials" and "Incentivizing Participation in Clinical Trials" (pre-2024)

arXiv:2110.11581 [pdf]

A Two-stage Pricing Strategy Considering Learning Effects and Word-of-Mouth

Authors: Yanrong Li, Lai Wei, Wei Jiang

Abstract: This paper proposes a two-stage pricing strategy for nondurable (such as typical electronics) products, where retail price is cut down at certain time points of the product lifecycle. We consider learning effect of electronic products that, with the accumulation of production, average production cost decreases over time as manufacturers get familiar with the production process. Moreover, word-of-mouth (WOM) of existing customers is used to analyze future demand, which is sensitive to the difference between the actual reliability and the perceived reliability of products. We theoretically prove the existence and uniqueness of the optimal switch time between the two stages and the optimal price in each stage. In addition, warranty as another important factor of electronic products is also considered, whose interaction with word-of-mouth as well as the corresponding influences on total profit are analyzed. Interestingly, our findings indicate that (1) the main reason for manufacturers to cut down prices for electronic products pertains to the learning effects; (2) even though both internal factors (e.g., the learning effects of manufacturers) and external factors (e.g., the price elasticity of customers) have impacts on product price, their influence on manufacturer's profit is widely divergent; (3) generally warranty weakens the influence of external advertising on the reliability estimate, because warranty price only partially reflects the actual reliability information of products; and (4) the optimal warranty price can increase the profits for the manufacturer by approximately 10%.

Submitted 22 October, 2021; originally announced October 2021.

arXiv:2110.09673 [pdf, other]

Bridging the short-term and long-term dynamics of economic structural change

Authors: James McNerney, Yang Li, Andres Gomez-Lievano, Frank Neffke

Abstract: Economic transformation -- change in what an economy produces -- is foundational to development and rising standards of living. Our understanding of this process has been propelled recently by two branches of work in the field of economic complexity, one studying how economies diversify, the other how the complexity of an economy is expressed in the makeup of its output. However, the connection between these branches is not well understood, nor how they relate to a classic understanding of structural transformation. Here, we present a simple dynamical modeling framework that unifies these areas of work, based on the widespread observation that economies diversify preferentially into activities that are related to ones they do already. We show how stylized facts of long-run structural change, as well as complexity metrics, can both emerge naturally from this one observation. However, complexity metrics take on new meanings, as descriptions of the long-term changes an economy experiences rather than measures of complexity per se. This suggests relatedness and complexity metrics are connected, in a hitherto overlooked way: Both describe structural change, on different time scales. Whereas relatedness probes transformation on short time scales, complexity metrics capture long-term change.

Submitted 24 March, 2023; v1 submitted 18 October, 2021; originally announced October 2021.

Comments: 18 pages + 11 pages supplementary text, 12 figures

arXiv:2109.13971 [pdf]

doi 10.1080/21645515.2021.2017216

Forecasting the COVID-19 vaccine uptake rate: An infodemiological study in the US

Authors: Xingzuo Zhou, Yiang Li

Abstract: A year following the initial COVID-19 outbreak in China, many countries have approved emergency vaccines. Public-health practitioners and policymakers must understand the predicted populational willingness for vaccines and implement relevant stimulation measures. This study developed a framework for predicting vaccination uptake rate based on traditional clinical data (involving an autoregressive model with autoregressive integrated moving average (ARIMA)) and innovative web search queries (involving a linear regression with ordinary least squares/least absolute shrinkage and selection operator, and machine-learning with boost and random forest). For accuracy, we implemented a stacking regression for the clinical data and web search queries. The stacked regression of ARIMA (1,0,8) for clinical data and boost with support vector machine for web data formed the best model for forecasting vaccination speed in the US. The stacked regression provided a more accurate forecast. These results can help governments and policymakers predict vaccine demand and finance relevant programs.

Submitted 9 December, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

Comments: 29 pages, 6 figures, 8 tables; This article has been accepted for publication in Human Vaccines & Immunotherapeutics, published by Taylor & Francis

arXiv:2107.05853 [pdf, other]

Making Auctions Robust to Aftermarkets

Authors: Moshe Babaioff, Nicole Immorlica, Yingkai Li, Brendan Lucier

Abstract: A prevalent assumption in auction theory is that the auctioneer has full control over the market and that the allocation she dictates is final. In practice, however, agents might be able to resell acquired items in an aftermarket. A prominent example is the market for carbon emission allowances. These allowances are commonly allocated by the government using uniform-price auctions, and firms can typically trade these allowances among themselves in an aftermarket that may not be fully under the auctioneer's control. While the uniform-price auction is approximately efficient in isolation, we show that speculation and resale in aftermarkets might result in a significant welfare loss. Motivated by this issue, we consider three approaches, each ensuring high equilibrium welfare in the combined market. The first approach is to adopt smooth auctions such as discriminatory auctions. This approach is robust to correlated valuations and to participants acquiring information about others' types. However, discriminatory auctions have several downsides, notably that of charging bidders different prices for identical items, resulting in fairness concerns that make the format unpopular. Two other approaches we suggest are either using posted-pricing mechanisms, or using uniform-price auctions with anonymous reserves. We show that when using balanced prices, both these approaches ensure high equilibrium welfare in the combined market. The latter also inherits many of the benefits from uniform-price auctions such as price discovery, and can be introduced with a minor modification to auctions currently in use to sell carbon emission allowances.

Submitted 15 November, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

arXiv:2103.05788 [pdf, ps, other]

Selling Data to an Agent with Endogenous Information

Authors: Yingkai Li

Abstract: We consider a model of a data broker selling information to a single agent to maximize his revenue. The agent has a private valuation of the additional information, and upon receiving the signal from the data broker, the agent can conduct her own experiment to refine her posterior belief on the states with additional costs. To maximize expected revenue, only offering full information in general is suboptimal, and the optimal mechanism may contain a continuum of menu options with partial information to prevent the agent from having incentives to acquire additional information from other sources. However, our main result shows that the additional benefit from price discrimination is limited, i.e., posting a deterministic price for revealing full information obtains at least half of the optimal revenue for arbitrary prior and cost functions.

Submitted 6 August, 2023; v1 submitted 9 March, 2021; originally announced March 2021.

Comments: accepted in EC 2022

arXiv:2103.03980 [pdf, other]

Revenue Maximization for Buyers with Costly Participation

Authors: Yannai A. Gonczarowski, Nicole Immorlica, Yingkai Li, Brendan Lucier

Abstract: We study mechanisms for selling a single item when buyers have private costs for participating in the mechanism. An agent's participation cost can also be interpreted as an outside option value that she must forego to participate. This substantially changes the revenue maximization problem, which becomes non-convex in the presence of participation costs. For multiple buyers, we show how to construct a $(2+ε)$-approximately revenue-optimal mechanism in polynomial time. Our approach makes use of a many-buyers-to-single-buyer reduction, and in the single-buyer case our mechanism improves to an FPTAS. We also bound the menu size and the sample complexity for the optimal single-buyer mechanism. Moreover, we show that posting a single price in the single-buyer case is in fact optimal under the assumption that either (1) the participation cost is independent of the value, and the value distribution has decreasing marginal revenue or monotone hazard rate; or (2) the participation cost is a concave function of the value. When there are multiple buyers, we show that sequential posted pricing guarantees a large fraction of the optimal revenue under similar conditions.

Submitted 5 November, 2023; v1 submitted 5 March, 2021; originally announced March 2021.

Comments: accepted at SODA 2024

arXiv:2012.07238 [pdf, ps, other]

Misspecified Beliefs about Time Lags

Authors: Yingkai Li, Harry Pei

Abstract: We examine the long-term behavior of a Bayesian agent who has a misspecified belief about the time lag between actions and feedback, and learns about the payoff consequences of his actions over time. Misspecified beliefs about time lags result in attribution errors, which have no long-term effect when the agent's action converges, but can lead to arbitrarily large long-term inefficiencies when his action cycles. Our proof uses concentration inequalities to bound the frequency of action switches, which are useful to study learning problems with history dependence. We apply our methods to study a policy choice game between a policy-maker who has a correctly specified belief about the time lag and the public who has a misspecified belief.

Submitted 13 December, 2020; originally announced December 2020.

arXiv:2007.14002 [pdf, ps, other]

Equilibrium Behaviors in Repeated Games

Authors: Yingkai Li, Harry Pei

Abstract: We examine a patient player's behavior when he can build reputations in front of a sequence of myopic opponents. With positive probability, the patient player is a commitment type who plays his Stackelberg action in every period. We characterize the patient player's action frequencies in equilibrium. Our results clarify the extent to which reputations can refine the patient player's behavior and provide new insights to entry deterrence, business transactions, and capital taxation. Our proof makes a methodological contribution by establishing a new concentration inequality.

Submitted 10 February, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

Comments: accepted at Journal of Economic Theory

arXiv:2006.13489 [pdf, other]

doi 10.1080/07350015.2021.1938085

Unified Principal Component Analysis for Sparse and Dense Functional Data under Spatial Dependency

Authors: Haozhe Zhang, Yehua Li

Abstract: We consider spatially dependent functional data collected under a geostatistics setting, where locations are sampled from a spatial point process. The functional response is the sum of a spatially dependent functional effect and a spatially independent functional nugget effect. Observations on each function are made on discrete time points and contaminated with measurement errors. Under the assumption of spatial stationarity and isotropy, we propose a tensor product spline estimator for the spatio-temporal covariance function. When a coregionalization covariance structure is further assumed, we propose a new functional principal component analysis method that borrows information from neighboring functions. The proposed method also generates nonparametric estimators for the spatial covariance functions, which can be used for functional kriging. Under a unified framework for sparse and dense functional data, infill and increasing domain asymptotic paradigms, we develop the asymptotic convergence rates for the proposed estimators. Advantages of the proposed approach are demonstrated through simulation studies and two real data applications representing sparse and dense functional data, respectively.

Submitted 17 June, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

arXiv:2005.07346 [pdf]

Mercury-related health benefits from retrofitting coal-fired power plants in China

Authors: Jiashuo Li, Sili Zhou, Wendong Wei, Jianchuan Qi, Yumeng Li, Bin Chen, Ning Zhang, Dabo Guan, Haoqi Qian, Xiaohui Wu, Jiawen Miao, Long Chen, Sai Liang, Kuishuang Feng

Abstract: China has implemented retrofitting measures in coal-fired power plants (CFPPs) to reduce air pollution through small unit shutdown (SUS), the installation of air pollution control devices (APCDs) and power generation efficiency (PGE) improvement. The reductions in highly toxic Hg emissions and their related health impacts by these measures have not been well studied. To refine mitigation options, we evaluated the health benefits of reduced Hg emissions via retrofitting measures during China's 12th Five-Year Plan by combining plant-level Hg emission inventories with the China Hg Risk Source-Tracking Model. We found that the measures reduced Hg emissions by 23.5 tons (approximately 1/5 of that from CFPPs in 2010), preventing 0.0021 points of per-foetus intelligence quotient (IQ) decrements and 114 deaths from fatal heart attacks. These benefits were dominated by CFPP shutdowns and APCD installations. Provincial health benefits were largely attributable to Hg reductions in other regions. We also demonstrated the necessity of considering human health impacts, rather than just Hg emission reductions, in selecting Hg control devices. This study also suggests that Hg control strategies should consider various factors, such as CFPP locations, population densities and trade-offs between reductions of total Hg (THg) and Hg2+.

Submitted 14 May, 2020; originally announced May 2020.

arXiv:2005.03010 [pdf, other]

Quantifying the Economic Impact of COVID-19 in Mainland China Using Human Mobility Data

Authors: Jizhou Huang, Haifeng Wang, Haoyi Xiong, Miao Fan, An Zhuo, Ying Li, Dejing Dou

Abstract: To contain the pandemic of coronavirus (COVID-19) in Mainland China, the authorities have put in place a series of measures, including quarantines, social distancing, and travel restrictions. While these strategies have effectively dealt with the critical situations of outbreaks, the combination of the pandemic and mobility controls has slowed China's economic growth, resulting in the first quarterly decline of Gross Domestic Product (GDP) since GDP began to be calculated, in 1992. To characterize the potential shrinkage of the domestic economy, from the perspective of mobility, we propose two new economic indicators: the New Venues Created (NVC) and the Volumes of Visits to Venue (V^3), as the complementary measures to domestic investments and consumption activities, using the data of Baidu Maps. The historical records of these two indicators demonstrated strong correlations with the past figures of Chinese GDP, while the status quo has dramatically changed this year, due to the pandemic. We hereby presented a quantitative analysis to project the impact of the pandemic on economies, using the recent trends of NVC and V^3. We found that the most affected sectors would be travel-dependent businesses, such as hotels, educational institutes, and public transportation, while the sectors that are mandatory to human life, such as workplaces, residential areas, restaurants, and shopping sites, have been recovering rapidly. Analysis at the provincial level showed that the self-sufficient and self-sustainable economic regions, with internal supplies, production, and consumption, have recovered faster than those regions relying on global supply chains.

Submitted 6 May, 2020; originally announced May 2020.

Comments: 29 pages, 10 figures

arXiv:2003.00545 [pdf, ps, other]

Simple Mechanisms for Agents with Non-linear Utilities

Authors: Yiding Feng, Jason Hartline, Yingkai Li

Abstract: We show that economic conclusions derived from Bulow and Roberts (1989) for linear utility models approximately extend to non-linear utility models. Specifically, we quantify the extent to which agents with non-linear utilities resemble agents with linear utilities, and we show that the approximation of mechanisms for agents with linear utilities approximately extends to agents with non-linear utilities. We illustrate the framework for the objectives of revenue and welfare on non-linear models that include agents with budget constraints, agents with risk aversion, and agents with endogenous valuations. We derive bounds on how much these models resemble the linear utility model and combine these bounds with well-studied approximation results for linear utility models. We conclude that simple mechanisms are approximately optimal for these non-linear agent models.

Submitted 26 October, 2022; v1 submitted 1 March, 2020; originally announced March 2020.

arXiv:2002.07964 [pdf]

Tourism Demand Forecasting: An Ensemble Deep Learning Approach

Authors: Shaolong Sun, Yanzhao Li, Ju-e Guo, Shouyang Wang

Abstract: The availability of tourism-related big data increases the potential to improve the accuracy of tourism demand forecasting, but presents significant challenges for forecasting, including curse of dimensionality and high model complexity. A novel bagging-based multivariate ensemble deep learning approach integrating stacked autoencoders and kernel-based extreme learning machines (B-SAKE) is proposed to address these challenges in this study. By using historical tourist arrival data, economic variable data and search intensity index (SII) data, we forecast tourist arrivals in Beijing from four countries. The consistent results of multiple schemes suggest that our proposed B-SAKE approach outperforms benchmark models in terms of level accuracy, directional accuracy and even statistical significance. Both bagging and stacked autoencoder can effectively alleviate the challenges brought by tourism big data and improve the forecasting performance of the models. The ensemble deep learning model we propose contributes to tourism forecasting literature and benefits relevant government officials and tourism practitioners.

Submitted 16 January, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

arXiv:1905.11795 [pdf, other]

Credit Scoring by Incorporating Dynamic Networked Information

Authors: Yibei Li, Ximei Wang, Boualem Djehiche, Xiaoming Hu

Abstract: In this paper, the credit scoring problem is studied by incorporating networked information, where the advantages of such incorporation are investigated theoretically in two scenarios. Firstly, a Bayesian optimal filter is proposed to provide risk prediction for lenders assuming that published credit scores are estimated merely from structured financial data. Such prediction can then be used as a monitoring indicator for the risk management in lenders' future decisions. Secondly, a recursive Bayes estimator is further proposed to improve the precision of credit scoring by incorporating the dynamic interaction topology of clients. It is shown that under the proposed evolution framework, the designed estimator has a higher precision than any efficient estimator, and the mean square errors are strictly smaller than the Cramér-Rao lower bound for clients within a certain range of scores. Finally, simulation results for a special case illustrate the feasibility and effectiveness of the proposed algorithms.

Submitted 31 October, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: 19 pages, simulations and discussion added, motivation clarified, references updated

arXiv:1808.03070 [pdf]

Network-based Referral Mechanism in a Crowdfunding-based Marketing Pattern

Authors: Yongli Li, Zhi-Ping Fan, Wei Zhang

Abstract: Crowdfunding is gradually becoming a modern marketing pattern. By noting that the success of crowdfunding depends on network externalities, our research aims to utilize them to provide an applicable referral mechanism in a crowdfunding-based marketing pattern. In the context of network externalities, measuring the value of leading customers is chosen as the key to coping with the research problem by considering that leading customers take a critical stance in forming a referral network. Accordingly, two sequential-move game models (i.e., basic model and extended model) were established to measure the value of leading customers, and a skill of matrix transformation was adopted to solve the model by transforming a complicated multi-sequence game into a simple simultaneous-move game. Based on the defined value of leading customers, a network-based referral mechanism was proposed by exploring exactly how many awards are allocated along the customer sequence to encourage the leading customers' actions of successful recommendation and by demonstrating two general rules of awarding the referrals in our model setting. Moreover, the proposed solution approach helps deepen an understanding of the effect of the leading position, which is meaningful for designing more numerous referral approaches.

Submitted 9 August, 2018; originally announced August 2018.

arXiv:1801.00973 [pdf, ps, other]

A New Wald Test for Hypothesis Testing Based on MCMC outputs

Authors: Yong Li, Xiaobin Liu, Jun Yu, Tao Zeng

Abstract: In this paper, a new and convenient $χ^2$ Wald test based on MCMC outputs is proposed for hypothesis testing. The new statistic can be explained as an MCMC version of the Wald test and has several important advantages that make it very convenient in practical applications. First, it is well-defined under improper prior distributions and avoids the Jeffreys-Lindley paradox. Second, its asymptotic distribution can be proved to follow the $χ^2$ distribution so that the threshold values can be easily calibrated from this distribution. Third, its statistical error can be derived using the Markov chain Monte Carlo (MCMC) approach. Fourth, and most importantly, it is based only on the posterior MCMC random samples drawn from the posterior distribution. Hence, it is only a by-product of the posterior outputs and very easy to compute. In addition, when prior information is available, the finite sample theory is derived for the proposed test statistic. Finally, the usefulness of the test is illustrated with several applications to latent variable models widely used in economics and finance.

Submitted 3 January, 2018; originally announced January 2018.

Comments: Bayesian $χ^2$ test; Decision theory; Wald test; Markov chain Monte Carlo; Latent variable models

Showing 1–48 of 48 results for author: Li, Y