-
Bounded Foresight Equilibrium in Large Dynamic Economies with Heterogeneous Agents and Aggregate Shocks
Authors:
Bilal Islah,
Bar Light
Abstract:
Large dynamic economies with heterogeneous agents and aggregate shocks are central to many important applications, yet their equilibrium analysis remains computationally challenging. This is because the standard solution approach, rational expectations equilibria require agents to predict the evolution of the full cross-sectional distribution of state variables, leading to an extreme curse of dime…
▽ More
Large dynamic economies with heterogeneous agents and aggregate shocks are central to many important applications, yet their equilibrium analysis remains computationally challenging. This is because the standard solution approach, rational expectations equilibria require agents to predict the evolution of the full cross-sectional distribution of state variables, leading to an extreme curse of dimensionality. In this paper, we introduce a novel equilibrium concept, N-Bounded Foresight Equilibrium (N-BFE), and establish its existence under mild conditions. In N-BFE, agents optimize over an infinite horizon but form expectations about key economic variables only for the next N periods. Beyond this horizon, they assume that economic variables remain constant and use a predetermined continuation value. This equilibrium notion reduces computational complexity and draws a direct parallel to lookahead policies in reinforcement learning, where agents make near-term calculations while relying on approximate valuations beyond a computationally feasible horizon. At the same time, it lowers cognitive demands on agents while better aligning with the behavioral literature by incorporating time inconsistency and limited attention, all while preserving desired forward-looking behavior and ensuring that agents still respond to policy changes. Importantly, in N-BFE equilibria, forecast errors arise endogenously. We measure the foresight errors for different foresight horizons and show that foresight significantly influences the variation in endogenous equilibrium variables, distinguishing our findings from traditional risk aversion or precautionary savings channels. This variation arises from a feedback mechanism between individual decision-making and equilibrium variables, where increased foresight induces greater non-stationarity in agents' decisions and, consequently, in economic variables.
△ Less
Submitted 23 February, 2025;
originally announced February 2025.
-
Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications
Authors:
Bar Light
Abstract:
Mean field equilibrium (MFE) has emerged as a computationally tractable solution concept for large dynamic games. However, computing MFE remains challenging due to nonlinearities and the absence of contraction properties, limiting its reliability for counterfactual analysis and comparative statics. This paper focuses on MFE in dynamic models where agents interact through a scalar function of the p…
▽ More
Mean field equilibrium (MFE) has emerged as a computationally tractable solution concept for large dynamic games. However, computing MFE remains challenging due to nonlinearities and the absence of contraction properties, limiting its reliability for counterfactual analysis and comparative statics. This paper focuses on MFE in dynamic models where agents interact through a scalar function of the population distribution, referred to as the scalar interaction function. Such models naturally arise in a wide range of applications involving market dynamics and strategic competition. The main contribution of this paper is to introduce iterative algorithms that leverage the scalar interaction structure and are guaranteed to converge to the MFE under mild assumptions. Leveraging this structure, we also establish an MFE existence result for non-compact state spaces and analytical comparative statics. To the best of our knowledge, these are the first algorithms with global convergence guarantees in such settings. Unlike existing approaches, our algorithms do not rely on monotonicity or contraction properties, significantly broadening their applicability. Furthermore, we provide a model-free algorithm that learns the MFE via simulation and reinforcement learning techniques such as Q-learning and policy gradient methods without requiring prior knowledge of payoff or transition functions. We apply our algorithms to classic models of dynamic competition, such as capacity competition, and to competitive models motivated by online marketplaces, including ridesharing and inventory competition, as well as to social learning models. We show how key market parameters influence equilibrium outcomes through reliable comparative statics in these representative models, providing insights into the design of competitive systems.
△ Less
Submitted 20 June, 2025; v1 submitted 2 February, 2025;
originally announced February 2025.
-
Equilibria under Dynamic Benchmark Consistency in Non-Stationary Multi-Agent Systems
Authors:
Ludovico Crippa,
Yonatan Gur,
Bar Light
Abstract:
We formulate and study a general time-varying multi-agent system where players repeatedly compete under incomplete information. Our work is motivated by scenarios commonly observed in online advertising and retail marketplaces, where agents and platform designers optimize algorithmic decision-making in dynamic competitive settings. In these systems, no-regret algorithms that provide guarantees rel…
▽ More
We formulate and study a general time-varying multi-agent system where players repeatedly compete under incomplete information. Our work is motivated by scenarios commonly observed in online advertising and retail marketplaces, where agents and platform designers optimize algorithmic decision-making in dynamic competitive settings. In these systems, no-regret algorithms that provide guarantees relative to \emph{static} benchmarks can perform poorly and the distributions of play that emerge from their interaction do not correspond anymore to static solution concepts such as coarse correlated equilibria. Instead, we analyze the interaction of \textit{dynamic benchmark} consistent policies that have performance guarantees relative to \emph{dynamic} sequences of actions, and through a novel \textit{tracking error} notion we delineate when their empirical joint distribution of play can approximate an evolving sequence of static equilibria. In systems that change sufficiently slowly (sub-linearly in the horizon length), we show that the resulting distributions of play approximate the sequence of coarse correlated equilibria, and apply this result to establish improved welfare bounds for smooth games. On a similar vein, we formulate internal dynamic benchmark consistent policies and establish that they approximate sequences of correlated equilibria. Our findings therefore suggest that in a broad range of multi-agent systems where non-stationarity is prevalent, algorithms designed to compete with dynamic benchmarks can improve both individual and welfare guarantees, and their emerging dynamics approximate a sequence of static equilibrium outcomes.
△ Less
Submitted 26 May, 2025; v1 submitted 21 January, 2025;
originally announced January 2025.
-
A Course in Dynamic Optimization
Authors:
Bar Light
Abstract:
These lecture notes are derived from a graduate-level course in dynamic optimization, offering an introduction to techniques and models extensively used in management science, economics, operations research, engineering, and computer science. The course emphasizes the theoretical underpinnings of discrete-time dynamic programming models and advanced algorithmic strategies for solving these models.…
▽ More
These lecture notes are derived from a graduate-level course in dynamic optimization, offering an introduction to techniques and models extensively used in management science, economics, operations research, engineering, and computer science. The course emphasizes the theoretical underpinnings of discrete-time dynamic programming models and advanced algorithmic strategies for solving these models. Unlike typical treatments, it provides a proof for the principle of optimality for upper semi-continuous dynamic programming, a middle ground between the simpler countable state space case \cite{bertsekas2012dynamic}, and the involved universally measurable case \cite{bertsekas1996stochastic}. This approach is sufficiently rigorous to include important examples such as dynamic pricing, consumption-savings, and inventory management models. The course also delves into the properties of value and policy functions, leveraging classical results \cite{topkis1998supermodularity} and recent developments. Additionally, it offers an introduction to reinforcement learning, including a formal proof of the convergence of Q-learning algorithms. Furthermore, the notes delve into policy gradient methods for the average reward case, presenting a convergence result for the tabular case in this context. This result is simple and similar to the discounted case but appears to be new.
△ Less
Submitted 10 October, 2024; v1 submitted 6 August, 2024;
originally announced August 2024.
-
A Note on the Stability of Monotone Markov Chains
Authors:
Bar Light
Abstract:
This note studies monotone Markov chains, a subclass of Markov chains with extensive applications in operations research and economics. While the properties that ensure the global stability of these chains are well studied, their establishment often relies on the fulfillment of a certain splitting condition. We address the challenges of verifying the splitting condition by introducing simple, appl…
▽ More
This note studies monotone Markov chains, a subclass of Markov chains with extensive applications in operations research and economics. While the properties that ensure the global stability of these chains are well studied, their establishment often relies on the fulfillment of a certain splitting condition. We address the challenges of verifying the splitting condition by introducing simple, applicable conditions that ensure global stability. The simplicity of these conditions is demonstrated through various examples including autoregressive processes, portfolio allocation problems and resource allocation dynamics.
△ Less
Submitted 19 September, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
Equilibria in Repeated Games under No-Regret with Dynamic Benchmarks
Authors:
Ludovico Crippa,
Yonatan Gur,
Bar Light
Abstract:
In repeated games, strategies are often evaluated by their ability to guarantee the performance of the single best action that is selected in hindsight, a property referred to as \emph{Hannan consistency}, or \emph{no-regret}. However, the effectiveness of the single best action as a yardstick to evaluate strategies is limited, as any static action may perform poorly in common dynamic settings. Ou…
▽ More
In repeated games, strategies are often evaluated by their ability to guarantee the performance of the single best action that is selected in hindsight, a property referred to as \emph{Hannan consistency}, or \emph{no-regret}. However, the effectiveness of the single best action as a yardstick to evaluate strategies is limited, as any static action may perform poorly in common dynamic settings. Our work therefore turns to a more ambitious notion of \emph{dynamic benchmark consistency}, which guarantees the performance of the best \emph{dynamic} sequence of actions, selected in hindsight subject to a constraint on the allowable number of action changes. Our main result establishes that for any joint empirical distribution of play that may arise when all players deploy no-regret strategies, there exist dynamic benchmark consistent strategies such that if all players deploy these strategies the same empirical distribution emerges when the horizon is large enough. This result demonstrates that although dynamic benchmark consistent strategies have a different algorithmic structure and provide significantly enhanced individual assurances, they lead to the same equilibrium set as no-regret strategies. Moreover, the proof of our main result uncovers the capacity of independent algorithms with strong individual guarantees to foster a strong form of coordination.
△ Less
Submitted 21 January, 2025; v1 submitted 6 December, 2022;
originally announced December 2022.
-
Social Learning under Platform Influence: Consensus and Persistent Disagreement
Authors:
Ozan Candogan,
Nicole Immorlica,
Bar Light,
Jerry Anunrojwong
Abstract:
Individuals increasingly rely on social networking platforms to form opinions. However, these platforms typically aim to maximize engagement, which may not align with social good. In this paper, we introduce an opinion dynamics model where agents are connected in a social network, and update their opinions based on their neighbors' opinions and on the content shown to them by the platform. We focu…
▽ More
Individuals increasingly rely on social networking platforms to form opinions. However, these platforms typically aim to maximize engagement, which may not align with social good. In this paper, we introduce an opinion dynamics model where agents are connected in a social network, and update their opinions based on their neighbors' opinions and on the content shown to them by the platform. We focus on a stochastic block model with two blocks, where the initial opinions of the individuals in different blocks are different. We prove that for large and dense enough networks the trajectory of opinion dynamics in such networks can be approximated well by a simple two-agent system. The latter admits tractable analytical analysis, which we leverage to provide interesting insights into the platform's impact on the social learning outcome in our original two-block model. Specifically, by using our approximation result, we show that agents' opinions approximately converge to some limiting opinion, which is either: consensus, where all agents agree, or persistent disagreement, where agents' opinions differ. We find that when the platform is weak and there is a high number of connections between agents with different initial opinions, a consensus equilibrium is likely. In this case, even if a persistent disagreement equilibrium arises, the polarization in this equilibrium, i.e., the degree of disagreement, is low. When the platform is strong, a persistent disagreement equilibrium is likely and the equilibrium polarization is high. A moderate platform typically leads to a persistent disagreement equilibrium with moderate polarization. We analyze the effect of initial polarization on consensus and explore various extensions including a three block stochastic model and a correlation between initial opinions and agents' connection probabilities.
△ Less
Submitted 5 June, 2025; v1 submitted 24 February, 2022;
originally announced February 2022.
-
New Jensen-type inequalities and their applications
Authors:
Bar Light
Abstract:
Convex analysis is fundamental to proving inequalities that have a wide variety of applications in economics and mathematics. In this paper we provide Jensen-type inequalities for functions that are, intuitively, "very" convex. These inequalities are simple to apply and can be used to generalize and extend previous results or to derive new results. We apply our inequalities to quantify the notion…
▽ More
Convex analysis is fundamental to proving inequalities that have a wide variety of applications in economics and mathematics. In this paper we provide Jensen-type inequalities for functions that are, intuitively, "very" convex. These inequalities are simple to apply and can be used to generalize and extend previous results or to derive new results. We apply our inequalities to quantify the notion "more risk averse" provided in \cite{pratt1978risk}. We also apply our results in other applications from different fields, including risk measures, Poisson approximation, moment generating functions, log-likelihood functions, and Hermite-Hadamard type inequalities.
△ Less
Submitted 7 August, 2021; v1 submitted 17 July, 2020;
originally announced July 2020.
-
Quality Selection in Two-Sided Markets: A Constrained Price Discrimination Approach
Authors:
Bar Light,
Ramesh Johari,
Gabriel Weintraub
Abstract:
Online platforms collect rich information about participants and then share some of this information back with them to improve market outcomes. In this paper we study the following information disclosure problem in two-sided markets: If a platform wants to maximize revenue, which sellers should the platform allow to participate, and how much of its available information about participating sellers…
▽ More
Online platforms collect rich information about participants and then share some of this information back with them to improve market outcomes. In this paper we study the following information disclosure problem in two-sided markets: If a platform wants to maximize revenue, which sellers should the platform allow to participate, and how much of its available information about participating sellers' quality should the platform share with buyers? We study this information disclosure problem in the context of two distinct two-sided market models: one in which the platform chooses prices and the sellers choose quantities (similar to ride-sharing), and one in which the sellers choose prices (similar to e-commerce). Our main results provide conditions under which simple information structures commonly observed in practice, such as banning certain sellers from the platform while not distinguishing between participating sellers, maximize the platform's revenue. The platform's information disclosure problem naturally transforms into a constrained price discrimination problem where the constraints are determined by the equilibrium outcomes of the specific two-sided market model being studied. We analyze this constrained price discrimination problem to obtain our structural results.
△ Less
Submitted 31 August, 2023; v1 submitted 4 December, 2019;
originally announced December 2019.
-
The Family of Alpha,[a,b] Stochastic Orders: Risk vs. Expected Value
Authors:
Bar Light,
Andres Perlroth
Abstract:
In this paper we provide a novel family of stochastic orders that generalizes second order stochastic dominance, which we call the $α,[a,b]$-concave stochastic orders.
These stochastic orders are generated by a novel set of "very" concave functions where $α$ parameterizes the degree of concavity. The $α,[a,b]$-concave stochastic orders allow us to derive novel comparative statics results for imp…
▽ More
In this paper we provide a novel family of stochastic orders that generalizes second order stochastic dominance, which we call the $α,[a,b]$-concave stochastic orders.
These stochastic orders are generated by a novel set of "very" concave functions where $α$ parameterizes the degree of concavity. The $α,[a,b]$-concave stochastic orders allow us to derive novel comparative statics results for important applications in economics that cannot be derived using previous stochastic orders. In particular, our comparative statics results are useful when an increase in a lottery's riskiness changes the agent's optimal action in the opposite direction to an increase in the lottery's expected value. For this kind of situation, we provide a tool to determine which of these two forces dominates -- riskiness or expected value. We apply our results in consumption-savings problems, self-protection problems, and in a Bayesian game.
△ Less
Submitted 27 April, 2021; v1 submitted 18 August, 2019;
originally announced August 2019.
-
General equilibrium in a heterogeneous-agent incomplete-market economy with many consumption goods and a risk-free bond
Authors:
Bar Light
Abstract:
We study a pure-exchange incomplete-market economy with heterogeneous agents. In each period, the agents choose how much to save (i.e., invest in a risk-free bond), how much to consume, and which bundle of goods to consume while their endowments are fluctuating. We focus on a competitive stationary equilibrium (CSE) in which the wealth distribution is invariant, the agents maximize their expected…
▽ More
We study a pure-exchange incomplete-market economy with heterogeneous agents. In each period, the agents choose how much to save (i.e., invest in a risk-free bond), how much to consume, and which bundle of goods to consume while their endowments are fluctuating. We focus on a competitive stationary equilibrium (CSE) in which the wealth distribution is invariant, the agents maximize their expected discounted utility, and both the prices of consumption goods and the interest rate are market-clearing. Our main contribution is to extend some general equilibrium results to an incomplete-market Bewley-type economy with many consumption goods. Under mild conditions on the agents' preferences, we show that the aggregate demand for goods depends only on their relative prices and that the aggregate demand for savings is homogeneous of degree in prices, and we prove the existence of a CSE. When the agents' preferences can be represented by a CES (constant elasticity of substitution) utility function with an elasticity of substitution that is higher than or equal to one, we prove that the CSE is unique. Under the same preferences, we show that a higher inequality of endowments does not change the equilibrium prices of goods, and decreases the equilibrium interest rate. Our results shed light on the impact of market incompleteness on the properties of general equilibrium models.
△ Less
Submitted 22 March, 2021; v1 submitted 16 June, 2019;
originally announced June 2019.
-
Stochastic Comparative Statics in Markov Decision Processes
Authors:
Bar Light
Abstract:
In multi-period stochastic optimization problems, the future optimal decision is a random variable whose distribution depends on the parameters of the optimization problem. We analyze how the expected value of this random variable changes as a function of the dynamic optimization parameters in the context of Markov decision processes. We call this analysis \emph{stochastic comparative statics}. We…
▽ More
In multi-period stochastic optimization problems, the future optimal decision is a random variable whose distribution depends on the parameters of the optimization problem. We analyze how the expected value of this random variable changes as a function of the dynamic optimization parameters in the context of Markov decision processes. We call this analysis \emph{stochastic comparative statics}. We derive both \emph{comparative statics} results and \emph{stochastic comparative statics} results showing how the current and future optimal decisions change in response to changes in the single-period payoff function, the discount factor, the initial state of the system, and the transition probability function. We apply our results to various models from the economics and operations research literature, including investment theory, dynamic pricing models, controlled random walks, and comparisons of stationary distributions.
△ Less
Submitted 25 January, 2020; v1 submitted 10 April, 2019;
originally announced April 2019.
-
Mean Field Equilibrium: Uniqueness, Existence, and Comparative Statics
Authors:
Bar Light,
Gabriel Weintraub
Abstract:
The standard solution concept for stochastic games is Markov perfect equilibrium (MPE); however, its computation becomes intractable as the number of players increases. Instead, we consider mean field equilibrium (MFE) that has been popularized in the recent literature. MFE takes advantage of averaging effects in models with a large number of players. We make three main contributions. First, our m…
▽ More
The standard solution concept for stochastic games is Markov perfect equilibrium (MPE); however, its computation becomes intractable as the number of players increases. Instead, we consider mean field equilibrium (MFE) that has been popularized in the recent literature. MFE takes advantage of averaging effects in models with a large number of players. We make three main contributions. First, our main result provides conditions that ensure the uniqueness of an MFE. We believe this uniqueness result is the first of its nature in the class of models we study. Second, we generalize previous MFE existence results. Third, we provide general comparative statics results. We apply our results to dynamic oligopoly models and to heterogeneous agent macroeconomic models commonly used in previous work in economics and operations.
△ Less
Submitted 4 June, 2020; v1 submitted 6 March, 2019;
originally announced March 2019.