-
Approximations of expectations under infinite product measures
Authors:
Galit Ashkenazi-Golan,
János Flesch,
Arkadi Predtetchinski,
Eilon Solan
Abstract:
We are given a bounded Borel-measurable real-valued function on a product of countably many Polish spaces, and a product probability measure. We are interested in points in the product space that can be used to approximate the expected value of this function. We define two notions. A point is called a weak $ε$-approximation, where $ε\geq 0$, if the Dirac measure on this point, except in finitely m…
▽ More
We are given a bounded Borel-measurable real-valued function on a product of countably many Polish spaces, and a product probability measure. We are interested in points in the product space that can be used to approximate the expected value of this function. We define two notions. A point is called a weak $ε$-approximation, where $ε\geq 0$, if the Dirac measure on this point, except in finitely many coordinates where another measure can be taken, gives an expected value that is $ε$-close to the original expected value. A point is called a strong $ε$-approximation if the same holds under the restriction that in those finitely many coordinates the measure is equal to the original one. We prove that both the set of weak 0-approximation points and the set of strong $ε$-approximation points, for any $ε>0$, have measure 1 under the original measure. Finally, we provide two applications: (i) in Game Theory on the minmax guarantee levels of the players in games with infinitely many players, and (ii) in Decision Theory on the set of feasible expected payoffs in infinite duration problems.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Zero-one Laws for a Control Problem with Random Action Sets
Authors:
János Flesch,
Arkadi Predtetchinski,
William D Sudderth,
Xavier Venel
Abstract:
In many control problems there is only limited information about the actions that will be available at future stages. We introduce a framework where the Controller chooses actions $a_{0}, a_{1}, \ldots$, one at a time. Her goal is to maximize the probability that the infinite sequence $(a_{0}, a_{1}, \ldots)$ is an element of a given subset $G$ of $\mathbb{N}^{\mathbb{N}}$. The set $G$, called the…
▽ More
In many control problems there is only limited information about the actions that will be available at future stages. We introduce a framework where the Controller chooses actions $a_{0}, a_{1}, \ldots$, one at a time. Her goal is to maximize the probability that the infinite sequence $(a_{0}, a_{1}, \ldots)$ is an element of a given subset $G$ of $\mathbb{N}^{\mathbb{N}}$. The set $G$, called the goal, is assumed to be a Borel tail set. The Controller's choices are restricted: having taken a sequence $h_{t} = (a_{0}, \ldots, a_{t-1})$ of actions prior to stage $t \in \mathbb{N}$, she must choose an action $a_{t}$ at stage $t$ from a non-empty, finite subset $A(h_{t})$ of $\mathbb{N}$. The set $A(h_{t})$ is chosen from a distribution $p_{t}$, independently over all $t \in \mathbb{N}$ and all $h_{t} \in \mathbb{N}^{t}$. We consider several information structures defined by how far ahead into the future the Controller knows what actions will be available.
In the special case where all the action sets are singletons (and thus the Controller is a dummy), Kolmogorov's 0-1 law says that the probability for the goal to be reached is 0 or 1. We construct a number of counterexamples to show that in general the value of the control problem can be strictly between 0 and 1, and derive several sufficient conditions for the 0-1 ``law" to hold.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Stochastic Games with General Payoff Functions
Authors:
János Flesch,
Eilon Solan
Abstract:
We consider multiplayer stochastic games in which the payoff of each player is a bounded and Borel-measurable function of the infinite play. By using a generalization of the technique of Martin (1998) and Maitra and Sudderth (1998), we show four different existence results. In each stochastic game, it holds for every $ε>0$ that (i) each player has a strategy that guarantees in each subgame that th…
▽ More
We consider multiplayer stochastic games in which the payoff of each player is a bounded and Borel-measurable function of the infinite play. By using a generalization of the technique of Martin (1998) and Maitra and Sudderth (1998), we show four different existence results. In each stochastic game, it holds for every $ε>0$ that (i) each player has a strategy that guarantees in each subgame that this player's payoff is at least her maxmin value up to $ε$, (ii) there exists a strategy profile under which in each subgame each player's payoff is at least her minmax value up to $ε$, (iii) the game admits an extensive-form correlated $ε$-equilibrium, and (iv) there exists a subgame that admits an $ε$-equilibrium.
△ Less
Submitted 25 August, 2022;
originally announced August 2022.
-
Absorbing Blackwell Games
Authors:
Galit Ashkenazi-Golan,
János Flesch,
Eilon Solan
Abstract:
It was shown in Flesch and Solan (2022) with a rather involved proof that all two-player stochastic games with finite state and action spaces and shift-invariant payoffs admit an $ε$-equilibrium, for every $ε>0$. Their proof also holds for two-player absorbing games with tail-measurable payoffs. In this paper we provide a simpler proof for the existence of $ε$-equilibrium in two-player absorbing g…
▽ More
It was shown in Flesch and Solan (2022) with a rather involved proof that all two-player stochastic games with finite state and action spaces and shift-invariant payoffs admit an $ε$-equilibrium, for every $ε>0$. Their proof also holds for two-player absorbing games with tail-measurable payoffs. In this paper we provide a simpler proof for the existence of $ε$-equilibrium in two-player absorbing games with tail-measurable payoffs, by combining recent mathematical tools for such payoff functions with classical tools for absorbing games.
△ Less
Submitted 24 August, 2022;
originally announced August 2022.
-
Repeated Games with Tail-Measurable Payoffs
Authors:
János Flesch,
Eilon Solan
Abstract:
We study multiplayer Blackwell games, which are repeated games where the payoff of each player is a bounded and Borel-measurable function of the infinite stream of actions played by the players during the game. These games are an extension of the two-player perfect-information games studied by David Gale and Frank Stewart (1953). Recently, various new ideas have been discovered to study Blackwell…
▽ More
We study multiplayer Blackwell games, which are repeated games where the payoff of each player is a bounded and Borel-measurable function of the infinite stream of actions played by the players during the game. These games are an extension of the two-player perfect-information games studied by David Gale and Frank Stewart (1953). Recently, various new ideas have been discovered to study Blackwell games. In this paper, we give an overview of these ideas by proving, in four different ways, that Blackwell games with a finite number of players, finite action sets, and tail-measurable payoffs admit an $\varepsilon$-equilibrium, for all $\varepsilon>0$.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
Equilibrium in Two-Player Stochastic Games with Shift-Invariant Payoffs
Authors:
János Flesch,
Eilon Solan
Abstract:
We show that every two-player stochastic game with finite state and action sets and bounded, Borel-measurable, and shift-invariant payoffs, admits an $\ep$-equilibrium for all $\varepsilon>0$.
We show that every two-player stochastic game with finite state and action sets and bounded, Borel-measurable, and shift-invariant payoffs, admits an $\ep$-equilibrium for all $\varepsilon>0$.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Regularity of the minmax value and equilibria in multiplayer Blackwell games
Authors:
Galit Ashkenazi-Golan,
János Flesch,
Arkadi Predtetchinski,
Eilon Solan
Abstract:
A real-valued function $\varphi$ that is defined over all Borel sets of a topological space is \emph{regular} if for every Borel set $W$, $\varphi(W)$ is the supremum of $\varphi(C)$, over all closed sets $C$ that are contained in $W$, and the infimum of $\varphi(O)$, over all open sets $O$ that contain $W$.
We study Blackwell games with finitely many players. We show that when each player has a…
▽ More
A real-valued function $\varphi$ that is defined over all Borel sets of a topological space is \emph{regular} if for every Borel set $W$, $\varphi(W)$ is the supremum of $\varphi(C)$, over all closed sets $C$ that are contained in $W$, and the infimum of $\varphi(O)$, over all open sets $O$ that contain $W$.
We study Blackwell games with finitely many players. We show that when each player has a countable set of actions and the objective of a certain player is represented by a Borel winning set, that player's minmax value is regular.
We then use the regularity of the minmax value to establish the existence of $\varepsilon$-equilibria in two distinct classes of Blackwell games. One is the class of $n$-player Blackwell games where each player has a finite action space and an analytic winning set, and the sum of the minmax values over the players exceeds $n-1$. The other class is that of Blackwell games with bounded upper semi-analytic payoff functions, history-independent finite action spaces, and history-independent minmax values.
For the latter class, we obtain a characterization of the set of equilibrium payoffs.
△ Less
Submitted 11 January, 2022;
originally announced January 2022.
-
Equilibria in Repeated Games with Countably Many Players and Tail-Measurable Payoffs
Authors:
Galit Ashkenazi-Golan,
Janos Flesch,
Arkadi Predtetchinski,
Eilon Solan
Abstract:
We prove that every repeated game with countably many players, finite action sets, and tail-measurable payoffs admits an $ε$-equilibrium, for every $ε> 0$.
We prove that every repeated game with countably many players, finite action sets, and tail-measurable payoffs admits an $ε$-equilibrium, for every $ε> 0$.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
Random perfect information games
Authors:
János Flesch,
Arkadi Predtetchinski,
Ville Suomala
Abstract:
The paper proposes a natural measure space of zero-sum perfect information games with upper semicontinuous payoffs. Each game is specified by the game tree, and by the assignment of the active player and of the capacity to each node of the tree. The payoff in a game is defined as the infimum of the capacity over the nodes that have been visited during the play. The active player, the number of chi…
▽ More
The paper proposes a natural measure space of zero-sum perfect information games with upper semicontinuous payoffs. Each game is specified by the game tree, and by the assignment of the active player and of the capacity to each node of the tree. The payoff in a game is defined as the infimum of the capacity over the nodes that have been visited during the play. The active player, the number of children, and the capacity are drawn from a given joint distribution independently across the nodes. We characterize the cumulative distribution function of the value $v$ using the fixed points of the so-called value generating function. The characterization leads to a necessary and sufficient condition for the event $v \geq k$ to occur with positive probability. We also study probabilistic properties of the set of Player I's $k$-optimal strategies and the corresponding plays.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
Games characterizing limsup functions and Baire class 1 functions
Authors:
Márton Elekes,
János Flesch,
Viktor Kiss,
Donát Nagy,
Márk Poór,
Arkadi Predtetchinski
Abstract:
We consider a real-valued function $f$ defined on the set of infinite branches $X$ of a countably branching pruned tree $T$. The function $f$ is said to be a \textit{limsup function} if there is a function $u \colon T \to \mathbb{R}$ such that $f(x) = \limsup_{t \to \infty} u(x_{0},\dots,x_{t})$ for each $x \in X$. We study a game characterization of limsup functions, as well as a novel game chara…
▽ More
We consider a real-valued function $f$ defined on the set of infinite branches $X$ of a countably branching pruned tree $T$. The function $f$ is said to be a \textit{limsup function} if there is a function $u \colon T \to \mathbb{R}$ such that $f(x) = \limsup_{t \to \infty} u(x_{0},\dots,x_{t})$ for each $x \in X$. We study a game characterization of limsup functions, as well as a novel game characterization of functions of Baire class 1.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
A competitive search game with a moving target
Authors:
Benoit Duvocelle,
János Flesch,
Mathias Staudigl,
Dries Vermeulen
Abstract:
We introduce a discrete-time search game, in which two players compete to find an object first. The object moves according to a time-varying Markov chain on finitely many states. The players know the Markov chain and the initial probability distribution of the object, but do not observe the current state of the object. The players are active in turns. The active player chooses a state, and this ch…
▽ More
We introduce a discrete-time search game, in which two players compete to find an object first. The object moves according to a time-varying Markov chain on finitely many states. The players know the Markov chain and the initial probability distribution of the object, but do not observe the current state of the object. The players are active in turns. The active player chooses a state, and this choice is observed by the other player. If the object is in the chosen state, this player wins and the game ends. Otherwise, the object moves according to the Markov chain and the game continues at the next period.
We show that this game admits a value, and for any error-term $\veps>0$, each player has a pure (subgame-perfect) $\veps$-optimal strategy. Interestingly, a 0-optimal strategy does not always exist. The $\veps$-optimal strategies are robust in the sense that they are $2\veps$-optimal on all finite but sufficiently long horizons, and also $2\veps$-optimal in the discounted version of the game provided that the discount factor is close to 1. We derive results on the analytic and structural properties of the value and the $\veps$-optimal strategies. Moreover, we examine the performance of the finite truncation strategies, which are easy to calculate and to implement. We devote special attention to the important time-homogeneous case, where additional results hold.
△ Less
Submitted 27 August, 2020;
originally announced August 2020.
-
Search for a moving target in a competitive environment
Authors:
Benoit Duvocelle,
János Flesch,
Hui Min Shi,
Dries Vermeulen
Abstract:
We consider a discrete-time dynamic search game in which a number of players compete to find an invisible object that is moving according to a time-varying Markov chain. We examine the subgame perfect equilibria of these games. The main result of the paper is that the set of subgame perfect equilibria is exactly the set of greedy strategy profiles, i.e. those strategy profiles in which the players…
▽ More
We consider a discrete-time dynamic search game in which a number of players compete to find an invisible object that is moving according to a time-varying Markov chain. We examine the subgame perfect equilibria of these games. The main result of the paper is that the set of subgame perfect equilibria is exactly the set of greedy strategy profiles, i.e. those strategy profiles in which the players always choose an action that maximizes their probability of immediately finding the object. We discuss various variations and extensions of the model.
△ Less
Submitted 25 August, 2020; v1 submitted 21 August, 2020;
originally announced August 2020.
-
Reachability and safety objectives in Markov decision processes on long but finite horizons
Authors:
Galit Ashkenazi-Golan,
János Flesch,
Arkadi Predtetchinski,
Eilon Solan
Abstract:
We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. Formally, strategy $σ$ overtakes another strategy $σ'$, if the probability of reaching the target state within horizon $t$ is larger u…
▽ More
We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. Formally, strategy $σ$ overtakes another strategy $σ'$, if the probability of reaching the target state within horizon $t$ is larger under $σ$ than under $σ'$, for all sufficiently large $t\in\NN$. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy nor by any stationary strategy, under some condition on the transition structure and respectively under genericity. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist. We provide sufficient conditions for its existence.
Next we consider safety objective: the decision maker's goal is to avoid a specific state with the highest possible probability. We argue that the results proven for reachability objective extend to this model. We finally discuss extensions of our results to two-player zero-sum perfect information games.
△ Less
Submitted 13 November, 2019;
originally announced November 2019.
-
The doubling metric and doubling measures
Authors:
János Flesch,
Arkadi Predtetchinski,
Ville Suomala
Abstract:
We introduce the so--called doubling metric on the collection of non--empty bounded open subsets of a metric space. Given a subset $U$ of a metric space $X$, the predecessor $U_{*}$ of $U$ is defined by doubling the radii of all open balls contained inside $U$, and taking their union. If $U$ is open, the predecessor of $U$ is an open set containing $U$. The directed doubling distance between $U$ a…
▽ More
We introduce the so--called doubling metric on the collection of non--empty bounded open subsets of a metric space. Given a subset $U$ of a metric space $X$, the predecessor $U_{*}$ of $U$ is defined by doubling the radii of all open balls contained inside $U$, and taking their union. If $U$ is open, the predecessor of $U$ is an open set containing $U$. The directed doubling distance between $U$ and another subset $V$ is the number of times that the predecessor operation needs to be applied to $U$ to obtain a set that contains $V$. Finally, the doubling distance between $U$ and $V$ is the maximum of the directed distance between $U$ and $V$ and the directed distance between $V$ and $U$.
△ Less
Submitted 2 March, 2020; v1 submitted 20 August, 2019;
originally announced August 2019.