-
Thresholds for sensitive optimality and Blackwell optimality in stochastic games
Authors:
Stéphane Gaubert,
Julien Grand-Clément,
Ricardo D. Katz
Abstract:
We investigate refinements of the mean-payoff criterion in two-player zero-sum perfect-information stochastic games. A strategy is Blackwell optimal if it is optimal in the discounted game for all discount factors sufficiently close to $1$. The notion of $d$-sensitive optimality interpolates between mean-payoff optimality (corresponding to the case $d=-1$) and Blackwell optimality ($d=+\infty$). The Blackwell threshold $α_{\sf Bw} \in [0,1[$ is the discount factor above which all optimal strategies in the discounted game are guaranteed to be Blackwell optimal. The $d$-sensitive threshold $α_{\sf d} \in [0,1[$ is defined analogously. Bounding $α_{\sf Bw}$ and $α_{\sf d}$ are fundamental problems in algorithmic game theory, since these thresholds control the complexity for computing Blackwell and $d$-sensitive optimal strategies, by reduction to discounted games which can be solved in $O\left((1-α)^{-1}\right)$ iterations. We provide the first bounds on the $d$-sensitive threshold $α_{\sf d}$ beyond the case $d=-1$, and we establish improved bounds for the Blackwell threshold $α_{\sf Bw}$. This is achieved by leveraging separation bounds on algebraic numbers, relying on Lagrange bounds and more advanced techniques based on Mahler measures and multiplicity theorems.
Submitted 23 June, 2025;
originally announced June 2025.
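The reduction to discounted games mentioned in the abstract above can be illustrated with plain value iteration. The following is a minimal sketch, not the authors' algorithm: the game encoding, the names `shapley_operator`, `discounted_value` and `TOY_GAME`, and the stopping rule are all assumptions made for illustration. It only shows that the iteration count grows as the discount factor α approaches 1.

```python
def shapley_operator(v, alpha, game):
    """One step of discounted value iteration for a turn-based zero-sum game.

    game[s] = (owner, moves), where owner is "max" or "min" and
    moves is a list of (reward, next_state) pairs (hypothetical encoding).
    """
    new_v = []
    for owner, moves in game:
        outcomes = [reward + alpha * v[target] for reward, target in moves]
        new_v.append(max(outcomes) if owner == "max" else min(outcomes))
    return new_v


def discounted_value(game, alpha, tol=1e-10):
    """Iterate the operator until successive iterates differ by less than tol."""
    v = [0.0] * len(game)
    iterations = 0
    while True:
        w = shapley_operator(v, alpha, game)
        iterations += 1
        if max(abs(a - b) for a, b in zip(v, w)) < tol:
            return w, iterations
        v = w


# Toy deterministic game (an assumption, not taken from the paper):
# state 0 belongs to Max, state 1 belongs to Min.
TOY_GAME = [
    ("max", [(1.0, 0), (0.0, 1)]),
    ("min", [(0.0, 1), (2.0, 0)]),
]

if __name__ == "__main__":
    for alpha in (0.9, 0.99):
        value, count = discounted_value(TOY_GAME, alpha)
        print(alpha, value, count)
```

In this toy game Max stays in state 0 and collects reward 1 forever, so the (unnormalized) discounted value of state 0 is $1/(1-α)$, and the number of iterations needed to reach a fixed tolerance increases with α.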
-
Directed Metric Structures arising in Large Language Models
Authors:
Stéphane Gaubert,
Yiannis Vlassopoulos
Abstract:
Large Language Models are transformer neural networks which are trained to produce a probability distribution on the possible next words to given texts in a corpus, in such a way that the most likely word predicted is the actual word in the training text. In this paper we identify the mathematical structure defined by such conditional probability distributions of text extensions. Changing the viewpoint from probabilities to -log probabilities, we observe that the subtext order is completely encoded in a metric structure defined on the space of texts $\mathcal{L}$ by -log probabilities. We then construct a metric polyhedron $P(\mathcal{L})$ and an isometric embedding (called Yoneda embedding) of $\mathcal{L}$ into $P(\mathcal{L})$ such that texts map to generators of certain special extremal rays. We explain that $P(\mathcal{L})$ is a $(\min,+)$ (tropical) linear span of these extremal ray generators. The generators also satisfy a system of $(\min,+)$ linear equations. We then show that $P(\mathcal{L})$ is compatible with adding more text and from this we derive an approximation of a text vector as a Boltzmann weighted linear combination of the vectors for words in that text. We then prove a duality theorem showing that text extensions and text restrictions give isometric polyhedra (even though they look a priori very different). Moreover we prove that $P(\mathcal{L})$ is the lattice closure of (a version of) the so-called Isbell completion of $\mathcal{L}$, which turns out to be the $(\max,+)$ span of the text extremal ray generators. All constructions have interpretations in category theory but we don't use category theory explicitly. The categorical interpretations are briefly explained in an appendix. In the final appendix we describe how the syntax to semantics problem could fit in a general well known mathematical duality.
Submitted 20 May, 2024;
originally announced May 2024.
-
Solving irreducible stochastic mean-payoff games and entropy games by relative Krasnoselskii-Mann iteration
Authors:
Marianne Akian,
Stéphane Gaubert,
Ulysse Naepels,
Basile Terver
Abstract:
We analyse an algorithm solving stochastic mean-payoff games, combining the ideas of relative value iteration and of Krasnoselskii-Mann damping. We derive parameterized complexity bounds for several classes of games satisfying irreducibility conditions. We show in particular that an $ε$-approximation of the value of an irreducible concurrent stochastic game can be computed in a number of iterations in $O(|\logε|)$ where the constant in the $O(\cdot)$ is explicit, depending on the smallest non-zero transition probabilities. This should be compared with a bound in $O(|ε|^{-1}|\log(ε)|)$ obtained by Chatterjee and Ibsen-Jensen (ICALP 2014) for the same class of games, and to a $O(|ε|^{-1})$ bound by Allamigeon, Gaubert, Katz and Skomra (ICALP 2022) for turn-based games. We also establish parameterized complexity bounds for entropy games, a class of matrix multiplication games introduced by Asarin, Cervelle, Degorre, Dima, Horn and Kozyakin. We derive these results by methods of variational analysis, establishing contraction properties of the relative Krasnoselskii-Mann iteration with respect to Hilbert's semi-norm.
Submitted 3 May, 2023;
originally announced May 2023.
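The combination of relative value iteration and Krasnoselskii-Mann damping described in the abstract above can be sketched as follows. This is one possible form of such an iteration, written under assumptions: the exact update rule and normalization used in the paper may differ, and the game encoding, the names `shapley`, `relative_km` and `TOY_GAME`, and the damping parameter `theta = 0.5` are hypothetical. The toy game is turn-based with all transition probabilities positive, which is a simple way to satisfy an irreducibility condition.

```python
def shapley(x, game):
    """Undiscounted dynamic programming (Shapley) operator.

    game[s] = (owner, actions), where owner is "max" or "min" and
    actions is a list of (reward, transition_probabilities) pairs.
    """
    out = []
    for owner, actions in game:
        vals = [
            reward + sum(p * xi for p, xi in zip(probs, x))
            for reward, probs in actions
        ]
        out.append(max(vals) if owner == "max" else min(vals))
    return out


def relative_km(game, theta=0.5, iterations=200):
    """Damped iteration, renormalized at each step so that max(x) == 0.

    Returns the final vector and lower/upper estimates of the mean payoff,
    taken as the min and max entries of T(x) - x.
    """
    x = [0.0] * len(game)
    for _ in range(iterations):
        tx = shapley(x, game)
        # Krasnoselskii-Mann damping: average x with its image T(x).
        damped = [(1 - theta) * xi + theta * ti for xi, ti in zip(x, tx)]
        # "Relative" step: subtract a constant to keep the iterates bounded.
        top = max(damped)
        x = [d - top for d in damped]
    tx = shapley(x, game)
    gaps = [ti - xi for ti, xi in zip(tx, x)]
    return x, min(gaps), max(gaps)


# Toy two-state game (an assumption, not taken from the paper).
TOY_GAME = [
    ("max", [(1.0, [0.5, 0.5]), (2.0, [0.2, 0.8])]),
    ("min", [(0.0, [0.5, 0.5]), (-1.0, [0.8, 0.2])]),
]

if __name__ == "__main__":
    bias, lower, upper = relative_km(TOY_GAME)
    print("bias vector:", bias)
    print("mean payoff lies in:", (lower, upper))
```

The width `upper - lower` is the Hilbert seminorm of $T(x) - x$, so it serves as a stopping criterion: when it is small, both bounds are close to the value of the game.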
-
Complexity of Geometric programming in the Turing model and application to nonnegative tensors
Authors:
Shmuel Friedland,
Stéphane Gaubert
Abstract:
We consider a version of the geometric programming problem consisting in minimizing a function given by the maximum of finitely many log-Laplace transforms of discrete nonnegative measures on a Euclidean space. Under a coerciveness assumption, we show that an $\varepsilon$-minimizer can be computed in a time that is polynomial in the input size and in $|\log\varepsilon|$. This is obtained by establishing bit-size estimates on approximate minimizers and by applying the ellipsoid method. We also derive polynomial iteration complexity bounds for the interior-point method applied to the same class of problems. We deduce that the spectral radius of a partially symmetric, weakly irreducible nonnegative tensor can be approximated within an $\varepsilon$-error in polynomial time. For strongly irreducible tensors, we show in addition that the logarithm of the positive eigenvector is polynomial time approximable. Our results also yield that the maximum of a nonnegative homogeneous $d$-form in the $\ell_d$ unit ball can be approximated in polynomial time. In particular, the spectral radius of uniform weighted hypergraphs and some known upper bounds for the clique number of uniform hypergraphs are polynomial time computable. In contrast, we provide an example showing that the Phase I approach needs exponentially many bits to solve the feasibility problem in geometric programming.
Submitted 3 June, 2025; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Universal Complexity Bounds Based on Value Iteration for Stochastic Mean Payoff Games and Entropy Games
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Ricardo D. Katz,
Mateusz Skomra
Abstract:
We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the dimension, of order R/sep, where the "separation" sep is defined as the minimal difference between distinct values arising from strategies, and R is a metric estimate, involving the norm of approximate sub and super-eigenvectors of the dynamic programming operator. We illustrate this method by two applications. The first one is a new proof, leading to improved complexity estimates, of a theorem of Boros, Elbassioni, Gurvich and Makino, showing that turn-based mean-payoff games with a fixed number of random positions can be solved in pseudo-polynomial time. The second one concerns entropy games, a model introduced by Asarin, Cervelle, Degorre, Dima, Horn and Kozyakin. The rank of an entropy game is defined as the maximal rank among all the ambiguity matrices determined by strategies of the two players. We show that entropy games with a fixed rank, in their original formulation, can be solved in polynomial time, and that an extension of entropy games incorporating weights can be solved in pseudo-polynomial time under the same fixed rank condition.
Submitted 11 November, 2024; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Computing Transience Bounds of Emergency Call Centers: a Hierarchical Timed Petri Net Approach
Authors:
Xavier Allamigeon,
Marin Boyet,
Stephane Gaubert
Abstract:
A fundamental issue in the analysis of emergency call centers is to estimate the time needed to return to a congestion-free regime after an unusual event with a massive arrival of calls. Call centers can generally be represented by timed Petri nets with a hierarchical structure, in which several layers describe the successive steps of treatments of calls. We study a continuous approximation of the Petri net dynamics (with infinitesimal tokens). Then, we show that a counter function, measuring the deviation to the stationary regime, coincides with the value function of a semi-Markov decision problem. Then, we establish a finite time convergence result, exploiting the hierarchical structure of the Petri net. We obtain an explicit bound for the transience time, as a function of the initial marking and sojourn times. This is based on methods from the theory of stochastic shortest paths and non-linear Perron--Frobenius theory. We illustrate the bound on a case study of a medical emergency call center.
Submitted 6 February, 2022;
originally announced February 2022.
-
No self-concordant barrier interior point method is strongly polynomial
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Nicolas Vandame
Abstract:
It is an open question to determine if the theory of self-concordant barriers can provide an interior point method with strongly polynomial complexity in linear programming. In the special case of the logarithmic barrier, it was shown in [Allamigeon, Benchimol, Gaubert and Joswig, SIAM J. on Applied Algebra and Geometry, 2018] that the answer is negative. In this paper, we show that none of the self-concordant barrier interior point methods is strongly polynomial. This result is obtained by establishing that, on parametric families of convex optimization problems, the log-limit of the central path degenerates to a piecewise linear curve, independently of the choice of the barrier function. We provide an explicit linear program that falls in the same class as the Klee-Minty counterexample, i.e., in dimension $n$ with $2n$ constraints, in which the number of iterations is $Ω(2^n)$.
Submitted 6 January, 2022;
originally announced January 2022.
-
Tropical linear regression and mean payoff games: or, how to measure the distance to equilibria
Authors:
Marianne Akian,
Stéphane Gaubert,
Yang Qi,
Omar Saadi
Abstract:
We study a tropical linear regression problem consisting in finding the best approximation of a set of points by a tropical hyperplane. We establish a strong duality theorem, showing that the value of this problem coincides with the maximal radius of a Hilbert's ball included in a tropical polyhedron. We also show that this regression problem is polynomial-time equivalent to mean payoff games. We illustrate our results by solving an inverse problem from auction theory. In this setting, a tropical hyperplane represents the set of equilibrium prices. Tropical linear regression allows us to quantify the distance of a market to the set of equilibria, and infer secret preferences of a decision maker.
Submitted 21 June, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
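To make the notion of distance to a tropical hyperplane concrete, here is a small sketch under stated assumptions. It assumes the max-plus convention, in which the tropical hyperplane with coefficients $a$ is the set of points $x$ where $\max_i (a_i + x_i)$ is attained at least twice, and it measures distance with Hilbert's seminorm (largest entry minus smallest entry). Under these assumptions, the distance from a point to the hyperplane is the gap between the largest and second largest values of $a_i + x_i$. The function names are hypothetical, and the sketch only evaluates the error of a given hyperplane; it does not solve the regression problem itself.

```python
def distance_to_tropical_hyperplane(point, coeffs):
    """Gap between the two largest values of coeffs[i] + point[i].

    Assumes the max-plus convention and at least two coordinates.
    The result is 0 exactly when the maximum is attained twice,
    i.e. when the point lies on the hyperplane.
    """
    shifted = sorted((a + x for a, x in zip(coeffs, point)), reverse=True)
    return shifted[0] - shifted[1]


def regression_error(coeffs, points):
    """Worst-case distance of a set of points to the hyperplane."""
    return max(distance_to_tropical_hyperplane(p, coeffs) for p in points)


if __name__ == "__main__":
    coeffs = [3.0, 1.0, 0.0]
    points = [[0.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 1.0, 0.0]]
    for p in points:
        print(p, distance_to_tropical_hyperplane(p, coeffs))
    print("regression error:", regression_error(coeffs, points))
```

Tropical linear regression, in this reading, seeks the coefficients that minimize `regression_error` over all hyperplanes.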
-
Understanding and monitoring the evolution of the Covid-19 epidemic from medical emergency calls: the example of the Paris area
Authors:
Stéphane Gaubert,
Marianne Akian,
Xavier Allamigeon,
Marin Boyet,
Baptiste Colin,
Théotime Grohens,
Laurent Massoulié,
David P. Parsons,
Frédéric Adnet,
Érick Chanzy,
Laurent Goix,
Frédéric Lapostolle,
Éric Lecarpentier,
Christophe Leroy,
Thomas Loeb,
Jean-Sébastien Marx,
Caroline Télion,
Laurent Tréluyer,
Pierre Carli
Abstract:
We portray the evolution of the Covid-19 epidemic during the crisis of March-April 2020 in the Paris area, by analyzing the medical emergency calls received by the EMS of the four central departments of this area (Centre 15 of SAMU 75, 92, 93 and 94). Our study reveals strong dissimilarities between these departments. We show that the logarithm of each epidemic observable can be approximated by a piecewise linear function of time. This allows us to distinguish the different phases of the epidemic, and to identify the delay between sanitary measures and their influence on the load of EMS. This also leads to an algorithm, allowing one to detect epidemic resurgences. We rely on a transport PDE epidemiological model, and we use methods from Perron-Frobenius theory and tropical geometry.
Submitted 20 July, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
-
A Privacy-preserving Method to Optimize Distributed Resource Allocation
Authors:
Olivier Beaude,
Pascal Benchimol,
Stéphane Gaubert,
Paulin Jacquot,
Nadia Oudjane
Abstract:
We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimize a global, possibly nonconvex, cost while satisfying the agents' constraints, for instance an energy operator in charge of the management of energy consumption flexibilities of many individual consumers. We provide a privacy-preserving algorithm that computes the optimal allocation of resources without requiring any agent to reveal her private information (constraints and individual solution profile) either to the central operator or to a third party. Our method relies on an aggregation procedure: we compute iteratively a global allocation of resources, and gradually ensure existence of a disaggregation, that is individual profiles satisfying agents' private constraints, by a protocol involving the generation of polyhedral cuts and secure multiparty computations (SMC). To obtain these cuts, we use an alternate projection method, which is implemented locally by each agent, preserving her privacy needs. We address especially the case in which the local and global constraints define a transportation polytope. Then, we provide theoretical convergence estimates together with numerical results, showing that the algorithm can be effectively used to solve the allocation problem in high dimension, while addressing privacy issues.
Submitted 22 June, 2020; v1 submitted 7 August, 2019;
originally announced August 2019.
-
A Universal Approximation Result for Difference of log-sum-exp Neural Networks
Authors:
Giuseppe C. Calafiore,
Stephane Gaubert,
Corrado Possieri
Abstract:
We show that a neural network whose output is obtained as the difference of the outputs of two feedforward networks with exponential activation function in the hidden layer and logarithmic activation function in the output node (LSE networks) is a smooth universal approximator of continuous functions over convex, compact sets. By using a logarithmic transform, this class of networks maps to a family of subtraction-free ratios of generalized posynomials, which we also show to be universal approximators of positive functions over log-convex, compact subsets of the positive orthant. The main advantage of Difference-LSE networks with respect to classical feedforward neural networks is that, after a standard training phase, they provide surrogate models for design that possess a specific difference-of-convex-functions form, which makes them optimizable via relatively efficient numerical methods. In particular, by adapting an existing difference-of-convex algorithm to these models, we obtain an algorithm for performing effective optimization-based design. We illustrate the proposed approach by applying it to data-driven design of a diet for a patient with type-2 diabetes.
Submitted 21 May, 2019;
originally announced May 2019.
-
The operator approach to entropy games
Authors:
Marianne Akian,
Stéphane Gaubert,
Julien Grand-Clément,
Jérémie Guillaud
Abstract:
Entropy games and matrix multiplication games have been recently introduced by Asarin et al. They model the situation in which one player (Despot) wishes to minimize the growth rate of a matrix product, whereas the other player (Tribune) wishes to maximize it. We develop an operator approach to entropy games. This allows us to show that entropy games can be cast as stochastic mean payoff games in which some action spaces are simplices and payments are given by a relative entropy (Kullback-Leibler divergence). In this way, we show that entropy games with a fixed number of states belonging to Despot can be solved in polynomial time. This approach also allows us to solve these games by a policy iteration algorithm, which we compare with the spectral simplex algorithm developed by Protasov.
Submitted 10 April, 2019;
originally announced April 2019.
-
A Privacy-preserving Disaggregation Algorithm for Non-intrusive Management of Flexible Energy
Authors:
Paulin Jacquot,
Olivier Beaude,
Pascal Benchimol,
Stéphane Gaubert,
Nadia Oudjane
Abstract:
We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimize a global, possibly non-convex, cost while satisfying the agents' constraints. We focus on the practical case of the management of energy consumption flexibilities by the operator of a microgrid. This paper provides a privacy-preserving algorithm that computes the optimal allocation of resources without requiring any agent to reveal her private information (constraints and individual solution profile) either to the central operator or to a third party. Our method relies on an aggregation procedure: we maintain a global allocation of resources, and gradually disaggregate this allocation to enforce the satisfaction of private constraints, by a protocol involving the generation of polyhedral cuts and secure multiparty computations (SMC). To obtain these cuts, we use an alternate projections method à la Von Neumann, which is implemented locally by each agent, preserving her privacy needs. Our theoretical and numerical results show that the method scales well as the number of agents gets large, and thus can be used to solve the allocation problem in high dimension, while addressing privacy issues.
Submitted 7 March, 2019;
originally announced March 2019.
-
Log-sum-exp neural networks and posynomial models for convex and log-log-convex data
Authors:
Giuseppe C. Calafiore,
Stephane Gaubert,
Corrado Possieri
Abstract:
We show in this paper that a one-layer feedforward neural network with exponential activation functions in the inner layer and logarithmic activation in the output neuron is a universal approximator of convex functions. Such a network represents a family of scaled log-sum exponential functions, here named LSET. Under a suitable exponential transformation, the class of LSET functions maps to a family of generalized posynomials GPOST, which we similarly show to be universal approximators for log-log-convex functions. A key feature of an LSET network is that, once it is trained on data, the resulting model is convex in the variables, which makes it readily amenable to efficient design based on convex optimization. Similarly, once a GPOST model is trained on data, it yields a posynomial model that can be efficiently optimized with respect to its variables by using geometric programming (GP). The proposed methodology is illustrated by two numerical examples, in which, first, models are constructed from simulation data of two physical processes (namely, the level of vibration in a vehicle suspension system, and the peak power generated by the combustion of propane), and then optimization-based design is performed on these models.
Submitted 8 December, 2018; v1 submitted 20 June, 2018;
originally announced June 2018.
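A scaled log-sum-exp function of the kind described in the abstract above can be sketched numerically. The parameterization below is an assumption made for illustration: it takes $\mathrm{LSE}_T(x) = T \log \sum_k \exp\big((\langle w_k, x\rangle + b_k)/T\big)$ with a temperature $T > 0$, and the names `lse_t`, `max_affine` and `TERMS` are hypothetical. The sketch checks the standard sandwich bound $\max_k \le \mathrm{LSE}_T \le \max_k + T \log K$, which shows why such convex functions can approximate a maximum of $K$ affine functions as $T$ decreases.

```python
import math


def max_affine(x, terms):
    """Maximum of the affine functions <w, x> + b over all (w, b) in terms."""
    return max(sum(w * xi for w, xi in zip(weights, x)) + bias
               for weights, bias in terms)


def lse_t(x, terms, temperature):
    """Scaled log-sum-exp of the same affine functions.

    The largest score is subtracted before exponentiating so that the
    computation stays stable for small temperatures.
    """
    scores = [
        (sum(w * xi for w, xi in zip(weights, x)) + bias) / temperature
        for weights, bias in terms
    ]
    top = max(scores)
    return temperature * (top + math.log(sum(math.exp(s - top) for s in scores)))


# Three affine pieces in one variable (illustrative data only).
TERMS = [([1.0], 0.0), ([-1.0], 0.0), ([0.0], 0.5)]

if __name__ == "__main__":
    for temperature in (1.0, 0.1, 0.01):
        gap = max(lse_t([x], TERMS, temperature) - max_affine([x], TERMS)
                  for x in (-2.0, 0.0, 0.3, 2.0))
        print(temperature, gap)
```

The printed gap never exceeds $T \log 3$ here, so lowering the temperature tightens the approximation while the function remains smooth and convex.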
-
Condition numbers of stochastic mean payoff games and what they say about nonarchimedean semidefinite programming
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Ricardo D. Katz,
Mateusz Skomra
Abstract:
Semidefinite programming can be considered over any real closed field, including fields of Puiseux series equipped with their nonarchimedean valuation. Nonarchimedean semidefinite programs encode parametric families of classical semidefinite programs, for sufficiently large values of the parameter. Recently, a correspondence has been established between nonarchimedean semidefinite programs and stochastic mean payoff games with perfect information. This correspondence relies on tropical geometry. It allows one to solve generic nonarchimedean semidefinite feasibility problems, of large scale, by means of stochastic game algorithms. In this paper, we show that the mean payoff of these games can be interpreted as a condition number for the corresponding nonarchimedean feasibility problems. This number measures how close a feasible instance is from being infeasible, and vice versa. We show that it coincides with the maximal radius of a ball in Hilbert's projective metric, that is included in the feasible set. The geometric interpretation of the condition number relies in particular on a duality theorem for tropical semidefinite feasibility programs. Then, we bound the complexity of the feasibility problem in terms of the condition number. We finally give explicit bounds for this condition number, in terms of the characteristics of the stochastic game. As a consequence, we show that the simplest algorithm to decide whether a stochastic mean payoff game is winning, namely value iteration, has a pseudopolynomial complexity when the number of random positions is fixed.
Submitted 21 February, 2018;
originally announced February 2018.
-
Analysis and Implementation of a Hourly Billing Mechanism for Demand Response Management
Authors:
Paulin Jacquot,
Olivier Beaude,
Stéphane Gaubert,
Nadia Oudjane
Abstract:
An important part of the Smart Grid literature on residential Demand Response deals with game-theoretic consumption models. Among those papers, the hourly billing model is of special interest as an intuitive and fair mechanism. We focus on this model and answer several theoretical and practical questions. First, we prove the uniqueness of the consumption profile corresponding to the Nash equilibrium, and we analyze its efficiency by providing a bound on the Price of Anarchy. Next, we address the computational issue of the equilibrium profile by providing two algorithms: the cycling best response dynamics and a projected gradient descent method, and by giving an upper bound on their convergence rate to the equilibrium. Last, we simulate this demand response framework in a stochastic environment where the parameters depend on forecasts. We show numerically the relevance of an online demand response procedure, which reduces the impact of inaccurate forecasts.
Submitted 22 December, 2017;
originally announced December 2017.
-
Demand Response in the Smart Grid: the Impact of Consumers Temporal Preferences
Authors:
Paulin Jacquot,
Olivier Beaude,
Nadia Oudjane,
Stephane Gaubert
Abstract:
In Demand Response programs, price incentives might not be sufficient to modify residential consumers' load profile. Here, we consider that each consumer has a preferred profile and a discomfort cost when deviating from it. Consumers can value this discomfort at a varying level that we take as a parameter. This work analyses Demand Response as a game theoretic environment. We study the equilibria of the game between consumers with preferences within two different dynamic pricing mechanisms, respectively the daily proportional mechanism introduced by Mohsenian-Rad et al, and an hourly proportional mechanism. We give new results about equilibria as functions of the preference level in the case of quadratic system costs and prove that, whatever the preference level, system costs are smaller with the hourly mechanism. We simulate the Demand Response environment using real consumption data from the PecanStreet database. While the Price of Anarchy always remains close to one up to 0.1% with the hourly mechanism, it can be more than 10% bigger with the daily mechanism.
Submitted 30 November, 2017;
originally announced November 2017.
-
Demand Side Management in the Smart Grid: an Efficiency and Fairness Tradeoff
Authors:
Paulin Jacquot,
Olivier Beaude,
Stéphane Gaubert,
Nadia Oudjane
Abstract:
We compare two Demand Side Management (DSM) mechanisms, introduced respectively by Mohsenian-Rad et al (2010) and Baharlouei et al (2012), in terms of efficiency and fairness. Each mechanism defines a game where the consumers optimize their flexible consumption to reduce their electricity bills. Mohsenian-Rad et al propose a daily mechanism for which they prove the social optimality. Baharlouei et al propose a hourly billing mechanism for which we give theoretical results: we prove the uniqueness of an equilibrium in the associated game and give an upper bound on its price of anarchy. We evaluate numerically the two mechanisms, using real consumption data from Pecan Street Inc. The simulations show that the equilibrium reached with the hourly mechanism is socially optimal up to 0.1%, and that it achieves an important fairness property according to a quantitative indicator we define. We observe that the two DSM mechanisms avoid the synchronization effect induced by non- game theoretic mechanisms, e.g. Peak/OffPeak hours contracts.
△ Less
Submitted 29 November, 2017;
originally announced November 2017.
-
Approximating the Volume of Tropical Polytopes is Difficult
Authors:
Stephane Gaubert,
Marie MacCaig
Abstract:
We investigate the complexity of counting the number of integer points in tropical polytopes, and the complexity of calculating their volume. We study the tropical analogue of the outer parallel body and establish bounds for its volume. We deduce that there is no approximation algorithm of factor $α=2^{\text{poly}(m,n)}$ for the volume of a tropical polytope given by $n$ vertices in a space of dimension $m$, unless P$=$NP. Neither is there such an approximation algorithm for counting the number of integer points in tropical polytopes described by vertices. It follows that approximating these values for tropical polytopes is more difficult than for classical polytopes. Our proofs use a reduction from the problem of calculating the tropical rank. For tropical polytopes described by inequalities we prove that counting the number of integer points and calculating the volume are $\#$P-hard.
Submitted 20 June, 2017;
originally announced June 2017.
-
The tropical shadow-vertex algorithm solves mean payoff games in polynomial time on average
Authors:
Xavier Allamigeon,
Pascal Benchimol,
Stéphane Gaubert
Abstract:
We introduce an algorithm which solves mean payoff games in polynomial time on average, assuming the distribution of the games satisfies a flip invariance property on the set of actions associated with every state. The algorithm is a tropical analogue of the shadow-vertex simplex algorithm, which solves mean payoff games via linear feasibility problems over the tropical semiring $(\mathbb{R} \cup \{-\infty\}, \max, +)$. The key ingredient in our approach is that the shadow-vertex pivoting rule can be transferred to tropical polyhedra, and that its computation reduces to optimal assignment problems through Plücker relations.
Submitted 11 September, 2014; v1 submitted 20 June, 2014;
originally announced June 2014.
-
Formal Proofs for Nonlinear Optimization
Authors:
Victor Magron,
Xavier Allamigeon,
Stéphane Gaubert,
Benjamin Werner
Abstract:
We present a formally verified global optimization framework. Given a semialgebraic or transcendental function $f$ and a compact semialgebraic domain $K$, we use the nonlinear maxplus template approximation algorithm to provide a certified lower bound of $f$ over $K$. This method allows one to bound in a modular way some of the constituents of $f$ by suprema of quadratic forms with a well-chosen curvature. Thus, we reduce the initial goal to a hierarchy of semialgebraic optimization problems, solved by sums of squares relaxations. Our implementation tool interleaves semialgebraic approximations with sums of squares witnesses to form certificates. It is interfaced with Coq and thus benefits from the trusted arithmetic available inside the proof assistant. This feature is used to produce, from the certificates, both valid underestimators and lower bounds for each approximated constituent. The application range for such a tool is widespread; for instance Hales' proof of Kepler's conjecture yields thousands of multivariate transcendental inequalities. We illustrate the performance of our formal framework on some of these inequalities as well as on examples from the global optimization literature.
Submitted 5 January, 2015; v1 submitted 29 April, 2014;
originally announced April 2014.
-
Checking the strict positivity of Kraus maps is NP-hard
Authors:
Stephane Gaubert,
Zheng Qu
Abstract:
Basic properties in Perron-Frobenius theory are strict positivity, primitivity and irreducibility. Whereas for nonnegative matrices, these properties are equivalent to elementary graph properties which can be checked in polynomial time, we show that for Kraus maps - the noncommutative generalization of stochastic matrices - checking strict positivity (whether the map sends the cone to its interior) is NP-hard. The proof proceeds by reducing to the latter problem the existence of a non-zero solution of a special system of bilinear equations. The complexity of irreducibility and primitivity is also discussed in the noncommutative setting.
Submitted 6 February, 2014;
originally announced February 2014.
-
Tropical Fourier-Motzkin elimination, with an application to real-time verification
Authors:
Xavier Allamigeon,
Uli Fahrenberg,
Stéphane Gaubert,
Ricardo D. Katz,
Axel Legay
Abstract:
We introduce a generalization of tropical polyhedra able to express both strict and non-strict inequalities. Such inequalities are handled by means of a semiring of germs (encoding infinitesimal perturbations). We develop a tropical analogue of Fourier-Motzkin elimination from which we derive geometrical properties of these polyhedra. In particular, we show that they coincide with the tropically convex union of (not necessarily closed) cells that are convex both classically and tropically. We also prove that the redundant inequalities produced when performing successive elimination steps can be dynamically deleted by reduction to mean payoff game problems. As a complement, we provide a coarser (polynomial time) deletion procedure which is enough to arrive at a simply exponential bound for the total execution time. These algorithms are illustrated by an application to real-time systems (reachability analysis of timed automata).
Submitted 25 June, 2014; v1 submitted 9 August, 2013;
originally announced August 2013.
-
Certification of Bounds of Non-linear Functions: the Templates Method
Authors:
Xavier Allamigeon,
Stéphane Gaubert,
Victor Magron,
Benjamin Werner
Abstract:
The aim of this work is to certify lower bounds for real-valued multivariate functions, defined by semialgebraic or transcendental expressions. The certificate must be, eventually, formally provable in a proof system such as Coq. The application range for such a tool is widespread; for instance Hales' proof of Kepler's conjecture yields thousands of inequalities. We introduce an approximation algorithm, which combines ideas of the max-plus basis method (in optimal control) and of the linear templates method developed by Manna et al. (in static analysis). This algorithm consists in bounding some of the constituents of the function by suprema of quadratic forms with a well chosen curvature. This leads to semialgebraic optimization problems, solved by sum-of-squares relaxations. Templates limit the blow up of these relaxations at the price of coarsening the approximation. We illustrate the efficiency of our framework with various examples from the literature and discuss the interfacing with Coq.
Submitted 10 July, 2013;
originally announced July 2013.
-
Dobrushin ergodicity coefficient for Markov operators on cones, and beyond
Authors:
Stéphane Gaubert,
Zheng Qu
Abstract:
The analysis of classical consensus algorithms relies on contraction properties of adjoints of Markov operators, with respect to Hilbert's projective metric or to a related family of seminorms (Hopf's oscillation or Hilbert's seminorm). We generalize these properties to abstract consensus operators over normal cones, which include the unital completely positive maps (Kraus operators) arising in quantum information theory. In particular, we show that the contraction rate of such operators, with respect to the Hopf oscillation seminorm, is given by an analogue of Dobrushin's ergodicity coefficient. We derive from this result a characterization of the contraction rate of a non-linear flow, with respect to Hopf's oscillation seminorm and to Hilbert's projective metric.
Submitted 24 November, 2014; v1 submitted 21 February, 2013;
originally announced February 2013.
-
Policy iteration algorithm for zero-sum multichain stochastic games with mean payoff and perfect information
Authors:
Marianne Akian,
Jean Cochet-Terrasson,
Sylvie Detournay,
Stéphane Gaubert
Abstract:
We consider zero-sum stochastic games with finite state and action spaces, perfect information, mean payoff criteria, without any irreducibility assumption on the Markov chains associated to strategies (multichain games). The value of such a game can be characterized by a system of nonlinear equations, involving the mean payoff vector and an auxiliary vector (relative value or bias). We develop here a policy iteration algorithm for zero-sum stochastic games with mean payoff, following an idea of two of the authors (Cochet-Terrasson and Gaubert, C. R. Math. Acad. Sci. Paris, 2006). The algorithm relies on a notion of nonlinear spectral projection (Akian and Gaubert, Nonlinear Analysis TMA, 2003), which is analogous to the notion of reduction of super-harmonic functions in linear potential theory. To avoid cycling, at each degenerate iteration (in which the mean payoff vector is not improved), the new relative value is obtained by reducing the earlier one. We show that the sequence of values and relative values satisfies a lexicographical monotonicity property, which implies that the algorithm does terminate. We illustrate the algorithm by a mean-payoff version of Richman games (stochastic tug-of-war or discrete infinity Laplacian type equation), in which degenerate iterations are frequent. We report numerical experiments on large scale instances, arising from the latter games, as well as from monotone discretizations of a mean-payoff pursuit-evasion deterministic differential game.
Submitted 2 August, 2012;
originally announced August 2012.
-
Coupling policy iteration with semi-definite relaxation to compute accurate numerical invariants in static analysis
Authors:
Assalé Adjé,
Stéphane Gaubert,
Eric Goubault
Abstract:
We introduce a new domain for finding precise numerical invariants of programs by abstract interpretation. This domain, which consists of level sets of non-linear functions, generalizes the domain of linear "templates" introduced by Manna, Sankaranarayanan, and Sipma. In the case of quadratic templates, we use Shor's semi-definite relaxation to derive computable yet precise abstractions of semantic functionals, and we show that the abstract fixpoint equation can be solved accurately by coupling policy iteration and semi-definite programming. We demonstrate the interest of our approach on a series of examples (filters, integration schemes) including a degenerate one (symplectic scheme).
Submitted 18 January, 2012; v1 submitted 22 November, 2011;
originally announced November 2011.
-
Ergodic Control and Polyhedral approaches to PageRank Optimization
Authors:
Olivier Fercoq,
Marianne Akian,
Mustapha Bouhtou,
Stéphane Gaubert
Abstract:
We study a general class of PageRank optimization problems which consist in finding an optimal outlink strategy for a web site subject to design constraints. We consider both a continuous problem, in which one can choose the intensity of a link, and a discrete one, in which in each page, there are obligatory links, facultative links and forbidden links. We show that the continuous problem, as well as its discrete variant when there are no constraints coupling different pages, can both be modeled by constrained Markov decision processes with ergodic reward, in which the webmaster determines the transition probabilities of websurfers. Although the number of actions turns out to be exponential, we show that an associated polytope of transition measures has a concise representation, from which we deduce that the continuous problem is solvable in polynomial time, and that the same is true for the discrete problem when there are no coupling constraints. We also provide efficient algorithms, adapted to very large networks. Then, we investigate the qualitative features of optimal outlink strategies, and identify in particular assumptions under which there exists a "master" page to which all controlled pages should point. We report numerical results on fragments of the real web graph.
Submitted 19 September, 2011; v1 submitted 10 November, 2010;
originally announced November 2010.
-
The set of realizations of a max-plus linear sequence is semi-polyhedral
Authors:
Vincent Blondel,
Stéphane Gaubert,
Natacha Portier
Abstract:
We show that the set of realizations of a given dimension of a max-plus linear sequence is a finite union of polyhedral sets, which can be computed from any realization of the sequence. This yields an (expensive) algorithm to solve the max-plus minimal realization problem. These results are derived from general facts on rational expressions over idempotent commutative semirings: we show more generally that the set of values of the coefficients of a commutative rational expression in one letter that yield a given max-plus linear sequence is a semi-algebraic set in the max-plus sense. In particular, it is a finite union of polyhedral sets.
Submitted 18 October, 2010;
originally announced October 2010.
-
Submodular spectral functions of principal submatrices of a hermitian matrix, extensions and applications
Authors:
S. Friedland,
S. Gaubert
Abstract:
We extend the multiplicative submodularity of the principal determinants of a nonnegative definite hermitian matrix to other spectral functions. We show that if $f$ is the primitive of a function that is operator monotone on an interval containing the spectrum of a hermitian matrix $A$, then the function $I\mapsto {\rm tr} f(A[I])$ is supermodular, meaning that ${\rm tr} f(A[I])+{\rm tr} f(A[J])\leq {\rm tr} f(A[I\cup J])+{\rm tr} f(A[I\cap J])$, where $A[I]$ denotes the $I\times I$ principal submatrix of $A$. We discuss extensions to self-adjoint operators on infinite dimensional Hilbert space and to $M$-matrices. We discuss an application to CUR approximation of nonnegative hermitian matrices.
Submitted 19 June, 2012; v1 submitted 20 July, 2010;
originally announced July 2010.
-
Tropical polar cones, hypergraph transversals, and mean payoff games
Authors:
Xavier Allamigeon,
Stephane Gaubert,
Ricardo D. Katz
Abstract:
We discuss the tropical analogues of several basic questions of convex duality. In particular, the polar of a tropical polyhedral cone represents the set of linear inequalities that its elements satisfy. We characterize the extreme rays of the polar in terms of certain minimal set covers which may be thought of as weighted generalizations of minimal transversals in hypergraphs. We also give a tropical analogue of Farkas lemma, which allows one to check whether a linear inequality is implied by a finite family of linear inequalities. Here, the certificate is a strategy of a mean payoff game. We discuss examples, showing that the number of extreme rays of the polar of the tropical cyclic polyhedral cone is polynomially bounded, and that there is no unique minimal system of inequalities defining a given tropical polyhedral cone.
Submitted 29 October, 2010; v1 submitted 16 April, 2010;
originally announced April 2010.
-
The tropical double description method
Authors:
Xavier Allamigeon,
Stephane Gaubert,
Eric Goubault
Abstract:
We develop a tropical analogue of the classical double description method allowing one to compute an internal representation (in terms of vertices) of a polyhedron defined externally (by inequalities). The heart of the tropical algorithm is a characterization of the extreme points of a polyhedron in terms of a system of constraints which define it. We show that checking the extremality of a point reduces to checking whether there is only one minimal strongly connected component in a hypergraph. The latter problem can be solved in almost linear time, which allows us to quickly eliminate redundant generators. We report extensive tests (including benchmarks from an application to static analysis) showing that the method experimentally outperforms the previous ones by orders of magnitude. The present tools also lead to worst case bounds which improve the ones provided by previous methods.
Submitted 3 February, 2010; v1 submitted 22 January, 2010;
originally announced January 2010.