Skip to main content

Showing 1–18 of 18 results for author: Pavel, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.18086  [pdf, ps, other

    cs.GT econ.TH

    Generalizing Better Response Paths and Weakly Acyclic Games

    Authors: Bora Yongacoglu, Gürdal Arslan, Lacra Pavel, Serdar Yüksel

    Abstract: Weakly acyclic games generalize potential games and are fundamental to the study of game theoretic control. In this paper, we present a generalization of weakly acyclic games, and we observe its importance in multi-agent learning when agents employ experimental strategy updates in periods where they fail to best respond. While weak acyclicity is defined in terms of path connectivity properties of… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  2. arXiv:2403.18079  [pdf, ps, other

    cs.GT cs.AI cs.LG

    Paths to Equilibrium in Games

    Authors: Bora Yongacoglu, Gürdal Arslan, Lacra Pavel, Serdar Yüksel

    Abstract: In multi-agent reinforcement learning (MARL) and game theory, agents repeatedly interact and revise their strategies as new data arrives, producing a sequence of strategy profiles. This paper studies sequences of strategies satisfying a pairwise constraint inspired by policy updating in reinforcement learning, where an agent who is best responding in one period does not switch its strategy in the… ▽ More

    Submitted 1 October, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to NeurIPS 2024

  3. arXiv:2210.16482  [pdf, other

    cs.LG cs.GT

    Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method

    Authors: Zichu Liu, Lacra Pavel

    Abstract: Despite the success of generative adversarial networks (GANs) in generating visually appealing images, they are notoriously challenging to train. In order to stabilize the learning dynamics in minimax games, we propose a novel recursive reasoning algorithm: Level $k$ Gradient Play (Lv.$k$ GP) algorithm. In contrast to many existing algorithms, our algorithm does not require sophisticated heuristic… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: For the code associated with this paper, see https://github.com/ZichuLiu/submission

  4. arXiv:2111.09982  [pdf, other

    math.OC cs.GT cs.LG eess.SY math.DS

    Second-Order Mirror Descent: Convergence in Games Beyond Averaging and Discounting

    Authors: Bolin Gao, Lacra Pavel

    Abstract: In this paper, we propose a second-order extension of the continuous-time game-theoretic mirror descent (MD) dynamics, referred to as MD2, which provably converges to mere (but not necessarily strict) variationally stable states (VSS) without using common auxiliary techniques such as time-averaging or discounting. We show that MD2 enjoys no-regret as well as an exponential rate of convergence towa… ▽ More

    Submitted 30 June, 2023; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: 16 pages, 12 figures. This work has been submitted to the IEEE for possible publication

  5. arXiv:2011.10682  [pdf, other

    math.OC cs.GT cs.MA eess.SY math.DS

    Continuous-Time Convergence Rates in Potential and Monotone Games

    Authors: Bolin Gao, Lacra Pavel

    Abstract: In this paper, we provide exponential rates of convergence to the interior Nash equilibrium for continuous-time dual-space game dynamics such as mirror descent (MD) and actor-critic (AC). We perform our analysis in $N$-player continuous concave games that satisfy certain monotonicity assumptions while possibly also admitting potential functions. In the first part of this paper, we provide a novel… ▽ More

    Submitted 2 February, 2022; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: 20 pages, 5 figures, manuscript submitted to SIAM Journal on Control and Optimization (SICON) for possible publication

  6. arXiv:1912.03460  [pdf, other

    math.OC cs.GT cs.LG cs.MA eess.SY

    Continuous-time Discounted Mirror-Descent Dynamics in Monotone Concave Games

    Authors: Bolin Gao, Lacra Pavel

    Abstract: In this paper, we consider concave continuous-kernel games characterized by monotonicity properties and propose discounted mirror descent-type dynamics. We introduce two classes of dynamics whereby the associated mirror map is constructed based on a strongly convex or a Legendre regularizer. Depending on the properties of the regularizer we show that these new dynamics can converge asymptotically… ▽ More

    Submitted 7 December, 2019; originally announced December 2019.

    Comments: 8 pages, 9 figures. This work has been submitted to the IEEE for possible publication

    Journal ref: IEEE Transactions on Automatic Control, vol 66 (11), 2021

  7. arXiv:1808.04465  [pdf, other

    math.OC cs.GT eess.SY

    Distributed GNE seeking under partial-decision information over networks via a doubly-augmented operator splitting approach

    Authors: Lacra Pavel

    Abstract: We consider distributed computation of generalized Nash equilibrium (GNE) over networks, in games with shared coupling constraints. Existing methods require that each player has full access to opponents' decisions. In this paper, we assume that players have only partial-decision information, and can communicate with their neighbours over an arbitrary undirected graph. We recast the problem as that… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: 12 pages, 5 figures. This work has been submitted to the IEEE for possible publication

    Journal ref: IEEE Transactions on Automatic Control, vol. 65 (4), pp. 1584-1597, 2020

  8. arXiv:1808.04464  [pdf, other

    math.OC cs.GT eess.SY

    On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

    Authors: Bolin Gao, Lacra Pavel

    Abstract: In this paper, we propose a passivity-based methodology for analysis and design of reinforcement learning in multi-agent finite games. Starting from a known exponentially-discounted reinforcement learning scheme, we show that convergence to a Nash distribution can be shown in the class of games characterized by the monotonicity property of their (negative) payoff. We further exploit passivity to p… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: 14 pages, 19 figures. This work has been submitted to the IEEE for possible publication

    Journal ref: IEEE Transactions on Automatic Control, 2020

  9. arXiv:1802.02277  [pdf, other

    cs.LG cs.MA

    From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning

    Authors: Mohammadhosein Hasanbeig, Lacra Pavel

    Abstract: The main focus of this paper is on enhancement of two types of game-theoretic learning algorithms: log-linear learning and reinforcement learning. The standard analysis of log-linear learning needs a highly structured environment, i.e. strong assumptions about the game from an implementation perspective. In this paper, we introduce a variant of log-linear learning that provides asymptotic guarante… ▽ More

    Submitted 18 September, 2018; v1 submitted 6 February, 2018; originally announced February 2018.

  10. arXiv:1704.00805  [pdf, other

    math.OC cs.LG

    On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning

    Authors: Bolin Gao, Lacra Pavel

    Abstract: In this paper, we utilize results from convex analysis and monotone operator theory to derive additional properties of the softmax function that have not yet been covered in the existing literature. In particular, we show that the softmax function is the monotone gradient map of the log-sum-exp function. By exploiting this connection, we show that the inverse temperature parameter determines the L… ▽ More

    Submitted 20 August, 2018; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: 10 pages, 4 figures. Comments are welcome

  11. A Distributed Nash Equilibrium Seeking in Networked Graphical Games

    Authors: Farzad Salehisadaghiani, Lacra Pavel

    Abstract: This paper considers a distributed gossip approach for finding a Nash equilibrium in networked games on graphs. In such games a player's cost function may be affected by the actions of any subset of players. An interference graph is employed to illustrate the partially-coupled cost functions and the asymmetric information requirements. For a given interference graph, network communication between… ▽ More

    Submitted 28 March, 2017; originally announced March 2017.

    Journal ref: Automatica, 87, pp. 17 - 24, 2018

  12. arXiv:1703.08509  [pdf, other

    cs.GT eess.SY

    Generalized Nash Equilibrium Problem by the Alternating Direction Method of Multipliers

    Authors: Farzad Salehisadaghiani, Lacra Pavel

    Abstract: In this paper, the problem of finding a generalized Nash equilibrium (GNE) of a networked game is studied. Players are only able to choose their decisions from a feasible action set. The feasible set is considered to be a private linear equality constraint that is coupled through decisions of the other players. We consider that each player has his own private constraint and it has not to be shared… ▽ More

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: arXiv admin note: text overlap with arXiv:1612.00414

  13. A distributed primal-dual algorithm for computation of generalized Nash equilibria with shared affine coupling constraints via operator splitting methods

    Authors: Peng Yi, Lacra Pavel

    Abstract: In this paper, we propose a distributed primal-dual algorithm for computation of a generalized Nash equilibrium (GNE) in noncooperative games over network systems. In the considered game, not only each player's local objective function depends on other players' decisions, but also the feasible decision sets of all the players are coupled together with a globally shared affine inequality constraint… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

    Comments: 21 pages,8 figures, parts are submitted to IEEE CDC

    Journal ref: Automatica, vol 102, pp. 111 -121, 2019

  14. Nash Equilibrium Seeking with Non-doubly Stochastic Communication Weight Matrix

    Authors: Farzad Salehisadaghiani, Lacra Pavel

    Abstract: A distributed Nash equilibrium seeking algorithm is presented for networked games. We assume an incomplete information available to each player about the other players' actions. The players communicate over a strongly connected digraph to send/receive the estimates of the other players' actions to/from the other local players according to a gossip communication protocol. Due to asymmetric informat… ▽ More

    Submitted 30 March, 2017; v1 submitted 21 December, 2016; originally announced December 2016.

    Journal ref: EAI Transactions on Collaborative Computing, 2019

  15. arXiv:1612.00414  [pdf, other

    eess.SY cs.GT math.OC

    Distributed Nash Equilibrium Seeking via the Alternating Direction Method of Multipliers

    Authors: Farzad Salehisadaghiani, Lacra Pavel

    Abstract: In this paper, the problem of finding a Nash equilibrium of a multi-player game is considered. The players are only aware of their own cost functions as well as the action space of all players. We develop a relatively fast algorithm within the framework of inexact-ADMM. It requires a communication graph for the information exchange between the players as well as a few mild assumptions on cost func… ▽ More

    Submitted 1 December, 2016; originally announced December 2016.

  16. arXiv:1610.01896  [pdf, other

    eess.SY cs.GT math.OC

    Distributed Nash Equilibrium Seeking By Gossip in Games on Graphs

    Authors: Farzad Salehisadaghiani, Lacra Pavel

    Abstract: We consider a gossip approach for finding a Nash equilibrium in a distributed multi-player network game. We extend previous results on Nash equilibrium seeking to the case when the players' cost functions may be affected by the actions of any subset of players. An interference graph is employed to illustrate the partially-coupled cost functions and the asymmetric information requirements. For a gi… ▽ More

    Submitted 6 October, 2016; originally announced October 2016.

  17. arXiv:1103.2490  [pdf, ps, other

    cs.GT

    Enabling Differentiated Services Using Generalized Power Control Model in Optical Networks

    Authors: Quanyan Zhu, Lacra Pavel

    Abstract: This paper considers a generalized framework to study OSNR optimization-based end-to-end link level power control problems in optical networks. We combine favorable features of game-theoretical approach and central cost approach to allow different service groups within the network. We develop solutions concepts for both cases of empty and nonempty feasible sets. In addition, we derive and prove th… ▽ More

    Submitted 12 March, 2011; originally announced March 2011.

  18. arXiv:1007.0144  [pdf, ps, other

    cs.GT cs.NI math.OC

    An Optimization and Control Theoretic Approach to Noncooperative Game Design

    Authors: Tansu Alpcan, Lacra Pavel, Nem Stefanovic

    Abstract: This paper investigates design of noncooperative games from an optimization and control theoretic perspective. Pricing mechanisms are used as a design tool to ensure that the Nash equilibrium of a fairly general class of noncooperative games satisfies certain global objectives such as welfare maximization or achieving a certain level of quality-of-service (QoS). The class of games considered provi… ▽ More

    Submitted 1 July, 2010; originally announced July 2010.

    Comments: Earlier versions of this work have appeared partly in the International Conference on Game Theory for Networks (GameNets), in Istanbul, Turkey in May 2009 and Conference on Decision and Control (CDC), Shanghai, China, in December 2009