Search | arXiv e-print repository

doi 10.1103/PhysRevE.88.012815

Coevolutionary networks of reinforcement-learning agents

Authors: Ardeshir Kianercy, Aram Galstyan

Abstract: This paper presents a model of network formation in repeated games where the players adapt their strategies and network ties simultaneously using a simple reinforcement-learning scheme. It is demonstrated that the coevolutionary dynamics of such systems can be described via coupled replicator equations. We provide a comprehensive analysis for three-player two-action games, which is the minimum sys… ▽ More This paper presents a model of network formation in repeated games where the players adapt their strategies and network ties simultaneously using a simple reinforcement-learning scheme. It is demonstrated that the coevolutionary dynamics of such systems can be described via coupled replicator equations. We provide a comprehensive analysis for three-player two-action games, which is the minimum system size with nontrivial structural dynamics. In particular, we characterize the Nash equilibria (NE) in such games and examine the local stability of the rest points corresponding to those equilibria. We also study general n-player networks via both simulations and analytical methods and find that in the absence of exploration, the stable equilibria consist of star motifs as the main building blocks of the network. Furthermore, in all stable equilibria the agents play pure strategies, even when the game allows mixed NE. Finally, we study the impact of exploration on learning outcomes, and observe that there is a critical exploration rate above which the symmetric and uniformly connected network topology becomes stable. △ Less

Submitted 5 August, 2013; originally announced August 2013.

Journal ref: Phys. Rev. E 88, 012815 (2013)

arXiv:1303.5656 [pdf, other]

doi 10.1103/PhysRevE.88.022806

Replicator dynamics with turnover of players

Authors: Jeppe Juul, Ardeshir Kianercy, Sebastian Bernhardsson, Simone Pigolotti

Abstract: We study adaptive dynamics in games where players abandon the population at a given rate, and are replaced by naive players characterized by a prior distribution over the admitted strategies. We demonstrate how such process leads macroscopically to a variant of the replicator equation, with an additional term accounting for player turnover. We study how Nash equilibria and the dynamics of the syst… ▽ More We study adaptive dynamics in games where players abandon the population at a given rate, and are replaced by naive players characterized by a prior distribution over the admitted strategies. We demonstrate how such process leads macroscopically to a variant of the replicator equation, with an additional term accounting for player turnover. We study how Nash equilibria and the dynamics of the system are modified by this additional term, for prototypical examples such as the rock-scissor-paper game and different classes of two-action games played between two distinct populations. We conclude by showing how player turnover can account for non-trivial departures from Nash equilibria observed in data from lowest unique bid auctions. △ Less

Submitted 12 August, 2013; v1 submitted 22 March, 2013; originally announced March 2013.

Comments: 14 pages, 7 figures

Journal ref: Physical Review E 88, 022806 (2013)

arXiv:1109.1528 [pdf, ps, other]

doi 10.1103/PhysRevE.85.041145

Dynamics of Boltzmann Q-Learning in Two-Player Two-Action Games

Authors: Ardeshir Kianercy, Aram Galstyan

Abstract: We consider the dynamics of Q-learning in two-player two-action games with a Boltzmann exploration mechanism. For any non-zero exploration rate the dynamics is dissipative, which guarantees that agent strategies converge to rest points that are generally different from the game's Nash Equlibria (NE). We provide a comprehensive characterization of the rest point structure for different games, and e… ▽ More We consider the dynamics of Q-learning in two-player two-action games with a Boltzmann exploration mechanism. For any non-zero exploration rate the dynamics is dissipative, which guarantees that agent strategies converge to rest points that are generally different from the game's Nash Equlibria (NE). We provide a comprehensive characterization of the rest point structure for different games, and examine the sensitivity of this structure with respect to the noise due to exploration. Our results indicate that for a class of games with multiple NE the asymptotic behavior of learning dynamics can undergo drastic changes at critical exploration rates. Furthermore, we demonstrate that for certain games with a single NE, it is possible to have additional rest points (not corresponding to any NE) that persist for a finite range of the exploration rates and disappear when the exploration rates of both players tend to zero. △ Less

Submitted 1 March, 2012; v1 submitted 7 September, 2011; originally announced September 2011.

Comments: 10 pages, 12 figures. Version 2: added more extensive discussion of asymmetric equilibria; clarified conditions for continuous/discontinuous bifurcations in coordination/anti-coordination games

Journal ref: Physical Review E, vol.85, 4, 041145, 2012

arXiv:1107.5354 [pdf, other]

Replicator Dynamics of Co-Evolving Networks

Authors: Aram Galstyan, Ardeshir Kianercy, Armen Allahverdyan

Abstract: We propose a simple model of network co-evolution in a game-dynamical system of interacting agents that play repeated games with their neighbors, and adapt their behaviors and network links based on the outcome of those games. The adaptation is achieved through a simple reinforcement learning scheme. We show that the collective evolution of such a system can be described by appropriately defined r… ▽ More We propose a simple model of network co-evolution in a game-dynamical system of interacting agents that play repeated games with their neighbors, and adapt their behaviors and network links based on the outcome of those games. The adaptation is achieved through a simple reinforcement learning scheme. We show that the collective evolution of such a system can be described by appropriately defined replicator dynamics equations. In particular, we suggest an appropriate factorization of the agents' strategies that results in a coupled system of equations characterizing the evolution of both strategies and network structure, and illustrate the framework on two simple examples. △ Less

Submitted 26 July, 2011; originally announced July 2011.

Comments: AAAI Complex Adaptive System Symposium, 2010

Showing 1–4 of 4 results for author: Kianercy, A