Hedging Algorithms and Repeated Matrix Games

Bouzy, Bruno; Métivier, Marc; Pellier, Damien

Computer Science > Machine Learning

arXiv:1810.06443 (cs)

[Submitted on 15 Oct 2018]

Title:Hedging Algorithms and Repeated Matrix Games

Authors:Bruno Bouzy, Marc Métivier, Damien Pellier

View PDF

Abstract:Playing repeated matrix games (RMG) while maximizing the cumulative returns is a basic method to evaluate multi-agent learning (MAL) algorithms. Previous work has shown that $UCB$, $M3$, $S$ or $Exp3$ algorithms have good behaviours on average in RMG. Besides, hedging algorithms have been shown to be effective on prediction problems. An hedging algorithm is made up with a top-level algorithm and a set of basic algorithms. To make its decision, an hedging algorithm uses its top-level algorithm to choose a basic algorithm, and the chosen algorithm makes the decision. This paper experimentally shows that well-selected hedging algorithms are better on average than all previous MAL algorithms on the task of playing RMG against various players. $S$ is a very good top-level algorithm, and $UCB$ and $M3$ are very good basic algorithms. Furthermore, two-level hedging algorithms are more effective than one-level hedging algorithms, and three levels are not better than two levels.

Comments:	12 pages, Workshop of the European Conference on Machine Learning on Machine Learning and Data Mining in and around Games, 2011
Subjects:	Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:1810.06443 [cs.LG]
	(or arXiv:1810.06443v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.06443
Journal reference:	Workshop of the European Conference on Machine Learning on Machine Learning and Data Mining in and around Games, 2011

Submission history

From: Damien Pellier [view email]
[v1] Mon, 15 Oct 2018 15:05:21 UTC (13 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
cs.GT
cs.MA
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Bruno Bouzy
Marc Métivier
Damien Pellier

export BibTeX citation

Computer Science > Machine Learning

Title:Hedging Algorithms and Repeated Matrix Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hedging Algorithms and Repeated Matrix Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators