Automatically Reinforcing a Game AI

St-Pierre, David L.; Hoock, Jean-Baptiste; Liu, Jialin; Teytaud, Fabien; Teytaud, Olivier

Computer Science > Artificial Intelligence

arXiv:1607.08100 (cs)

[Submitted on 27 Jul 2016]

Title:Automatically Reinforcing a Game AI

Authors:David L. St-Pierre, Jean-Baptiste Hoock, Jialin Liu, Fabien Teytaud, Olivier Teytaud

View PDF

Abstract:A recent research trend in Artificial Intelligence (AI) is the combination of several programs into one single, stronger, program; this is termed portfolio methods. We here investigate the application of such methods to Game Playing Programs (GPPs). In addition, we consider the case in which only one GPP is available - by decomposing this single GPP into several ones through the use of parameters or even simply random seeds. These portfolio methods are trained in a learning phase. We propose two different offline approaches. The simplest one, BestArm, is a straightforward optimization of seeds or parame- ters; it performs quite well against the original GPP, but performs poorly against an opponent which repeats games and learns. The second one, namely Nash-portfolio, performs similarly in a "one game" test, and is much more robust against an opponent who learns. We also propose an online learning portfolio, which tests several of the GPP repeatedly and progressively switches to the best one - using a bandit algorithm.

Comments:	17 pages, 31 figures, 2 tables
Subjects:	Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
MSC classes:	68T20
ACM classes:	I.2.8
Cite as:	arXiv:1607.08100 [cs.AI]
	(or arXiv:1607.08100v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1607.08100

Submission history

From: Jialin Liu Ph.D [view email]
[v1] Wed, 27 Jul 2016 14:10:28 UTC (434 KB)

Computer Science > Artificial Intelligence

Title:Automatically Reinforcing a Game AI

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Automatically Reinforcing a Game AI

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators