Observational Learning by Reinforcement Learning

Borsa, Diana; Piot, Bilal; Munos, Rémi; Pietquin, Olivier

arXiv:1706.06617 (cs)

[Submitted on 20 Jun 2017]

Abstract: Observational learning is a type of learning that occurs as a function of observing, retaining and possibly replicating or imitating the behaviour of another agent. It is a core mechanism appearing in various instances of social learning and has been found to be employed in several intelligent species, including humans. In this paper, we investigate to what extent the explicit modelling of other agents is necessary to achieve observational learning through machine learning. In particular, we argue that observational learning can emerge from pure Reinforcement Learning (RL), potentially coupled with memory. Through simple scenarios, we demonstrate that an RL agent can leverage the information provided by observing another agent performing a task in a shared environment. The other agent is only observed through the effect of its actions on the environment and is never explicitly modelled. Two key aspects are borrowed from observational learning: i) the observer's behaviour needs to change as a result of viewing a 'teacher' (another agent) and ii) the observer needs to be motivated somehow to make use of the other agent's behaviour. The latter is naturally modelled by RL, by correlating the learning agent's reward with the teacher agent's behaviour.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1706.06617 [cs.LG]
	(or arXiv:1706.06617v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1706.06617

Submission history

From: Olivier Pietquin
[v1] Tue, 20 Jun 2017 18:44:49 UTC (6,515 KB)
