Student-Initiated Action Advising via Advice Novelty

Ilhan, Ercument; Gow, Jeremy; Perez-Liebana, Diego

doi:10.1109/TG.2021.3113644

arXiv:2010.00381 (cs)

[Submitted on 1 Oct 2020 (v1), last revised 27 Feb 2021 (this version, v2)]

Title: Student-Initiated Action Advising via Advice Novelty

Authors: Ercument Ilhan, Jeremy Gow, Diego Perez-Liebana

Abstract: Action advising is a budget-constrained knowledge exchange mechanism between teacher-student peers that can help tackle exploration and sample inefficiency problems in deep reinforcement learning (RL). Most recently, student-initiated techniques that utilise state novelty and uncertainty estimations have obtained promising results. However, the approaches built on these estimations have some potential weaknesses. First, they assume that the convergence of the student's RL model implies less need for advice. This can be misleading in scenarios where the teacher is absent early on: the student is likely to learn suboptimally by itself, yet also to ignore the teacher's assistance later. Second, in the presence of experience replay dynamics, the delay between encountering states and having them take effect in the RL model updates causes a feedback lag in what the student actually needs advice for. We propose a student-initiated algorithm that alleviates these weaknesses by employing Random Network Distillation (RND) to measure the novelty of a piece of advice. Furthermore, we perform RND updates only for the advised states to ensure that the student's own learning does not impair its ability to leverage the teacher. Experiments in GridWorld and MinAtar show that our approach performs on par with the state-of-the-art and demonstrates significant advantages in the scenarios where the existing methods are prone to fail.
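
The novelty-gated advice request described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the linear predictor, the tanh target, the one-hot state encoding, the fixed `threshold`, and the names `AdviceNoveltyRND` and `maybe_ask_for_advice` are all assumptions made for illustration. The only ideas taken from the abstract are that RND prediction error serves as the novelty of advice, that advice is limited by a budget, and that the RND predictor is updated only on advised states.

```python
import numpy as np


class AdviceNoveltyRND:
    """Toy RND module (hypothetical): fixed random target, trainable linear predictor."""

    def __init__(self, state_dim, embed_dim=8, lr=0.5, seed=0):
        rng = np.random.default_rng(seed)
        # Target network: randomly initialised and never trained.
        self.target_w = rng.normal(size=(state_dim, embed_dim))
        # Predictor network: trained to imitate the target, here a linear map.
        self.pred_w = np.zeros((state_dim, embed_dim))
        self.lr = lr

    def _error(self, state):
        # Difference between predictor output and target output for this state.
        return state @ self.pred_w - np.tanh(state @ self.target_w)

    def novelty(self, state):
        # Mean squared prediction error; high when the state was rarely advised.
        return float(np.mean(self._error(state) ** 2))

    def update(self, state):
        # One gradient step on half the summed squared error for this state.
        self.pred_w -= self.lr * np.outer(state, self._error(state))


def maybe_ask_for_advice(rnd, state, teacher_policy, budget, threshold):
    """Return (advised action or None, remaining budget)."""
    if budget > 0 and rnd.novelty(state) > threshold:
        action = teacher_policy(state)
        # Only advised states train the predictor, so the student's own
        # experience never lowers the novelty of states it was not advised on.
        rnd.update(state)
        return action, budget - 1
    return None, budget
```

In this sketch, repeatedly visiting the same state drives its advice novelty below the threshold after a few requests, so the remaining budget is preserved for states the teacher has not yet advised on.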

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2010.00381 [cs.LG]
	(or arXiv:2010.00381v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2010.00381
Related DOI:	https://doi.org/10.1109/TG.2021.3113644

Submission history

From: Ercument Ilhan
[v1] Thu, 1 Oct 2020 13:20:28 UTC (1,847 KB)
[v2] Sat, 27 Feb 2021 08:49:43 UTC (1,781 KB)
