Skip to main content

Showing 1–1 of 1 results for author: Borsa, D L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2010.02255  [pdf, other

    cs.AI cs.LG stat.ML

    Temporal Difference Uncertainties as a Signal for Exploration

    Authors: Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, Andre Barreto, Razvan Pascanu

    Abstract: An effective approach to exploration in reinforcement learning is to rely on an agent's uncertainty over the optimal policy, which can yield near-optimal exploration strategies in tabular settings. However, in non-tabular settings that involve function approximators, obtaining accurate uncertainty estimates is almost as challenging a problem. In this paper, we highlight that value estimates are ea… ▽ More

    Submitted 1 July, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, 11 figures, 5 tables