Showing 1–3 of 3 results for author: Dasagi, V

  1. arXiv:1911.08666  [pdf, other]

    cs.LG cs.RO stat.ML

    Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

    Authors: Vibhavari Dasagi, Robert Lee, Jake Bruce, Jürgen Leitner

    Abstract: Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off…

    Submitted 19 November, 2019; originally announced November 2019.

  2. arXiv:1910.03732  [pdf, other]

    cs.LG cs.RO stat.ML

    Ctrl-Z: Recovering from Instability in Reinforcement Learning

    Authors: Vibhavari Dasagi, Jake Bruce, Thierry Peynot, Jürgen Leitner

    Abstract: When learning behavior, training data is often generated by the learner itself; this can result in unstable training dynamics, and this problem has particularly important applications in safety-sensitive real-world control tasks such as robotics. In this work, we propose a principled and model-agnostic approach to mitigate the issue of unstable learning dynamics by maintaining a history of a reinf…

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Submitted to ICRA 2020, under review

  3. arXiv:1809.07480  [pdf, other]

    cs.LG stat.ML

    Sim-to-Real Transfer of Robot Learning with Variable Length Inputs

    Authors: Vibhavari Dasagi, Robert Lee, Serena Mou, Jake Bruce, Niko Sünderhauf, Jürgen Leitner

    Abstract: Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating prior knowledge. This results in prohibitively long training times for use on real-world robotic tasks. Existing algorithms capable of extracting task-level repr…

    Submitted 8 October, 2019; v1 submitted 20 September, 2018; originally announced September 2018.