Skip to main content

Showing 1–7 of 7 results for author: Schaal, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2112.00597  [pdf, other

    cs.RO stat.ML

    Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

    Authors: Todor Davchev, Oleg Sushkov, Jean-Baptiste Regli, Stefan Schaal, Yusuf Aytar, Markus Wulfmeier, Jon Scholz

    Abstract: Complex sequential tasks in continuous-control settings often require agents to successfully traverse a set of "narrow passages" in their state space. Solving such tasks with a sparse reward in a sample-efficient manner poses a challenge to modern reinforcement learning (RL) due to the associated long-horizon nature of the problem and the lack of sufficient positive signal during learning. Various… ▽ More

    Submitted 22 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Journal ref: International Conference on Learning Representations (ICLR 2022)

  2. arXiv:1810.02422  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Simulator Predictive Control: Using Learned Task Representations and MPC for Zero-Shot Generalization and Sequencing

    Authors: Zhanpeng He, Ryan Julian, Eric Heiden, Hejia Zhang, Stefan Schaal, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman

    Abstract: Simulation-to-real transfer is an important strategy for making reinforcement learning practical with real robots. Successful sim-to-real transfer systems have difficulty producing policies which generalize across tasks, despite training for thousands of hours equivalent real robot time. To address this shortcoming, we present a novel approach to efficiently learning new robotic skills directly on… ▽ More

    Submitted 27 January, 2021; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: Presented at NeurIPS 2018 Workshop: Deep Reinforcement Learning. See https://youtu.be/te4JWe7LPKw for supplemental video

  3. arXiv:1809.10253  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Scaling simulation-to-real transfer by learning composable robot skills

    Authors: Ryan Julian, Eric Heiden, Zhanpeng He, Hejia Zhang, Stefan Schaal, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman

    Abstract: We present a novel solution to the problem of simulation-to-real transfer, which builds on recent advances in robot skill decomposition. Rather than focusing on minimizing the simulation-reality gap, we learn a set of diverse policies that are parameterized in a way that makes them easily reusable. This diversity and parameterization of low-level skills allows us to find a transferable policy that… ▽ More

    Submitted 13 November, 2018; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: Presented at ISER 2018. See https://www.youtube.com/watch?v=Syr2RQTHqTs for supplemental video

  4. arXiv:1801.10395  [pdf, other

    stat.ML

    Probabilistic Recurrent State-Space Models

    Authors: Andreas Doerr, Christian Daniel, Martin Schiegg, Duy Nguyen-Tuong, Stefan Schaal, Marc Toussaint, Sebastian Trimpe

    Abstract: State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) proved extremely successful in modeling complex time series data. Fully probabilistic SSMs, however, are often found hard to train, even for smaller problems. To overcome this limitation, we propose a novel model formulat… ▽ More

    Submitted 10 February, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

  5. arXiv:1709.07089  [pdf, other

    eess.SY cs.LG stat.ML

    On the Design of LQR Kernels for Efficient Controller Learning

    Authors: Alonso Marco, Philipp Hennig, Stefan Schaal, Sebastian Trimpe

    Abstract: Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As… ▽ More

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: 8 pages, 5 figures, to appear in 56th IEEE Conference on Decision and Control (CDC 2017)

  6. arXiv:1703.02899  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

    Authors: Andreas Doerr, Duy Nguyen-Tuong, Alonso Marco, Stefan Schaal, Sebastian Trimpe

    Abstract: PID control architectures are widely used in industrial applications. Despite their low number of open parameters, tuning multiple, coupled PID controllers can become tedious in practice. In this paper, we extend PILCO, a model-based policy search framework, to automatically tune multivariate PID controllers purely based on data observed on an otherwise unknown system. The system's state is extend… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: Accepted final version to appear in 2017 IEEE International Conference on Robotics and Automation (ICRA)

  7. arXiv:1509.04072  [pdf, other

    stat.ML eess.SY

    Robust Gaussian Filtering using a Pseudo Measurement

    Authors: Manuel Wüthrich, Cristina Garcia Cifuentes, Sebastian Trimpe, Franziska Meier, Jeannette Bohg, Jan Issac, Stefan Schaal

    Abstract: Many sensors, such as range, sonar, radar, GPS and visual devices, produce measurements which are contaminated by outliers. This problem can be addressed by using fat-tailed sensor models, which account for the possibility of outliers. Unfortunately, all estimation algorithms belonging to the family of Gaussian filters (such as the widely-used extended Kalman filter and unscented Kalman filter) ar… ▽ More

    Submitted 30 May, 2016; v1 submitted 14 September, 2015; originally announced September 2015.