Showing 1–12 of 12 results for author: Toussaint, M

  1. arXiv:2402.14402 [pdf, other]

    cs.LG stat.ML

    Global Safe Sequential Learning via Efficient Knowledge Transfer

    Authors: Cen-You Li, Olaf Duennbier, Marc Toussaint, Barbara Rakitsch, Christoph Zimmer

    Abstract: Sequential learning methods, such as active learning and Bayesian optimization, aim to select the most informative data for task learning. In many applications, however, data selection is constrained by unknown safety conditions, motivating the development of safe learning approaches. A promising line of safe learning methods uses Gaussian processes to model safety conditions, restricting data sel…

    Submitted 18 January, 2025; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in TMLR 2025

  2. arXiv:2108.00819 [pdf, other]

    cs.LG cs.AI stat.ML

    Active Learning in Gaussian Process State Space Model

    Authors: Hon Sum Alec Yu, Dingling Yao, Christoph Zimmer, Marc Toussaint, Duy Nguyen-Tuong

    Abstract: We investigate active learning in Gaussian Process state-space models (GPSSM). Our problem is to actively steer the system through latent states by determining its inputs such that the underlying dynamics can be optimally learned by a GPSSM. To select the most informative inputs, we employ mutual information as our active learning criterion. In particular, we present two approache…

    Submitted 30 July, 2021; originally announced August 2021.

    Comments: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2021

  3. arXiv:2007.07582 [pdf, other]

    cs.LG stat.ML

    Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning

    Authors: Sabrina Hoppe, Marc Toussaint

    Abstract: In state-of-the-art model-free off-policy deep reinforcement learning, a replay memory is used to store past experience and derive all network updates. Even if both state and action spaces are continuous, the replay memory only holds a finite number of transitions. We represent these transitions in a data graph and link its structure to soft divergence. By selecting a subgraph with a favorable str…

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 15 pages, 8 figures

  4. arXiv:2006.05398 [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image

    Authors: Danny Driess, Jung-Su Ha, Marc Toussaint

    Abstract: In this paper, we propose a deep convolutional recurrent neural network that predicts action sequences for task and motion planning (TAMP) from an initial scene image. Typical TAMP problems are formalized by combining reasoning on a symbolic, discrete level (e.g. first-order logic) with continuous motion planning such as nonlinear trajectory optimization. Due to the great combinatorial complexity…

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: Robotics: Science and Systems (R:SS) 2020

  5. arXiv:1905.05710 [pdf, other]

    cs.LG cs.AI stat.ML

    Trajectory-Based Off-Policy Deep Reinforcement Learning

    Authors: Andreas Doerr, Michael Volpp, Marc Toussaint, Sebastian Trimpe, Christian Daniel

    Abstract: Policy gradient methods are powerful reinforcement learning algorithms and have been demonstrated to solve many complex tasks. However, these methods are also data-inefficient, afflicted with high-variance gradient estimates, and frequently get stuck in local optima. This work addresses these weaknesses by combining recent improvements in the reuse of off-policy data and exploration in parameter s…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Includes appendix. Accepted for ICML 2019

  6. arXiv:1801.10395 [pdf, other]

    stat.ML

    Probabilistic Recurrent State-Space Models

    Authors: Andreas Doerr, Christian Daniel, Martin Schiegg, Duy Nguyen-Tuong, Stefan Schaal, Marc Toussaint, Sebastian Trimpe

    Abstract: State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) proved extremely successful in modeling complex time series data. Fully probabilistic SSMs, however, are often found hard to train, even for smaller problems. To overcome this limitation, we propose a novel model formulat…

    Submitted 10 February, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

  7. arXiv:1707.08212 [pdf, other]

    cs.AI cs.RO stat.ML

    Physical problem solving: Joint planning with symbolic, geometric, and dynamic constraints

    Authors: Ilker Yildirim, Tobias Gerstenberg, Basil Saeed, Marc Toussaint, Josh Tenenbaum

    Abstract: In this paper, we present a new task that investigates how people interact with and make judgments about towers of blocks. In Experiment 1, participants in the lab solved a series of problems in which they had to re-configure three blocks from an initial to a final configuration. We recorded whether they used one hand or two hands to do so. In Experiment 2, we asked participants online to judge wh…

    Submitted 25 July, 2017; originally announced July 2017.

  8. arXiv:1701.06450 [pdf, other]

    stat.ML cs.AI

    Identification of Unmodeled Objects from Symbolic Descriptions

    Authors: Andrea Baisero, Stefan Otte, Peter Englert, Marc Toussaint

    Abstract: Successful human-robot cooperation hinges on each agent's ability to process and exchange information about the shared environment and the task at hand. Human communication is primarily based on symbolic abstractions of object properties, rather than precise quantitative measures. A comprehensive robotic framework thus requires an integrated communication module which is able to establish a link a…

    Submitted 23 January, 2017; originally announced January 2017.

  9. arXiv:1612.03117 [pdf, other]

    cs.LG cs.AI stat.ML

    Advancing Bayesian Optimization: The Mixed-Global-Local (MGL) Kernel and Length-Scale Cool Down

    Authors: Kim Peter Wabersich, Marc Toussaint

    Abstract: Bayesian Optimization (BO) has become a core method for solving expensive black-box optimization problems. While much research has focussed on the choice of the acquisition function, we focus on online length-scale adaption and the choice of kernel function. Instead of choosing hyperparameters in view of maximum likelihood on past data, we propose to use the acquisition function to decide on hyperpara…

    Submitted 9 December, 2016; originally announced December 2016.

    Comments: Long version of accepted NIPS BayesOpt 2016 paper

    MSC Class: 68T99; 78M50; 68T05

  10. arXiv:1409.7552 [pdf, other]

    stat.ML cs.LG

    The Advantage of Cross Entropy over Entropy in Iterative Information Gathering

    Authors: Johannes Kulick, Robert Lieck, Marc Toussaint

    Abstract: Gathering the most information by picking the least amount of data is a common task in experimental design or when exploring an unknown environment in reinforcement learning and robotics. A widely used measure for quantifying the information contained in some distribution of interest is its entropy. Greedily minimizing the expected entropy is therefore a standard method for choosing samples in ord…

    Submitted 16 September, 2015; v1 submitted 26 September, 2014; originally announced September 2014.

    Comments: 24 pages

  11. arXiv:1208.2523 [pdf, other]

    cs.LG stat.ML

    Path Integral Control by Reproducing Kernel Hilbert Space Embedding

    Authors: Konrad Rawlik, Marc Toussaint, Sethu Vijayakumar

    Abstract: We present an embedding of stochastic optimal control problems, of the so-called path integral form, into reproducing kernel Hilbert spaces. Using consistent, sample-based estimates of the embedding leads to a model-free, non-parametric approach for calculation of an approximate solution to the control problem. This formulation admits a decomposition of the problem into an invariant and task depen…

    Submitted 13 August, 2012; originally announced August 2012.

  12. arXiv:1009.3958 [pdf, other]

    cs.LG stat.ML

    Approximate Inference and Stochastic Optimal Control

    Authors: Konrad Rawlik, Marc Toussaint, Sethu Vijayakumar

    Abstract: We propose a novel reformulation of the stochastic optimal control problem as an approximate inference problem, demonstrating that such an interpretation leads to new practical methods for the original problem. In particular we characterise a novel class of iterative solutions to the stochastic optimal control problem based on a natural relaxation of the exact dual formulation. These theoretical i…

    Submitted 20 September, 2010; originally announced September 2010.