Skip to main content

Showing 1–11 of 11 results for author: Hasenclever, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2009.04875  [pdf, other

    cs.LG cs.AI stat.ML

    Importance Weighted Policy Learning and Adaptation

    Authors: Alexandre Galashov, Jakub Sygnowski, Guillaume Desjardins, Jan Humplik, Leonard Hasenclever, Rae Jeong, Yee Whye Teh, Nicolas Heess

    Abstract: The ability to exploit prior experience to solve novel problems rapidly is a hallmark of biological learning systems and of great practical importance for artificial ones. In the meta reinforcement learning literature much recent work has focused on the problem of optimizing the learning process itself. In this paper we study a complementary approach which is conceptually simple, general, modular… ▽ More

    Submitted 4 June, 2021; v1 submitted 10 September, 2020; originally announced September 2020.

  2. arXiv:2005.07513  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    A Distributional View on Multi-Objective Policy Optimization

    Authors: Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

    Abstract: Many real-world problems require trading off multiple competing objectives. However, these objectives are often in different units and/or scales, which can make it challenging for practitioners to express numerical preferences over objectives in their native units. In this paper we propose a novel algorithm for multi-objective reinforcement learning that enables setting desired preferences for obj… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

  3. arXiv:2004.11410  [pdf, other

    cs.LG cs.AI stat.ML

    Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning

    Authors: Giambattista Parascandolo, Lars Buesing, Josh Merel, Leonard Hasenclever, John Aslanides, Jessica B. Hamrick, Nicolas Heess, Alexander Neitz, Theophane Weber

    Abstract: Standard planners for sequential decision making (including Monte Carlo planning, tree search, dynamic programming, etc.) are constrained by an implicit sequential planning assumption: The order in which a plan is constructed is the same in which it is executed. We consider alternatives to this assumption for the class of goal-directed Reinforcement Learning (RL) problems. Instead of an environmen… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

  4. arXiv:1905.06424  [pdf, other

    cs.LG cs.AI stat.ML

    Meta reinforcement learning as task inference

    Authors: Jan Humplik, Alexandre Galashov, Leonard Hasenclever, Pedro A. Ortega, Yee Whye Teh, Nicolas Heess

    Abstract: Humans achieve efficient learning by relying on prior knowledge about the structure of naturally occurring tasks. There is considerable interest in designing reinforcement learning (RL) algorithms with similar properties. This includes proposals to learn the learning algorithm itself, an idea also known as meta learning. One formal interpretation of this idea is as a partially observable multi-tas… ▽ More

    Submitted 22 October, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

  5. arXiv:1905.01240  [pdf, other

    cs.LG cs.AI stat.ML

    Information asymmetry in KL-regularized RL

    Authors: Alexandre Galashov, Siddhant M. Jayakumar, Leonard Hasenclever, Dhruva Tirumala, Jonathan Schwarz, Guillaume Desjardins, Wojciech M. Czarnecki, Yee Whye Teh, Razvan Pascanu, Nicolas Heess

    Abstract: Many real world tasks exhibit rich structure that is repeated across different parts of the state space or in time. In this work we study the possibility of leveraging such repeated structure to speed up and regularize learning. We start from the KL regularized expected reward objective which introduces an additional component, a default policy. Instead of relying on a fixed default policy, we lea… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

    Comments: Accepted as a conference paper at ICLR 2019

  6. arXiv:1903.07438  [pdf, other

    cs.LG stat.ML

    Exploiting Hierarchy for Learning and Transfer in KL-regularized RL

    Authors: Dhruva Tirumala, Hyeonwoo Noh, Alexandre Galashov, Leonard Hasenclever, Arun Ahuja, Greg Wayne, Razvan Pascanu, Yee Whye Teh, Nicolas Heess

    Abstract: As reinforcement learning agents are tasked with solving more challenging and diverse tasks, the ability to incorporate prior knowledge into the learning system and to exploit reusable structure in solution space is likely to become increasingly important. The KL-regularized expected reward objective constitutes one possible tool to this end. It introduces an additional component, a default or pri… ▽ More

    Submitted 23 January, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

  7. arXiv:1806.01780  [pdf, other

    cs.LG stat.ML

    Mix&Match - Agent Curricula for Reinforcement Learning

    Authors: Wojciech Marian Czarnecki, Siddhant M. Jayakumar, Max Jaderberg, Leonard Hasenclever, Yee Whye Teh, Simon Osindero, Nicolas Heess, Razvan Pascanu

    Abstract: We introduce Mix&Match (M&M) - a training framework designed to facilitate rapid and effective learning in RL agents, especially those that would be too slow or too challenging to train otherwise. The key innovation is a procedure that allows us to automatically form a curriculum over agents. Through such a curriculum we can progressively train more complex agents by, effectively, bootstrapping fr… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  8. arXiv:1803.05649  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Sylvester Normalizing Flows for Variational Inference

    Authors: Rianne van den Berg, Leonard Hasenclever, Jakub M. Tomczak, Max Welling

    Abstract: Variational inference relies on flexible approximate posterior distributions. Normalizing flows provide a general recipe to construct flexible variational posteriors. We introduce Sylvester normalizing flows, which can be seen as a generalization of planar flows. Sylvester normalizing flows remove the well-known single-unit bottleneck from planar flows, making a single transformation much more fle… ▽ More

    Submitted 20 February, 2019; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: Published at UAI 2018, 12 pages, 3 figures, code at: https://github.com/riannevdberg/sylvester-flows

  9. arXiv:1706.02692  [pdf, other

    stat.ME math.NA

    The True Cost of Stochastic Gradient Langevin Dynamics

    Authors: Tigran Nagapetyan, Andrew B. Duncan, Leonard Hasenclever, Sebastian J. Vollmer, Lukasz Szpruch, Konstantinos Zygalakis

    Abstract: The problem of posterior inference is central to Bayesian statistics and a wealth of Markov Chain Monte Carlo (MCMC) methods have been proposed to obtain asymptotically correct samples from the posterior. As datasets in applications grow larger and larger, scalability has emerged as a central problem for MCMC methods. Stochastic Gradient Langevin Dynamics (SGLD) and related stochastic gradient Mar… ▽ More

    Submitted 8 June, 2017; originally announced June 2017.

    Comments: 6 Figures

    MSC Class: 65C05

  10. arXiv:1609.04388  [pdf, other

    stat.ML

    Relativistic Monte Carlo

    Authors: Xiaoyu Lu, Valerio Perrone, Leonard Hasenclever, Yee Whye Teh, Sebastian J. Vollmer

    Abstract: Hamiltonian Monte Carlo (HMC) is a popular Markov chain Monte Carlo (MCMC) algorithm that generates proposals for a Metropolis-Hastings algorithm by simulating the dynamics of a Hamiltonian system. However, HMC is sensitive to large time discretizations and performs poorly if there is a mismatch between the spatial geometry of the target distribution and the scales of the momentum distribution. In… ▽ More

    Submitted 14 September, 2016; originally announced September 2016.

  11. arXiv:1512.09327  [pdf, other

    cs.LG stat.ML

    Distributed Bayesian Learning with Stochastic Natural-gradient Expectation Propagation and the Posterior Server

    Authors: Leonard Hasenclever, Stefan Webb, Thibaut Lienart, Sebastian Vollmer, Balaji Lakshminarayanan, Charles Blundell, Yee Whye Teh

    Abstract: This paper makes two contributions to Bayesian machine learning algorithms. Firstly, we propose stochastic natural gradient expectation propagation (SNEP), a novel alternative to expectation propagation (EP), a popular variational inference algorithm. SNEP is a black box variational algorithm, in that it does not require any simplifying assumptions on the distribution of interest, beyond the exist… ▽ More

    Submitted 7 September, 2017; v1 submitted 31 December, 2015; originally announced December 2015.

    Comments: 37 pages, 7 figures

    Journal ref: Journal of Machine Learning Research 18 (2017) 1-37