Skip to main content

Showing 1–3 of 3 results for author: van Niekerk, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2004.03499  [pdf, other

    cs.LG stat.ML

    Online Constrained Model-based Reinforcement Learning

    Authors: Benjamin van Niekerk, Andreas Damianou, Benjamin Rosman

    Abstract: Applying reinforcement learning to robotic systems poses a number of challenging problems. A key requirement is the ability to handle continuous state and action spaces while remaining within a limited time and resource budget. Additionally, for safe operation, the system must make robust decisions under hard constraints. To address these challenges, we propose a model based approach that combines… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: Conf. Uncertainty in Artificial Intelligence (UAI). 2017

  2. arXiv:1910.05725  [pdf, other

    stat.ML cs.LG

    If dropout limits trainable depth, does critical initialisation still matter? A large-scale statistical analysis on ReLU networks

    Authors: Arnu Pretorius, Elan van Biljon, Benjamin van Niekerk, Ryan Eloff, Matthew Reynard, Steve James, Benjamin Rosman, Herman Kamper, Steve Kroon

    Abstract: Recent work in signal propagation theory has shown that dropout limits the depth to which information can propagate through a neural network. In this paper, we investigate the effect of initialisation on training speed and generalisation for ReLU networks within this depth limit. We ask the following research question: given that critical initialisation is crucial for training at large depth, if d… ▽ More

    Submitted 20 February, 2020; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, under consideration at Pattern Recognition Letters

  3. arXiv:1807.04439  [pdf, other

    cs.LG stat.ML

    Will it Blend? Composing Value Functions in Reinforcement Learning

    Authors: Benjamin van Niekerk, Steven James, Adam Earle, Benjamin Rosman

    Abstract: An important property for lifelong-learning agents is the ability to combine existing skills to solve unseen tasks. In general, however, it is unclear how to compose skills in a principled way. We provide a "recipe" for optimal value function composition in entropy-regularised reinforcement learning (RL) and then extend this to the standard RL setting. Composition is demonstrated in a video game e… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: The 2nd Lifelong Learning: A Reinforcement Learning Approach (LLARLA) Workshop, Stockholm, Sweden, FAIM 2018