Showing 1–6 of 6 results for author: Liesen, J

Searching in archive cs.
  1. arXiv:2504.11453  [pdf, other]

    cs.LG cs.AI cs.RO

    A Clean Slate for Offline Reinforcement Learning

    Authors: Matthew Thomas Jackson, Uljad Berdica, Jarek Liesen, Shimon Whiteson, Jakob Nicolaus Foerster

    Abstract: Progress in offline reinforcement learning (RL) has been impeded by ambiguous problem definitions and entangled algorithmic designs, resulting in inconsistent implementations, insufficient ablations, and unfair evaluations. Although offline RL explicitly avoids environment interaction, prior methods frequently employ extensive, undocumented online evaluation for hyperparameter tuning, complicating…

    Submitted 15 April, 2025; originally announced April 2025.

  2. arXiv:2407.19396  [pdf, other]

    cs.LG cs.AI

    NAVIX: Scaling MiniGrid Environments with JAX

    Authors: Eduardo Pignatelli, Jarek Liesen, Robert Tjarko Lange, Chris Lu, Pablo Samuel Castro, Laura Toni

    Abstract: As Deep Reinforcement Learning (Deep RL) research moves towards solving large-scale worlds, efficient environment simulations become crucial for rapid experimentation. However, most existing environments struggle to scale to high throughput, setting back meaningful progress. Interactions are typically computed on the CPU, limiting training speed and throughput, due to slower computation and commun…

    Submitted 28 July, 2024; originally announced July 2024.

  3. arXiv:2406.15042  [pdf, other]

    cs.LG cs.AI

    Behaviour Distillation

    Authors: Andrei Lupu, Chris Lu, Jarek Liesen, Robert Tjarko Lange, Jakob Foerster

    Abstract: Dataset distillation aims to condense large datasets into a small number of synthetic examples that can be used as drop-in replacements when training new models. It has applications to interpretability, neural architecture search, privacy, and continual learning. Despite strong successes in supervised domains, such methods have not yet been extended to reinforcement learning, where the lack of a f…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Published as a conference paper at ICLR 2024

  4. arXiv:2406.12589  [pdf, other]

    cs.LG

    Discovering Minimal Reinforcement Learning Environments

    Authors: Jarek Liesen, Chris Lu, Andrei Lupu, Jakob N. Foerster, Henning Sprekeler, Robert T. Lange

    Abstract: Reinforcement learning (RL) agents are commonly trained and evaluated in the same environment. In contrast, humans often train in a specialized environment before being evaluated, such as studying a book before taking an exam. The potential of such specialized training environments is still vastly underexplored, despite their capacity to dramatically speed up training. The framework of synthetic…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 7 figures

  5. arXiv:2012.03913  [pdf, other]

    cs.DC

    Centrality of nodes in Federated Byzantine Agreement Systems

    Authors: André Gaul, Jörg Liesen

    Abstract: The federated Byzantine agreement system (FBAS) is a consensus model introduced by Mazières in 2016 where the participating nodes conceptually form a network, with links between them being established by each node individually and thus in a decentralized way. An important question is whether these decentralized decisions lead to an overall decentralized network. The level of (de-)centralization in…

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 24 pages, 3 figures

    MSC Class: C.2.4; ACM Class: C.2.4

  6. arXiv:1912.01365  [pdf, ps, other]

    cs.DC

    Mathematical Analysis and Algorithms for Federated Byzantine Agreement Systems

    Authors: André Gaul, Ismail Khoffi, Jörg Liesen, Torsten Stüber

    Abstract: We give an introduction to federated Byzantine agreement systems (FBAS) with many examples ranging from small "academic" cases to the current Stellar network. We then analyze the main concepts from a mathematical and an algorithmic point of view. Based on work of Lachowski we derive algorithms for quorum enumeration, checking quorum intersection, and computing the intact nodes with respect to a gi…

    Submitted 3 December, 2019; originally announced December 2019.

    MSC Class: C.2.4; ACM Class: C.2.4