Skip to main content

Showing 1–4 of 4 results for author: Beyers, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.01706  [pdf, other

    cs.LG cs.AI cs.MA

    Sable: a Performant, Efficient and Scalable Sequence Model for MARL

    Authors: Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock, Wiem Khlifi, Simon du Toit, Jemma Daniel, Louay Ben Nessir, Louise Beyers, Claude Formanek, Liam Clark, Arnu Pretorius

    Abstract: As multi-agent reinforcement learning (MARL) progresses towards solving larger and more complex problems, it becomes increasingly important that algorithms exhibit the key properties of (1) strong performance, (2) memory efficiency, and (3) scalability. In this work, we introduce Sable, a performant, memory-efficient, and scalable sequence modeling approach to MARL. Sable works by adapting the ret… ▽ More

    Submitted 26 May, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  2. arXiv:2409.12001  [pdf, other

    cs.LG cs.AI cs.MA

    Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning

    Authors: Claude Formanek, Louise Beyers, Callum Rhys Tilbury, Jonathan P. Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) is an exciting direction of research that uses static datasets to find optimal control policies for multi-agent systems. Though the field is by definition data-driven, efforts have thus far neglected data in their drive to achieve state-of-the-art results. We first substantiate this claim by surveying the literature, showing how the majority of wor… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  3. arXiv:2407.01343  [pdf, other

    cs.LG cs.AI cs.MA

    Coordination Failure in Cooperative Offline MARL

    Authors: Callum Rhys Tilbury, Claude Formanek, Louise Beyers, Jonathan P. Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) leverages static datasets of experience to learn optimal multi-agent control. However, learning from static data presents several unique challenges to overcome. In this paper, we focus on coordination failure and investigate the role of joint actions in multi-agent policy gradients with offline data, focusing on a common setting we refer to as the… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted at the Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET) at the International Conference on Machine Learning, 2024

  4. arXiv:2406.09068  [pdf, other

    cs.LG cs.AI

    Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation

    Authors: Claude Formanek, Callum Rhys Tilbury, Louise Beyers, Jonathan Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) is an emerging field with great promise for real-world applications. Unfortunately, the current state of research in offline MARL is plagued by inconsistencies in baselines and evaluation protocols, which ultimately makes it difficult to accurately assess progress, trust newly proposed innovations, and allow researchers to easily build upon prior w… ▽ More

    Submitted 30 October, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks