Skip to main content

Showing 1–3 of 3 results for author: Toit, S d

.
  1. arXiv:2505.22151  [pdf, ps, other

    cs.LG

    Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL

    Authors: Claude Formanek, Omayma Mahjoub, Louay Ben Nessir, Sasha Abramowitz, Ruan de Kock, Wiem Khlifi, Simon Du Toit, Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Arnol Fokam, Siddarth Singh, Ulrich Mbou Sob, Arnu Pretorius

    Abstract: A key challenge in offline multi-agent reinforcement learning (MARL) is achieving effective many-agent multi-step coordination in complex environments. In this work, we propose Oryx, a novel algorithm for offline cooperative MARL to directly address this challenge. Oryx adapts the recently proposed retention-based architecture Sable and combines it with a sequential form of implicit constraint Q-l… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  2. arXiv:2505.21236  [pdf, ps, other

    cs.LG cs.AI cs.MA

    Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies

    Authors: Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan de Kock, Claude Formanek, Sasha Abramowitz, Oumayma Mahjoub, Wiem Khlifi, Simon Du Toit, Louay Ben Nessir, Refiloe Shabe, Arnol Fokam, Siddarth Singh, Ulrich Mbou Sob, Arnu Pretorius

    Abstract: Reinforcement learning (RL) systems have countless applications, from energy-grid management to protein design. However, such real-world scenarios are often extremely difficult, combinatorial in nature, and require complex coordination between multiple agents. This level of complexity can cause even state-of-the-art RL systems, trained until convergence, to hit a performance ceiling which they are… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2410.01706  [pdf, other

    cs.LG cs.AI cs.MA

    Sable: a Performant, Efficient and Scalable Sequence Model for MARL

    Authors: Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock, Wiem Khlifi, Simon du Toit, Jemma Daniel, Louay Ben Nessir, Louise Beyers, Claude Formanek, Liam Clark, Arnu Pretorius

    Abstract: As multi-agent reinforcement learning (MARL) progresses towards solving larger and more complex problems, it becomes increasingly important that algorithms exhibit the key properties of (1) strong performance, (2) memory efficiency, and (3) scalability. In this work, we introduce Sable, a performant, memory-efficient, and scalable sequence modeling approach to MARL. Sable works by adapting the ret… ▽ More

    Submitted 26 May, 2025; v1 submitted 2 October, 2024; originally announced October 2024.