Skip to main content

Showing 1–7 of 7 results for author: Abramowitz, S

.
  1. arXiv:2505.22151  [pdf, ps, other

    cs.LG

    Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL

    Authors: Claude Formanek, Omayma Mahjoub, Louay Ben Nessir, Sasha Abramowitz, Ruan de Kock, Wiem Khlifi, Simon Du Toit, Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Arnol Fokam, Siddarth Singh, Ulrich Mbou Sob, Arnu Pretorius

    Abstract: A key challenge in offline multi-agent reinforcement learning (MARL) is achieving effective many-agent multi-step coordination in complex environments. In this work, we propose Oryx, a novel algorithm for offline cooperative MARL to directly address this challenge. Oryx adapts the recently proposed retention-based architecture Sable and combines it with a sequential form of implicit constraint Q-l… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  2. arXiv:2505.21236  [pdf, ps, other

    cs.LG cs.AI cs.MA

    Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies

    Authors: Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan de Kock, Claude Formanek, Sasha Abramowitz, Oumayma Mahjoub, Wiem Khlifi, Simon Du Toit, Louay Ben Nessir, Refiloe Shabe, Arnol Fokam, Siddarth Singh, Ulrich Mbou Sob, Arnu Pretorius

    Abstract: Reinforcement learning (RL) systems have countless applications, from energy-grid management to protein design. However, such real-world scenarios are often extremely difficult, combinatorial in nature, and require complex coordination between multiple agents. This level of complexity can cause even state-of-the-art RL systems, trained until convergence, to hit a performance ceiling which they are… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2410.19382  [pdf, other

    cs.LG cs.AI cs.MA

    Multi-Agent Reinforcement Learning with Selective State-Space Models

    Authors: Jemma Daniel, Ruan de Kock, Louay Ben Nessir, Sasha Abramowitz, Omayma Mahjoub, Wiem Khlifi, Claude Formanek, Arnu Pretorius

    Abstract: The Transformer model has demonstrated success across a wide range of domains, including in Multi-Agent Reinforcement Learning (MARL) where the Multi-Agent Transformer (MAT) has emerged as a leading algorithm in the field. However, a significant drawback of Transformer models is their quadratic computational complexity relative to input size, making them computationally expensive when scaling to l… ▽ More

    Submitted 28 October, 2024; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: 17 pages, 7 figures

    ACM Class: I.2.11

  4. arXiv:2410.01706  [pdf, other

    cs.LG cs.AI cs.MA

    Sable: a Performant, Efficient and Scalable Sequence Model for MARL

    Authors: Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock, Wiem Khlifi, Simon du Toit, Jemma Daniel, Louay Ben Nessir, Louise Beyers, Claude Formanek, Liam Clark, Arnu Pretorius

    Abstract: As multi-agent reinforcement learning (MARL) progresses towards solving larger and more complex problems, it becomes increasingly important that algorithms exhibit the key properties of (1) strong performance, (2) memory efficiency, and (3) scalability. In this work, we introduce Sable, a performant, memory-efficient, and scalable sequence modeling approach to MARL. Sable works by adapting the ret… ▽ More

    Submitted 26 May, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  5. arXiv:2311.18598  [pdf, other

    cs.LG cs.AI cs.MA

    Generalisable Agents for Neural Network Optimisation

    Authors: Kale-ab Tessera, Callum Rhys Tilbury, Sasha Abramowitz, Ruan de Kock, Omayma Mahjoub, Benjamin Rosman, Sara Hooker, Arnu Pretorius

    Abstract: Optimising deep neural networks is a challenging task due to complex training dynamics, high computational requirements, and long training times. To address this difficulty, we propose the framework of Generalisable Agents for Neural Network Optimisation (GANNO) -- a multi-agent reinforcement learning (MARL) approach that learns to improve neural network optimisation by dynamically and responsivel… ▽ More

    Submitted 22 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Workshop on Advanced Neural Network Training (WANT) and Optimization for Machine Learning (OPT) at NeurIPS 2023

  6. arXiv:2306.09884  [pdf, other

    cs.LG cs.AI

    Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

    Authors: Clément Bonnet, Daniel Luo, Donal Byrne, Shikha Surana, Sasha Abramowitz, Paul Duckworth, Vincent Coyette, Laurence I. Midgley, Elshadai Tegegn, Tristan Kalloniatis, Omayma Mahjoub, Matthew Macfarlane, Andries P. Smit, Nathan Grinsztajn, Raphael Boige, Cemlyn N. Waters, Mohamed A. Mimouni, Ulrich A. Mbou Sob, Ruan de Kock, Siddarth Singh, Daniel Furelos-Blanco, Victor Le, Arnu Pretorius, Alexandre Laterre

    Abstract: Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments speci… ▽ More

    Submitted 15 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 9 pages + 21 pages of appendices and references. Published at ICLR 2024

  7. arXiv:2107.01460  [pdf, other

    cs.LG cs.MA

    Mava: a research library for distributed multi-agent reinforcement learning in JAX

    Authors: Ruan de Kock, Omayma Mahjoub, Sasha Abramowitz, Wiem Khlifi, Callum Rhys Tilbury, Claude Formanek, Andries Smit, Arnu Pretorius

    Abstract: Multi-agent reinforcement learning (MARL) research is inherently computationally expensive and it is often difficult to obtain a sufficient number of experiment samples to test hypotheses and make robust statistical claims. Furthermore, MARL algorithms are typically complex in their design and can be tricky to implement correctly. These aspects of MARL present a difficult challenge when it comes t… ▽ More

    Submitted 15 December, 2023; v1 submitted 3 July, 2021; originally announced July 2021.