Skip to main content

Showing 1–4 of 4 results for author: Gallici, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.08222  [pdf, ps, other

    cs.RO cs.AI cs.DC cs.PF

    Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles

    Authors: Matteo Gallici, Ivan Masmitja, Mario Martín

    Abstract: Autonomous vehicles (AV) offer a cost-effective solution for scientific missions such as underwater tracking. Recently, reinforcement learning (RL) has emerged as a powerful method for controlling AVs in complex marine environments. However, scaling these techniques to a fleet--essential for multi-target tracking or targets with rapid, unpredictable motion--presents significant computational chall… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  2. arXiv:2407.04811  [pdf, other

    cs.LG

    Simplifying Deep Temporal Difference Learning

    Authors: Matteo Gallici, Mattie Fellows, Benjamin Ellis, Bartomeu Pou, Ivan Masmitja, Jakob Nicolaus Foerster, Mario Martin

    Abstract: Q-learning played a foundational role in the field reinforcement learning (RL). However, TD algorithms with off-policy data, such as Q-learning, or nonlinear function approximation like deep neural networks require several additional tricks to stabilise training, primarily a large replay buffer and target networks. Unfortunately, the delayed updating of frozen network parameters in the target netw… ▽ More

    Submitted 21 April, 2025; v1 submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2311.10090  [pdf, other

    cs.LG cs.AI cs.MA

    JaxMARL: Multi-Agent RL Environments and Algorithms in JAX

    Authors: Alexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook, Andrei Lupu, Gardar Ingvarsson, Timon Willi, Ravi Hammond, Akbir Khan, Christian Schroeder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert Tjarko Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktaschel, Chris Lu, Jakob Nicolaus Foerster

    Abstract: Benchmarks are crucial in the development of machine learning algorithms, with available environments significantly influencing reinforcement learning (RL) research. Traditionally, RL environments run on the CPU, which limits their scalability with typical academic compute. However, recent advancements in JAX have enabled the wider use of hardware acceleration, enabling massively parallel RL train… ▽ More

    Submitted 2 November, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  4. arXiv:2301.05334  [pdf

    cs.LG cs.AI cs.MA

    TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems

    Authors: Matteo Gallici, Mario Martin, Ivan Masmitja

    Abstract: Coordination is one of the most difficult aspects of multi-agent reinforcement learning (MARL). One reason is that agents normally choose their actions independently of one another. In order to see coordination strategies emerging from the combination of independent policies, the recent research has focused on the use of a centralized function (CF) that learns each agent's contribution to the team… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted at AAMAS 2023. Code at https://github.com/mttga/pymarl_transformers