Skip to main content

Showing 1–5 of 5 results for author: Shock, J P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.15145  [pdf, other

    cs.CY cs.LG

    Opportunities of Reinforcement Learning in South Africa's Just Transition

    Authors: Claude Formanek, Callum Rhys Tilbury, Jonathan P. Shock

    Abstract: South Africa stands at a crucial juncture, grappling with interwoven socio-economic challenges such as poverty, inequality, unemployment, and the looming climate crisis. The government's Just Transition framework aims to enhance climate resilience, achieve net-zero greenhouse gas emissions by 2050, and promote social inclusion and poverty eradication. According to the Presidential Commission on th… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: Accepted at the Southern African Conference for Artificial Intelligence Research 2024

  2. arXiv:2409.12001  [pdf, other

    cs.LG cs.AI cs.MA

    Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning

    Authors: Claude Formanek, Louise Beyers, Callum Rhys Tilbury, Jonathan P. Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) is an exciting direction of research that uses static datasets to find optimal control policies for multi-agent systems. Though the field is by definition data-driven, efforts have thus far neglected data in their drive to achieve state-of-the-art results. We first substantiate this claim by surveying the literature, showing how the majority of wor… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  3. arXiv:2407.01343  [pdf, other

    cs.LG cs.AI cs.MA

    Coordination Failure in Cooperative Offline MARL

    Authors: Callum Rhys Tilbury, Claude Formanek, Louise Beyers, Jonathan P. Shock, Arnu Pretorius

    Abstract: Offline multi-agent reinforcement learning (MARL) leverages static datasets of experience to learn optimal multi-agent control. However, learning from static data presents several unique challenges to overcome. In this paper, we focus on coordination failure and investigate the role of joint actions in multi-agent policy gradients with offline data, focusing on a common setting we refer to as the… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted at the Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET) at the International Conference on Machine Learning, 2024

  4. arXiv:2308.08029  [pdf, ps, other

    cs.AI cs.LG q-bio.NC

    Sophisticated Learning: A novel algorithm for active learning during model-based planning

    Authors: Rowan Hodson, Bruce Bassett, Charel van Hoof, Benjamin Rosman, Mark Solms, Jonathan P. Shock, Ryan Smith

    Abstract: We introduce Sophisticated Learning (SL), a planning-to-learn algorithm that embeds active parameter learning inside the Sophisticated Inference (SI) tree-search framework of Active Inference. Unlike SI -- which optimizes beliefs about hidden states -- SL also updates beliefs about model parameters within each simulated branch, enabling counterfactual reasoning about how future observations would… ▽ More

    Submitted 14 August, 2025; v1 submitted 15 August, 2023; originally announced August 2023.

  5. arXiv:1504.03184  [pdf, ps, other

    cs.IT math.DG math.ST physics.data-an

    Probability Density Functions from the Fisher Information Metric

    Authors: T. Clingman, Jeff Murugan, Jonathan P. Shock

    Abstract: We show a general relation between the spatially disjoint product of probability density functions and the sum of their Fisher information metric tensors. We then utilise this result to give a method for constructing the probability density functions for an arbitrary Riemannian Fisher information metric tensor. We note further that this construction is extremely unconstrained, depending only on ce… ▽ More

    Submitted 13 April, 2015; originally announced April 2015.

    Comments: 16 pages, no figures

    Report number: QGaSLAB-15-02