Skip to main content

Showing 1–19 of 19 results for author: Beukman, M

.
  1. arXiv:2505.20659  [pdf, ps, other

    cs.LG

    An Optimisation Framework for Unsupervised Environment Design

    Authors: Nathan Monette, Alistair Letcher, Michael Beukman, Matthew T. Jackson, Alexander Rutherford, Alexander D. Goldie, Jakob N. Foerster

    Abstract: For reinforcement learning agents to be deployed in high-risk settings, they must achieve a high level of robustness to unfamiliar scenarios. One method for improving robustness is unsupervised environment design (UED), a suite of methods aiming to maximise an agent's generalisability across configurations of an environment. In this work, we study UED from an optimisation perspective, providing st… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Reinforcement Learning Conference 2025

  2. arXiv:2410.23208  [pdf, other

    cs.LG cs.AI

    Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks

    Authors: Michael Matthews, Michael Beukman, Chris Lu, Jakob Foerster

    Abstract: While large models trained with self-supervised learning on offline datasets have shown remarkable capabilities in text and image domains, achieving the same generalisation for agents that act in sequential decision problems remains an open challenge. In this work, we take a step towards this goal by procedurally generating tens of millions of 2D physics-based tasks and using these to train a gene… ▽ More

    Submitted 3 March, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 Oral. The first two authors contributed equally. Project page located at: https://kinetix-env.github.io/

  3. arXiv:2409.00853  [pdf, other

    cs.AI cs.NE

    JaxLife: An Open-Ended Agentic Simulator

    Authors: Chris Lu, Michael Beukman, Michael Matthews, Jakob Foerster

    Abstract: Human intelligence emerged through the process of natural selection and evolution on Earth. We investigate what it would take to re-create this process in silico. While past work has often focused on low-level processes (such as simulating physics or chemistry), we instead take a more targeted approach, aiming to evolve agents that can accumulate open-ended culture and technologies across generati… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

  4. arXiv:2408.15099  [pdf, other

    cs.LG cs.AI cs.RO

    No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery

    Authors: Alexander Rutherford, Michael Beukman, Timon Willi, Bruno Lacerda, Nick Hawes, Jakob Foerster

    Abstract: What data or environments to use for training to improve downstream performance is a longstanding and very topical question in reinforcement learning. In particular, Unsupervised Environment Design (UED) methods have gained recent attention as their adaptive curricula promise to enable agents to be robust to in- and out-of-distribution tasks. This work investigates how existing UED methods select… ▽ More

    Submitted 29 October, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

  5. arXiv:2407.14516  [pdf, other

    cs.RO cs.LG

    RobocupGym: A challenging continuous control benchmark in Robocup

    Authors: Michael Beukman, Branden Ingram, Geraud Nangue Tasse, Benjamin Rosman, Pravesh Ranchod

    Abstract: Reinforcement learning (RL) has progressed substantially over the past decade, with much of this progress being driven by benchmarks. Many benchmarks are focused on video or board games, and a large number of robotics benchmarks lack diversity and real-world applicability. In this paper, we aim to simplify the process of applying reinforcement learning in the 3D simulation league of Robocup, a rob… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  6. arXiv:2403.13091  [pdf, other

    cs.LG cs.AI

    JaxUED: A simple and useable UED library in Jax

    Authors: Samuel Coward, Michael Beukman, Jakob Foerster

    Abstract: We present JaxUED, an open-source library providing minimal dependency implementations of modern Unsupervised Environment Design (UED) algorithms in Jax. JaxUED leverages hardware acceleration to obtain on the order of 100x speedups compared to prior, CPU-based implementations. Inspired by CleanRL, we provide fast, clear, understandable, and easily modifiable implementations, with the aim of accel… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 11 pages, 5 figures

  7. arXiv:2402.16801  [pdf, other

    cs.LG

    Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

    Authors: Michael Matthews, Michael Beukman, Benjamin Ellis, Mikayel Samvelyan, Matthew Jackson, Samuel Coward, Jakob Foerster

    Abstract: Benchmarks play a crucial role in the development and analysis of reinforcement learning (RL) algorithms. We identify that existing benchmarks used for research into open-ended learning fall into one of two categories. Either they are too slow for meaningful research to be performed without enormous computational resources, like Crafter, NetHack and Minecraft, or they are not complex enough to pos… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  8. arXiv:2402.12284  [pdf, other

    cs.LG cs.AI

    Refining Minimax Regret for Unsupervised Environment Design

    Authors: Michael Beukman, Samuel Coward, Michael Matthews, Mattie Fellows, Minqi Jiang, Michael Dennis, Jakob Foerster

    Abstract: In unsupervised environment design, reinforcement learning agents are trained on environment configurations (levels) generated by an adversary that maximises some objective. Regret is a commonly used objective that theoretically results in a minimax regret (MMR) policy with desirable robustness guarantees; in particular, the agent's maximum regret is bounded. However, once the agent reaches this r… ▽ More

    Submitted 8 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: ICML 2024. The first two authors contributed equally

  9. arXiv:2310.16686  [pdf, other

    cs.AI cs.LG

    Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies

    Authors: Michael Beukman, Devon Jarvis, Richard Klein, Steven James, Benjamin Rosman

    Abstract: While reinforcement learning has achieved remarkable successes in several domains, its real-world application is limited due to many methods failing to generalise to unfamiliar conditions. In this work, we consider the problem of generalising to new transition dynamics, corresponding to cases in which the environment's response to the agent's actions differs. For example, the gravitational force e… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  10. arXiv:2309.05311  [pdf, other

    cs.CL

    Analysing Cross-Lingual Transfer in Low-Resourced African Named Entity Recognition

    Authors: Michael Beukman, Manuel Fokam

    Abstract: Transfer learning has led to large gains in performance for nearly all NLP tasks while making downstream models easier and faster to train. This has also been extended to low-resourced languages, with some success. We investigate the properties of cross-lingual transfer learning between ten low-resourced languages, from the perspective of a named entity recognition task. We specifically investigat… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted to IJCNLP-AACL 2023

  11. arXiv:2302.01561  [pdf, other

    cs.AI

    Hierarchically Composing Level Generators for the Creation of Complex Structures

    Authors: Michael Beukman, Manuel Fokam, Marcel Kruger, Guy Axelrod, Muhammad Nasir, Branden Ingram, Benjamin Rosman, Steven James

    Abstract: Procedural content generation (PCG) is a growing field, with numerous applications in the video game industry and great potential to help create better games at a fraction of the cost of manual creation. However, much of the work in PCG is focused on generating relatively straightforward levels in simple games, as it is challenging to design an optimisable objective function for complex settings.… ▽ More

    Submitted 19 July, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Code is available at https://github.com/Michael-Beukman/MCHAMR. This work has been accepted to IEEE Transactions on Games, with copyright transferred to the IEEE

  12. arXiv:2210.12391  [pdf, other

    cs.CL

    MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition

    Authors: David Ifeoluwa Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Michael Beukman, Chester Palen-Michel, Constantine Lignos, Jesujoba O. Alabi, Shamsuddeen H. Muhammad, Peter Nabende, Cheikh M. Bamba Dione, Andiswa Bukula, Rooweither Mabuya, Bonaventure F. P. Dossou, Blessing Sibanda, Happy Buzaaba, Jonathan Mukiibi, Godson Kalipe, Derguene Mbaye, Amelia Taylor, Fatoumata Kabore, Chris Chinenye Emezue, Anuoluwapo Aremu, Perez Ogayo, Catherine Gitau , et al. (20 additional authors not shown)

    Abstract: African languages are spoken by over a billion people, but are underrepresented in NLP research and development. The challenges impeding progress include the limited availability of annotated datasets, as well as a lack of understanding of the settings where current methods are effective. In this paper, we make progress towards solutions for these challenges, focusing on the task of named entity r… ▽ More

    Submitted 15 November, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 (updated Github link)

  13. arXiv:2210.11442  [pdf, other

    cs.AI cs.NE

    Augmentative Topology Agents For Open-Ended Learning

    Authors: Muhammad Umair Nasir, Michael Beukman, Steven James, Christopher Wesley Cleghorn

    Abstract: In this work, we tackle the problem of open-ended learning by introducing a method that simultaneously evolves agents and increasingly challenging environments. Unlike previous open-ended approaches that optimize agents using a fixed neural network topology, we hypothesize that generalization can be improved by allowing agents' controllers to become more complex as they encounter more difficult en… ▽ More

    Submitted 11 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted to The Proceedings of Genetic and Evolutionary Computation Conference (GECCO) 2023

  14. arXiv:2210.10692  [pdf, ps, other

    cs.CL

    Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages

    Authors: Idris Abdulmumin, Michael Beukman, Jesujoba O. Alabi, Chris Emezue, Everlyn Asiko, Tosin Adewumi, Shamsuddeen Hassan Muhammad, Mofetoluwa Adeyemi, Oreen Yousuf, Sahib Singh, Tajuddeen Rabiu Gwadabe

    Abstract: We participated in the WMT 2022 Large-Scale Machine Translation Evaluation for the African Languages Shared Task. This work describes our approach, which is based on filtering the given noisy data using a sentence-pair classifier that was built by fine-tuning a pre-trained language model. To train the classifier, we obtain positive samples (i.e. high-quality parallel sentences) from a gold-standar… ▽ More

    Submitted 20 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at the Seventh Conference on Machine Translation (WMT22)

  15. arXiv:2208.04568  [pdf, other

    cs.CL cs.AI

    The Impact of Data Corruption on Named Entity Recognition for Low-resourced Languages

    Authors: Manuel Fokam, Michael Beukman

    Abstract: Data availability and quality are major challenges in natural language processing for low-resourced languages. In particular, there is significantly less data available than for higher-resourced languages. This data is also often of low quality, rife with errors, invalid text or incorrect annotations. Many prior works focus on dealing with these problems, either by generating synthetic data, or fi… ▽ More

    Submitted 25 November, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

  16. arXiv:2205.02022  [pdf, other

    cs.CL

    A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

    Authors: David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles Hacheme, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi , et al. (20 additional authors not shown)

    Abstract: Recent advances in the pre-training of language models leverage large-scale datasets to create multilingual models. However, low-resource languages are mostly left out in these datasets. This is primarily because many widely spoken languages are not well represented on the web and therefore excluded from the large-scale crawls used to create datasets. Furthermore, downstream users of these models… ▽ More

    Submitted 22 August, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022 (added evaluation data for amh, kin, nya, sna, xho)

  17. arXiv:2204.11842  [pdf, other

    cs.LG cs.AI

    Adaptive Online Value Function Approximation with Wavelets

    Authors: Michael Beukman, Michael Mitchley, Dean Wookey, Steven James, George Konidaris

    Abstract: Using function approximation to represent a value function is necessary for continuous and high-dimensional state spaces. Linear function approximation has desirable theoretical guarantees and often requires less compute and samples than neural networks, but most approaches suffer from an exponential growth in the number of functions as the dimensionality of the state space increases. In this work… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to RLDM 2022. Code is located at https://github.com/Michael-Beukman/WaveletRL

  18. Procedural Content Generation using Neuroevolution and Novelty Search for Diverse Video Game Levels

    Authors: Michael Beukman, Christopher W Cleghorn, Steven James

    Abstract: Procedurally generated video game content has the potential to drastically reduce the content creation budget of game developers and large studios. However, adoption is hindered by limitations such as slow generation, as well as low quality and diversity of content. We introduce an evolutionary search-based approach for evolving level generators using novelty search to procedurally generate divers… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to the Genetic and Evolutionary Computation Conference (GECCO '22), July 9--13, 2022, Boston, MA, USA. Code is located at https://github.com/Michael-Beukman/PCGNN

  19. arXiv:2201.10334  [pdf, ps, other

    cs.AI

    Towards Objective Metrics for Procedurally Generated Video Game Levels

    Authors: Michael Beukman, Steven James, Christopher Cleghorn

    Abstract: With increasing interest in procedural content generation by academia and game developers alike, it is vital that different approaches can be compared fairly. However, evaluating procedurally generated video game levels is often difficult, due to the lack of standardised, game-independent metrics. In this paper, we introduce two simulation-based evaluation metrics that involve analysing the behavi… ▽ More

    Submitted 9 March, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 7 pages, 10 figures. V3: This work has been submitted to the IEEE for possible publication. Code is located at https://github.com/Michael-Beukman/PCGNN