Skip to main content

Showing 1–6 of 6 results for author: Mon-Williams, R

.
  1. arXiv:2505.22597  [pdf, ps, other

    cs.AI cs.LG cs.MA

    HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym

    Authors: Ngoc La, Ruaridh Mon-Williams, Julie A. Shah

    Abstract: In recent years, reinforcement learning (RL) methods have been widely tested using tools like OpenAI Gym, though many tasks in these environments could also benefit from hierarchical planning. However, there is a lack of a tool that enables seamless integration of hierarchical planning with RL. Hierarchical Domain Definition Language (HDDL), used in classical planning, introduces a structured appr… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Accepted to Proceedings of ICAPS 2025

  2. arXiv:2505.17323  [pdf, ps, other

    cs.AI cs.LG

    Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)

    Authors: Ruaridh Mon-Williams, Max Taylor-Davies, Elizabeth Mieczkowski, Natalia Velez, Neil R. Bramley, Yanwei Wang, Thomas L. Griffiths, Christopher G. Lucas

    Abstract: Humans are remarkably adept at collaboration, able to infer the strengths and weaknesses of new partners in order to work successfully towards shared goals. To build AI systems with this capability, we must first understand its building blocks: does such flexibility require explicit, dedicated mechanisms for modelling others -- or can it emerge spontaneously from the pressures of open-ended cooper… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  3. arXiv:2503.15703  [pdf, other

    cs.MA cs.AI

    Predicting Multi-Agent Specialization via Task Parallelizability

    Authors: Elizabeth Mieczkowski, Ruaridh Mon-Williams, Neil Bramley, Christopher G. Lucas, Natalia Velez, Thomas L. Griffiths

    Abstract: Multi-agent systems often rely on specialized agents with distinct roles rather than general-purpose agents that perform the entire task independently. However, the conditions that govern the optimal degree of specialization remain poorly understood. In this work, we propose that specialist teams outperform generalist ones when environmental constraints limit task parallelizability -- the potentia… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  4. arXiv:2408.10123  [pdf, other

    cs.RO cs.CV

    Learning Precise Affordances from Egocentric Videos for Robotic Manipulation

    Authors: Gen Li, Nikolaos Tsagkas, Jifei Song, Ruaridh Mon-Williams, Sethu Vijayakumar, Kun Shao, Laura Sevilla-Lara

    Abstract: Affordance, defined as the potential actions that an object offers, is crucial for robotic manipulation tasks. A deep understanding of affordance can lead to more intelligent AI systems. For example, such knowledge directs an agent to grasp a knife by the handle for cutting and by the blade when passing it to someone. In this paper, we present a streamlined affordance learning system that encompas… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Project page: https://reagan1311.github.io/affgrasp

  5. arXiv:2406.11231  [pdf, other

    cs.RO cs.AI cs.CL cs.LG

    Enabling robots to follow abstract instructions and complete complex dynamic tasks

    Authors: Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, Chris Lucas

    Abstract: Completing complex tasks in unpredictable settings like home kitchens challenges robotic systems. These challenges include interpreting high-level human commands, such as "make me a hot beverage" and performing actions like pouring a precise amount of water into a moving mug. To address these challenges, we present a novel framework that combines Large Language Models (LLMs), a curated Knowledge B… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2307.13447  [pdf, other

    cs.RO cs.AI cs.LG

    A behavioural transformer for effective collaboration between a robot and a non-stationary human

    Authors: Ruaridh Mon-Williams, Theodoros Stouraitis, Sethu Vijayakumar

    Abstract: A key challenge in human-robot collaboration is the non-stationarity created by humans due to changes in their behaviour. This alters environmental transitions and hinders human-robot collaboration. We propose a principled meta-learning framework to explore how robots could better predict human behaviour, and thereby deal with issues of non-stationarity. On the basis of this framework, we develope… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: 8 pages, 6 figures