Skip to main content

Showing 1–26 of 26 results for author: Mannion, P

Searching in archive cs. Search in all archives.
.
  1. Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning

    Authors: Junlin Lu, Patrick Mannion, Karl Mason

    Abstract: Many decision-making problems feature multiple objectives where it is not always possible to know the preferences of a human or agent decision-maker for different objectives. However, demonstrated behaviors from the decision-maker are often available. This research proposes a dynamic weight-based preference inference (DWPI) algorithm that can infer the preferences of agents acting in multi-objecti… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: Neural Comput & Applic (2024)

  2. arXiv:2408.00682  [pdf, other

    cs.MA cs.AI cs.GT

    Learning in Multi-Objective Public Goods Games with Non-Linear Utilities

    Authors: Nicole Orzan, Erman Acar, Davide Grossi, Patrick Mannion, Roxana Rădulescu

    Abstract: Addressing the question of how to achieve optimal decision-making under risk and uncertainty is crucial for enhancing the capabilities of artificial agents that collaborate with or support humans. In this work, we address this question in the context of Public Goods Games. We study learning in a novel multi-objective version of the Public Goods Game where agents have different risk preferences, by… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: In press at ECAI 2024

  3. arXiv:2407.16312  [pdf, other

    cs.MA cs.AI cs.GT

    MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning

    Authors: Florian Felten, Umut Ucak, Hicham Azmani, Gao Peng, Willem Röpke, Hendrik Baier, Patrick Mannion, Diederik M. Roijers, Jordan K. Terry, El-Ghazali Talbi, Grégoire Danoy, Ann Nowé, Roxana Rădulescu

    Abstract: Many challenging tasks such as managing traffic systems, electricity grids, or supply chains involve complex decision-making processes that must balance multiple conflicting objectives and coordinate the actions of various independent decision-makers (DMs). One perspective for formalising and addressing such tasks is multi-objective multi-agent reinforcement learning (MOMARL). MOMARL broadens rein… ▽ More

    Submitted 27 October, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  4. arXiv:2407.11489  [pdf, other

    cs.LG cs.AI

    A Meta-Learning Approach for Multi-Objective Reinforcement Learning in Sustainable Home Environments

    Authors: Junlin Lu, Patrick Mannion, Karl Mason

    Abstract: Effective residential appliance scheduling is crucial for sustainable living. While multi-objective reinforcement learning (MORL) has proven effective in balancing user preferences in appliance scheduling, traditional MORL struggles with limited data in non-stationary residential settings characterized by renewable generation variations. Significant context shifts that can invalidate previously le… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  5. arXiv:2404.03997  [pdf, other

    cs.LG cs.AI

    Demonstration Guided Multi-Objective Reinforcement Learning

    Authors: Junlin Lu, Patrick Mannion, Karl Mason

    Abstract: Multi-objective reinforcement learning (MORL) is increasingly relevant due to its resemblance to real-world scenarios requiring trade-offs between multiple objectives. Catering to diverse user preferences, traditional reinforcement learning faces amplified challenges in MORL. To address the difficulty of training policies from scratch in MORL, we introduce demonstration-guided multi-objective rein… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  6. arXiv:2402.07182  [pdf, other

    cs.LG

    Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning

    Authors: Willem Röpke, Mathieu Reymond, Patrick Mannion, Diederik M. Roijers, Ann Nowé, Roxana Rădulescu

    Abstract: An important challenge in multi-objective reinforcement learning is obtaining a Pareto front of policies to attain optimal performance under different preferences. We introduce Iterated Pareto Referent Optimisation (IPRO), which decomposes finding the Pareto front into a sequence of constrained single-objective problems. This enables us to guarantee convergence while providing an upper bound on th… ▽ More

    Submitted 6 February, 2025; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: Accepted at AAMAS 2025

  7. arXiv:2402.02665  [pdf, ps, other

    cs.LG

    Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning

    Authors: Peter Vamplew, Cameron Foale, Conor F. Hayes, Patrick Mannion, Enda Howley, Richard Dazeley, Scott Johnson, Johan Källström, Gabriel Ramos, Roxana Rădulescu, Willem Röpke, Diederik M. Roijers

    Abstract: Research in multi-objective reinforcement learning (MORL) has introduced the utility-based paradigm, which makes use of both environmental rewards and a function that defines the utility derived by the user from those rewards. In this paper we extend this paradigm to the context of single-objective reinforcement learning (RL), and outline multiple potential benefits including the ability to perfor… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted for the Blue Sky Track at AAMAS'24

  8. arXiv:2401.07722  [pdf, other

    cs.AI

    Inferring Preferences from Demonstrations in Multi-Objective Residential Energy Management

    Authors: Junlin Lu, Patrick Mannion, Karl Mason

    Abstract: It is often challenging for a user to articulate their preferences accurately in multi-objective decision-making problems. Demonstration-based preference inference (DemoPI) is a promising approach to mitigate this problem. Understanding the behaviours and values of energy customers is an example of a scenario where preference inference can be used to gain insights into the values of energy custome… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  9. arXiv:2401.07710  [pdf, ps, other

    cs.AI cs.LG eess.SY

    Go-Explore for Residential Energy Management

    Authors: Junlin Lu, Patrick Mannion, Karl Mason

    Abstract: Reinforcement learning is commonly applied in residential energy management, particularly for optimizing energy costs. However, RL agents often face challenges when dealing with deceptive and sparse rewards in the energy control domain, especially with stochastic rewards. In such situations, thorough exploration becomes crucial for learning an optimal policy. Unfortunately, the exploration mechani… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  10. arXiv:2306.11535  [pdf, other

    cs.NE cs.AI

    Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication

    Authors: Adam Callaghan, Karl Mason, Patrick Mannion

    Abstract: Evolutionary Algorithms and Deep Reinforcement Learning have both successfully solved control problems across a variety of domains. Recently, algorithms have been proposed which combine these two methods, aiming to leverage the strengths and mitigate the weaknesses of both approaches. In this paper we introduce a new Evolutionary Reinforcement Learning model which combines a particular family of E… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 9 pages, 4 figures, ALA 2023 workshop

  11. arXiv:2305.05560  [pdf, other

    cs.AI

    Distributional Multi-Objective Decision Making

    Authors: Willem Röpke, Conor F. Hayes, Patrick Mannion, Enda Howley, Ann Nowé, Diederik M. Roijers

    Abstract: For effective decision support in scenarios with conflicting objectives, sets of potentially optimal solutions can be presented to the decision maker. We explore both what policies these sets should contain and how such sets can be computed efficiently. With this in mind, we take a distributional approach and introduce a novel dominance criterion relating return distributions of policies directly.… ▽ More

    Submitted 18 July, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted at IJCAI 2023

  12. arXiv:2304.14115  [pdf, other

    cs.AI

    Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning: A Dynamic Weight-based Approach

    Authors: Junlin Lu, Patrick Mannion, Karl Mason

    Abstract: Many decision-making problems feature multiple objectives. In such problems, it is not always possible to know the preferences of a decision-maker for different objectives. However, it is often possible to observe the behavior of decision-makers. In multi-objective decision-making, preference inference is the process of inferring the preferences of a decision-maker for different objectives. This r… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: This work is accepted by ALA 2023 Adaptive and Learning Agents Workshop at AAMAS, London, UK

  13. arXiv:2211.13032  [pdf, other

    cs.AI cs.LG

    Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning

    Authors: Conor F. Hayes, Mathieu Reymond, Diederik M. Roijers, Enda Howley, Patrick Mannion

    Abstract: In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from a single execution of a policy. In these settings, making decisions based on the average future returns is not suitable. For example, in a medical setting a patient may only have one opportunity to treat their illness. Making decisions using just the expected future returns -- known in r… ▽ More

    Submitted 6 December, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.00966

  14. arXiv:2207.00368  [pdf, other

    cs.AI cs.LG

    Multi-Objective Coordination Graphs for the Expected Scalarised Returns with Generative Flow Models

    Authors: Conor F. Hayes, Timothy Verstraeten, Diederik M. Roijers, Enda Howley, Patrick Mannion

    Abstract: Many real-world problems contain multiple objectives and agents, where a trade-off exists between objectives. Key to solving such problems is to exploit sparse dependency structures that exist between agents. For example, in wind farm control a trade-off exists between maximising power and minimising stress on the systems components. Dependencies between turbines arise due to the wake effect. We m… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  15. arXiv:2204.05027  [pdf, ps, other

    cs.LG cs.AI q-bio.PE

    Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning

    Authors: Mathieu Reymond, Conor F. Hayes, Lander Willem, Roxana Rădulescu, Steven Abrams, Diederik M. Roijers, Enda Howley, Patrick Mannion, Niel Hens, Ann Nowé, Pieter Libin

    Abstract: Infectious disease outbreaks can have a disruptive impact on public health and societal processes. As decision making in the context of epidemic mitigation is hard, reinforcement learning provides a methodology to automatically learn prevention strategies in combination with complex epidemic models. Current research focuses on optimizing policies w.r.t. a single objective, such as the pathogen's a… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

  16. arXiv:2112.15422  [pdf, other

    cs.AI

    Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)

    Authors: Peter Vamplew, Benjamin J. Smith, Johan Kallstrom, Gabriel Ramos, Roxana Radulescu, Diederik M. Roijers, Conor F. Hayes, Fredrik Heintz, Patrick Mannion, Pieter J. K. Libin, Richard Dazeley, Cameron Foale

    Abstract: The recent paper `"Reward is Enough" by Silver, Singh, Precup and Sutton posits that the concept of reward maximisation is sufficient to underpin all intelligence, both natural and artificial. We contest the underlying assumption of Silver et al. that such reward can be scalar-valued. In this paper we explain why scalar rewards are insufficient to account for some aspects of both biological and co… ▽ More

    Submitted 24 November, 2021; originally announced December 2021.

  17. Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making

    Authors: Conor F. Hayes, Timothy Verstraeten, Diederik M. Roijers, Enda Howley, Patrick Mannion

    Abstract: In many real-world scenarios, the utility of a user is derived from the single execution of a policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the returns must be optimised. Various scenarios exist where a user's preferences over objectives (also known as the utility function) are unknown or difficult to specify. In such scenarios, a set of optimal pol… ▽ More

    Submitted 1 July, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  18. A Practical Guide to Multi-Objective Reinforcement Learning and Planning

    Authors: Conor F. Hayes, Roxana Rădulescu, Eugenio Bargiacchi, Johan Källström, Matthew Macfarlane, Mathieu Reymond, Timothy Verstraeten, Luisa M. Zintgraf, Richard Dazeley, Fredrik Heintz, Enda Howley, Athirai A. Irissappane, Patrick Mannion, Ann Nowé, Gabriel Ramos, Marcello Restelli, Peter Vamplew, Diederik M. Roijers

    Abstract: Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes only a single objective, or that multiple objectives can be adequately handled via a simple linear combination. Such approaches may oversimplify the underlying pr… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Journal ref: Auton Agent Multi-Agent Syst 36, 26 (2022)

  19. arXiv:2102.00966  [pdf, other

    cs.LG cs.AI

    Risk Aware and Multi-Objective Decision Making with Distributional Monte Carlo Tree Search

    Authors: Conor F. Hayes, Mathieu Reymond, Diederik M. Roijers, Enda Howley, Patrick Mannion

    Abstract: In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from the single execution of a policy. In these settings, making decisions based on the average future returns is not suitable. For example, in a medical setting a patient may only have one opportunity to treat their illness. When making a decision, just the expected return -- known in reinfo… ▽ More

    Submitted 2 February, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: 8 pages, 4 figures

  20. arXiv:2101.11967  [pdf, other

    cs.MA cs.AI

    Exploring the Impact of Tunable Agents in Sequential Social Dilemmas

    Authors: David O'Callaghan, Patrick Mannion

    Abstract: When developing reinforcement learning agents, the standard approach is to train an agent to converge to a fixed policy that is as close to optimal as possible for a single fixed reward function. If different agent behaviour is required in the future, an agent trained in this way must normally be either fully or partially retrained, wasting valuable time and resources. In this study, we leverage m… ▽ More

    Submitted 28 January, 2021; originally announced January 2021.

  21. arXiv:2011.07290  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games

    Authors: Roxana Rădulescu, Timothy Verstraeten, Yijie Zhang, Patrick Mannion, Diederik M. Roijers, Ann Nowé

    Abstract: Many real-world multi-agent interactions consider multiple distinct criteria, i.e. the payoffs are multi-objective in nature. However, the same multi-objective payoff vector may lead to different utilities for each participant. Therefore, it is essential for an agent to learn about the behaviour of other agents in the system. In this work, we present the first study of the effects of such opponent… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Under review since 14 November 2020

  22. arXiv:2002.00444  [pdf, other

    cs.LG cs.AI cs.RO

    Deep Reinforcement Learning for Autonomous Driving: A Survey

    Authors: B Ravi Kiran, Ibrahim Sobh, Victor Talpaert, Patrick Mannion, Ahmad A. Al Sallab, Senthil Yogamani, Patrick Pérez

    Abstract: With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework now capable of learning complex policies in high dimensional environments. This review summarises deep reinforcement learning (DRL) algorithms and provides a taxonomy of automated driving tasks where (D)RL methods have been employed, while addressing key computat… ▽ More

    Submitted 23 January, 2021; v1 submitted 2 February, 2020; originally announced February 2020.

    Comments: Accepted for publication at IEEE Transactions on Intelligent Transportation Systems

  23. arXiv:2001.08177  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    A utility-based analysis of equilibria in multi-objective normal form games

    Authors: Roxana Rădulescu, Patrick Mannion, Yijie Zhang, Diederik M. Roijers, Ann Nowé

    Abstract: In multi-objective multi-agent systems (MOMAS), agents explicitly consider the possible tradeoffs between conflicting objective functions. We argue that compromises between competing objectives in MOMAS should be analysed on the basis of the utility that these compromises have for the users of a system, where an agent's utility function maps their payoff vectors to scalar utility values. This util… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Under review since 16 January 2020

  24. Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey

    Authors: Roxana Rădulescu, Patrick Mannion, Diederik M. Roijers, Ann Nowé

    Abstract: The majority of multi-agent system (MAS) implementations aim to optimise agents' policies with respect to a single objective, despite the fact that many real-world problem domains are inherently multi-objective in nature. Multi-objective multi-agent systems (MOMAS) explicitly consider the possible trade-offs between conflicting objective functions. We argue that, in MOMAS, such compromises should… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

    Comments: Under review since 15 May 2019

  25. arXiv:1902.03601  [pdf, other

    cs.CV

    Vulnerable road user detection: state-of-the-art and open challenges

    Authors: Patrick Mannion

    Abstract: Correctly identifying vulnerable road users (VRUs), e.g. cyclists and pedestrians, remains one of the most challenging environment perception tasks for autonomous vehicles (AVs). This work surveys the current state-of-the-art in VRU detection, covering topics such as benchmarks and datasets, object detection techniques and relevant machine learning algorithms. The article concludes with a discussi… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

  26. arXiv:1901.01536  [pdf, other

    cs.LG cs.RO stat.ML

    Exploring applications of deep reinforcement learning for real-world autonomous driving systems

    Authors: Victor Talpaert, Ibrahim Sobh, B Ravi Kiran, Patrick Mannion, Senthil Yogamani, Ahmad El-Sallab, Patrick Perez

    Abstract: Deep Reinforcement Learning (DRL) has become increasingly powerful in recent years, with notable achievements such as Deepmind's AlphaGo. It has been successfully deployed in commercial vehicles like Mobileye's path planning system. However, a vast majority of work on DRL is focused on toy examples in controlled synthetic car simulator environments such as TORCS and CARLA. In general, DRL is still… ▽ More

    Submitted 16 January, 2019; v1 submitted 6 January, 2019; originally announced January 2019.

    Comments: Accepted for Oral Presentation at VISAPP 2019