Skip to main content

Showing 1–11 of 11 results for author: Gyevnar, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17801  [pdf, ps, other

    cs.AI

    Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour

    Authors: Bálint Gyevnár, Christopher G. Lucas, Stefano V. Albrecht, Shay B. Cohen

    Abstract: Autonomous multi-agent systems (MAS) are useful for automating complex tasks but raise trust concerns due to risks like miscoordination and goal misalignment. Explainability is vital for trust calibration, but explainable reinforcement learning for MAS faces challenges in state/action space complexity, stakeholder needs, and evaluation. Using the counterfactual theory of causation and LLMs' summar… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2502.09288  [pdf, other

    cs.CY

    AI Safety for Everyone

    Authors: Balint Gyevnar, Atoosa Kasirzadeh

    Abstract: Recent discussions and research in AI safety have increasingly emphasized the deep connection between AI safety and existential risk from advanced AI systems, suggesting that work on AI safety necessarily entails serious consideration of potential existential threats. However, this framing has three potential drawbacks: it may exclude researchers and practitioners who are committed to AI safety bu… ▽ More

    Submitted 14 February, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

  3. arXiv:2501.19256  [pdf, other

    cs.AI cs.HC cs.RO

    Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning

    Authors: Balint Gyevnar, Mark Towers

    Abstract: Explanation is a fundamentally human process. Understanding the goal and audience of the explanation is vital, yet existing work on explainable reinforcement learning (XRL) routinely does not consult humans in their evaluations. Even when they do, they routinely resort to subjective metrics, such as confidence or understanding, that can only inform researchers of users' opinions, not their practic… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

  4. arXiv:2403.08828  [pdf, other

    cs.HC cs.AI cs.RO

    People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior: Insights from Cognitive Science for Explainable AI

    Authors: Balint Gyevnar, Stephanie Droop, Tadeg Quillien, Shay B. Cohen, Neil R. Bramley, Christopher G. Lucas, Stefano V. Albrecht

    Abstract: It is often argued that effective human-centered explainable artificial intelligence (XAI) should resemble human reasoning. However, empirical investigations of how concepts from cognitive science can aid the design of XAI are lacking. Based on insights from cognitive science, we propose a framework of explanatory modes to analyze how people frame explanations, whether mechanistic, teleological, o… ▽ More

    Submitted 3 February, 2025; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: CHI 2025

  5. arXiv:2402.10086  [pdf, other

    cs.RO cs.AI cs.CV cs.HC cs.LG

    Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review

    Authors: Anton Kuznietsov, Balint Gyevnar, Cheng Wang, Steven Peters, Stefano V. Albrecht

    Abstract: Artificial Intelligence (AI) shows promising applications for the perception and planning tasks in autonomous driving (AD) due to its superior performance compared to conventional methods. However, inscrutable AI systems exacerbate the existing challenge of safety assurance of AD. One way to mitigate this challenge is to utilize explainable AI (XAI) techniques. To this end, we present the first co… ▽ More

    Submitted 3 July, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  6. arXiv:2302.10809  [pdf, other

    cs.AI cs.RO

    Causal Explanations for Sequential Decision-Making in Multi-Agent Systems

    Authors: Balint Gyevnar, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht

    Abstract: We present CEMA: Causal Explanations in Multi-Agent systems; a framework for creating causal natural language explanations of an agent's decisions in dynamic sequential multi-agent systems to build more trustworthy autonomous agents. Unlike prior work that assumes a fixed causal structure, CEMA only requires a probabilistic model for forward-simulating the state of the system. Using such a model,… ▽ More

    Submitted 14 February, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted in 23rd International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2024

    ACM Class: I.2.9

  7. Bridging the Transparency Gap: What Can Explainable AI Learn From the AI Act?

    Authors: Balint Gyevnar, Nick Ferguson, Burkhard Schafer

    Abstract: The European Union has proposed the Artificial Intelligence Act which introduces detailed requirements of transparency for AI systems. Many of these requirements can be addressed by the field of explainable AI (XAI), however, there is a fundamental difference between XAI and the Act regarding what transparency is. The Act views transparency as a means that supports wider values, such as accountabi… ▽ More

    Submitted 29 July, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted in European Conference on Artificial Intelligence (ECAI) 2023

    ACM Class: I.2.0

  8. arXiv:2208.01769  [pdf, other

    cs.MA cs.AI cs.LG

    Deep Reinforcement Learning for Multi-Agent Interaction

    Authors: Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht

    Abstract: The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning. Towards this goal, the Autonomous Agents Research Group develops novel machine learning algorithms for autonomous systems control, with a specific focus on deep reinforcement learning and multi-agent reinforcement learning.… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Published in AI Communications Special Issue on Multi-Agent Systems Research in the UK

  9. A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning

    Authors: Balint Gyevnar, Massimiliano Tamborski, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht

    Abstract: Inscrutable AI systems are difficult to trust, especially if they operate in safety-critical settings like autonomous driving. Therefore, there is a need to build transparent and queryable systems to increase trust levels. We propose a transparent, human-centric explanation generation method for autonomous vehicle motion planning and prediction based on an existing white-box system called IGP2. Ou… ▽ More

    Submitted 27 June, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: IJCAI Workshop on Artificial Intelligence for Autonomous Driving (AI4AD), 2022

  10. arXiv:2103.06113  [pdf, other

    cs.RO cs.MA

    GRIT: Fast, Interpretable, and Verifiable Goal Recognition with Learned Decision Trees for Autonomous Driving

    Authors: Cillian Brewitt, Balint Gyevnar, Samuel Garcin, Stefano V. Albrecht

    Abstract: It is important for autonomous vehicles to have the ability to infer the goals of other vehicles (goal recognition), in order to safely interact with other vehicles and predict their future trajectories. This is a difficult problem, especially in urban environments with interactions between many vehicles. Goal recognition methods must be fast to run in real time and make accurate inferences. As au… ▽ More

    Submitted 9 August, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  11. arXiv:2002.02277  [pdf, other

    cs.RO

    Interpretable Goal-based Prediction and Planning for Autonomous Driving

    Authors: Stefano V. Albrecht, Cillian Brewitt, John Wilhelm, Balint Gyevnar, Francisco Eiras, Mihai Dobre, Subramanian Ramamoorthy

    Abstract: We propose an integrated prediction and planning system for autonomous driving which uses rational inverse planning to recognise the goals of other vehicles. Goal recognition informs a Monte Carlo Tree Search (MCTS) algorithm to plan optimal maneuvers for the ego vehicle. Inverse planning and MCTS utilise a shared set of defined maneuvers and macro actions to construct plans which are explainable… ▽ More

    Submitted 15 March, 2021; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: IEEE International Conference on Robotics and Automation (ICRA), 2021