Skip to main content

Showing 1–4 of 4 results for author: Methnani, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18346  [pdf, ps, other

    cs.AI

    AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations

    Authors: Adam Dahlgren Lindström, Leila Methnani, Lea Krause, Petter Ericson, Íñigo Martínez de Rituerto de Troya, Dimitri Coelho Mollo, Roel Dobbe

    Abstract: This paper critically evaluates the attempts to align Artificial Intelligence (AI) systems, especially Large Language Models (LLMs), with human values and intentions through Reinforcement Learning from Feedback (RLxF) methods, involving either human feedback (RLHF) or AI feedback (RLAIF). Specifically, we show the shortcomings of the broadly pursued alignment goals of honesty, harmlessness, and he… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 12 pages, 1 table, to be submitted

  2. arXiv:2312.07635  [pdf, other

    cs.AI

    Clash of the Explainers: Argumentation for Context-Appropriate Explanations

    Authors: Leila Methnani, Virginia Dignum, Andreas Theodorou

    Abstract: Understanding when and why to apply any given eXplainable Artificial Intelligence (XAI) technique is not a straightforward task. There is no single approach that is best suited for a given context. This paper aims to address the challenge of selecting the most appropriate explainer given the context in which an explanation is required. For AI explainability to be effective, explanations and how th… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 17 pages, 3 figures, Accepted at XAI^3 Workshop at ECAI 2023

  3. arXiv:2309.12756  [pdf, other

    cs.SE cs.AI

    Towards an MLOps Architecture for XAI in Industrial Applications

    Authors: Leonhard Faubel, Thomas Woudsma, Leila Methnani, Amir Ghorbani Ghezeljhemeidan, Fabian Buelow, Klaus Schmid, Willem D. van Driel, Benjamin Kloepper, Andreas Theodorou, Mohsen Nosratinia, Magnus Bång

    Abstract: Machine learning (ML) has become a popular tool in the industrial sector as it helps to improve operations, increase efficiency, and reduce costs. However, deploying and managing ML models in production environments can be complex. This is where Machine Learning Operations (MLOps) comes in. MLOps aims to streamline this deployment and management process. One of the remaining MLOps challenges is th… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  4. arXiv:2204.10740  [pdf, other

    cs.MA cs.AI

    Embracing AWKWARD! Real-time Adjustment of Reactive Plans Using Social Norms

    Authors: Leila Methnani, Andreas Antoniades, Andreas Theodorou

    Abstract: This paper presents the AWKWARD architecture for the development of hybrid agents in Multi-Agent Systems. AWKWARD agents can have their plans re-configured in real time to align with social role requirements under changing environmental and social circumstances. The proposed hybrid architecture makes use of Behaviour Oriented Design (BOD) to develop agents with reactive planning and of the well-es… ▽ More

    Submitted 21 July, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 18 pages, 2 figures, 3 Tables, 4 Formalisms, Accepted at COINE 2022 Workshop