Skip to main content

Showing 1–11 of 11 results for author: Khamassi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04655  [pdf, other

    cs.CL cs.AI

    Strong and weak alignment of large language models with human values

    Authors: Mehdi Khamassi, Marceau Nahon, Raja Chatila

    Abstract: Minimizing negative impacts of Artificial Intelligent (AI) systems on human societies without human supervision requires them to be able to align with human values. However, most current work only addresses this issue from a technical point of view, e.g., improving current methods relying on reinforcement learning from human feedback, neglecting what it means and is required for alignment to occur… ▽ More

    Submitted 12 August, 2024; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted for publication in Scientific Reports, special issue on AI aligment

  2. arXiv:2403.20177  [pdf

    cs.AI cs.RO q-bio.NC

    Preliminaries to artificial consciousness: a multidimensional heuristic approach

    Authors: K. Evers, M. Farisco, R. Chatila, B. D. Earp, I. T. Freire, F. Hamker, E. Nemeth, P. F. M. J. Verschure, M. Khamassi

    Abstract: The pursuit of artificial consciousness requires conceptual clarity to navigate its theoretical and empirical challenges. This paper introduces a composite, multilevel, and multidimensional model of consciousness as a heuristic framework to guide research in this field. Consciousness is treated as a complex phenomenon, with distinct constituents and dimensions that can be operationalized for study… ▽ More

    Submitted 2 January, 2025; v1 submitted 29 March, 2024; originally announced March 2024.

  3. arXiv:2403.02514  [pdf, other

    cs.RO cs.AI cs.LG

    A Formalisation of the Purpose Framework: the Autonomy-Alignment Problem in Open-Ended Learning Robots

    Authors: Gianluca Baldassarre, Richard J. Duro, Emilio Cartoni, Mehdi Khamassi, Alejandro Romero, Vieri Giuliano Santucci

    Abstract: The unprecedented advancement of artificial intelligence enables the development of increasingly autonomous robots. These robots hold significant potential, particularly in moving beyond engineered factory settings to operate in the unstructured environments inhabited by humans. However, this possibility also generates a relevant autonomy-alignment problem to ensure that robots' autonomous learnin… ▽ More

    Submitted 7 April, 2025; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures

  4. arXiv:2005.06223  [pdf, other

    cs.AI cs.LG cs.NE cs.RO

    DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics

    Authors: Stephane Doncieux, Nicolas Bredeche, Léni Le Goff, Benoît Girard, Alexandre Coninx, Olivier Sigaud, Mehdi Khamassi, Natalia Díaz-Rodríguez, David Filliat, Timothy Hospedales, A. Eiben, Richard Duro

    Abstract: Robots are still limited to controlled conditions, that the robot designer knows with enough details to endow the robot with the appropriate models or behaviors. Learning algorithms add some flexibility with the ability to discover the appropriate behavior given either some demonstrations or a reward to guide its exploration with a reinforcement learning algorithm. Reinforcement learning algorithm… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  5. arXiv:2005.03987  [pdf, other

    cs.RO

    Coping with the variability in humans reward during simulated human-robot interactions through the coordination of multiple learning strategies

    Authors: Rémi Dromnelle, Benoît Girard, Erwan Renaudo, Raja Chatila, Mehdi Khamassi

    Abstract: An important current challenge in Human-Robot Interaction (HRI) is to enable robots to learn on-the-fly from human feedback. However, humans show a great variability in the way they reward robots. We propose to address this issue by enabling the robot to combine different learning strategies, namely model-based (MB) and model-free (MF) reinforcement learning. We simulate two HRI scenarios: a simpl… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 6 pages, 5 figures, written for the RO-MAN 2020 conference. arXiv admin note: text overlap with arXiv:2004.14698

  6. arXiv:2004.14698  [pdf, other

    cs.RO cs.LG

    How to reduce computation time while sparing performance during robot navigation? A neuro-inspired architecture for autonomous shifting between model-based and model-free learning

    Authors: Rémi Dromnelle, Erwan Renaudo, Guillaume Pourcel, Raja Chatila, Benoît Girard, Mehdi Khamassi

    Abstract: Taking inspiration from how the brain coordinates multiple learning systems is an appealing strategy to endow robots with more flexibility. One of the expected advantages would be for robots to autonomously switch to the least costly system when its performance is satisfying. However, to our knowledge no study on a real robot has yet shown that the measured computational cost is reduced while perf… ▽ More

    Submitted 16 July, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: 12 pages, 4 figures ; Living Machines 2020

  7. arXiv:1812.00253  [pdf, other

    cs.RO cs.CV cs.LG

    A Deep Learning Approach for Multi-View Engagement Estimation of Children in a Child-Robot Joint Attention task

    Authors: Jack Hadfield, Georgia Chalvatzaki, Petros Koutras, Mehdi Khamassi, Costas S. Tzafestas, Petros Maragos

    Abstract: In this work we tackle the problem of child engagement estimation while children freely interact with a robot in their room. We propose a deep-based multi-view solution that takes advantage of recent developments in human pose detection. We extract the child's pose from different RGB-D cameras placed elegantly in the room, fuse the results and feed them to a deep neural network trained for classif… ▽ More

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: 7 pages, 6 figures

  8. Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays

    Authors: Lise Aubin, Mehdi Khamassi, Benoît Girard

    Abstract: During sleep and awake rest, the hippocampus replays sequences of place cells that have been activated during prior experiences. These have been interpreted as a memory consolidation process, but recent results suggest a possible interpretation in terms of reinforcement learning. The Dyna reinforcement learning algorithms use off-line replays to improve learning. Under limited replay budget, a pri… ▽ More

    Submitted 13 August, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: Living Machines 2018 (Paris, France)

  9. Adaptive coordination of working-memory and reinforcement learning in non-human primates performing a trial-and-error problem solving task

    Authors: Guillaume Viejo, Benoît Girard, Emmanuel Procyk, Mehdi Khamassi

    Abstract: Accumulating evidence suggest that human behavior in trial-and-error learning tasks based on decisions between discrete actions may involve a combination of reinforcement learning (RL) and working-memory (WM). While the understanding of brain activity at stake in this type of tasks often involve the comparison with non-human primate neurophysiological results, it is not clear whether monkeys use s… ▽ More

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: Behavioural Brain Research, Elsevier, 2017

  10. Sustainable computational science: the ReScience initiative

    Authors: Nicolas P. Rougier, Konrad Hinsen, Frédéric Alexandre, Thomas Arildsen, Lorena Barba, Fabien C. Y. Benureau, C. Titus Brown, Pierre de Buyl, Ozan Caglayan, Andrew P. Davison, Marc André Delsuc, Georgios Detorakis, Alexandra K. Diem, Damien Drix, Pierre Enel, Benoît Girard, Olivia Guest, Matt G. Hall, Rafael Neto Henriques, Xavier Hinaut, Kamil S Jaron, Mehdi Khamassi, Almar Klein, Tiina Manninen, Pietro Marchesi , et al. (20 additional authors not shown)

    Abstract: Computer science offers a large set of tools for prototyping, writing, running, testing, validating, sharing and reproducing results, however computational science lags behind. In the best case, authors may provide their source code as a compressed archive and they may feel confident their research is reproducible. But this is not exactly true. James Buckheit and David Donoho proposed more than tw… ▽ More

    Submitted 11 November, 2017; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: 8 pages, 1 figure

    Journal ref: PeerJ Computer Science 3:e142 (2017)

  11. arXiv:1610.01986  [pdf, other

    cs.LG

    Active exploration in parameterized reinforcement learning

    Authors: Mehdi Khamassi, Costas Tzafestas

    Abstract: Online model-free reinforcement learning (RL) methods with continuous actions are playing a prominent role when dealing with real-world applications such as Robotics. However, when confronted to non-stationary environments, these methods crucially rely on an exploration-exploitation trade-off which is rarely dynamically and automatically adjusted to changes in the environment. Here we propose an a… ▽ More

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: Submitted to EWRL2016