Showing 1–50 of 96 results for author: Oudeyer, P

  1. Flow-Lenia: Emergent evolutionary dynamics in mass conservative continuous cellular automata

    Authors: Erwan Plantec, Gautier Hamon, Mayalen Etcheverry, Bert Wang-Chak Chan, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: Central to the artificial life endeavour is the creation of artificial systems spontaneously generating properties found in the living world such as autopoiesis, self-replication, evolution and open-endedness. While numerous models and paradigms have been proposed, cellular automata (CA) have taken a very important place in the field notably as they enable the study of phenomena like self-reprod…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: This manuscript has been accepted for publication in the Artificial Life journal (https://direct.mit.edu/artl)

    Journal ref: Artificial Life (2025) 31(2): 228-248

  2. arXiv:2506.06725  [pdf, ps, other]

    cs.AI cs.LG

    WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making

    Authors: Guillaume Levy, Cedric Colas, Pierre-Yves Oudeyer, Thomas Carta, Clement Romac

    Abstract: Large Language Models (LLMs) possess general world knowledge but often struggle to generate precise predictions in structured, domain-specific contexts such as simulations. These limitations arise from their inability to ground their broad, unstructured understanding in specific environments. To address this, we present WorldLLM, a framework that enhances LLM-based world modeling by combining Baye…

    Submitted 7 June, 2025; originally announced June 2025.

  3. arXiv:2505.15998  [pdf, ps, other]

    cs.AI

    Exploring Flow-Lenia Universes with a Curiosity-driven AI Scientist: Discovering Diverse Ecosystem Dynamics

    Authors: Thomas Michel, Marko Cvjetko, Gautier Hamon, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: We present a method for the automated discovery of system-level dynamics in Flow-Lenia, a continuous cellular automaton (CA) with mass conservation and parameter localization, using a curiosity-driven AI scientist. This method aims to uncover processes leading to self-organization of evolutionary and ecosystemic dynamics in CAs. We build on previous work which uses diversity search algorithms in L…

    Submitted 2 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: 10 pages, 10 figures

  4. arXiv:2505.01106  [pdf, other]

    cs.CY

    Investigating Middle School Students Question-Asking and Answer-Evaluation Skills When Using ChatGPT for Science Investigation

    Authors: Rania Abdelghani, Kou Murayama, Celeste Kidd, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Generative AI (GenAI) tools such as ChatGPT allow users, including school students without prior AI expertise, to explore and address a wide range of tasks. Surveys show that most students aged eleven and older already use these tools for school-related activities. However, little is known about how they actually use GenAI and how it impacts their learning. This study addresses this gap by exami…

    Submitted 2 May, 2025; originally announced May 2025.

  5. arXiv:2504.03814  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Recursive Training Loops in LLMs: How training data properties modulate distribution shift in generated data?

    Authors: Grgur Kovač, Jérémy Perez, Rémy Portelas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Large language models (LLMs) are increasingly used in the creation of online content, creating feedback loops as subsequent generations of models will be trained on this synthetic data. Such loops were shown to lead to distribution shifts - models misrepresenting the true underlying distributions of human data (also called model collapse). However, how human data properties affect such shifts rema…

    Submitted 2 July, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    MSC Class: 68T50 ACM Class: I.2.7

  6. arXiv:2502.07709  [pdf, ps, other]

    cs.AI

    MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

    Authors: Loris Gaven, Thomas Carta, Clément Romac, Cédric Colas, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Open-ended learning agents must efficiently prioritize goals in vast possibility spaces, focusing on those that maximize learning progress (LP). When such autotelic exploration is achieved by LLM agents trained with online RL in high-dimensional and evolving goal spaces, a key challenge for LP prediction is modeling one's own competence, a form of metacognitive monitoring. Traditional approaches e…

    Submitted 17 June, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  7. arXiv:2411.03495  [pdf, other]

    cs.CL cs.AI

    Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology

    Authors: Junior Cedric Tonga, Benjamin Clement, Pierre-Yves Oudeyer

    Abstract: The automatic generation of hints by Large Language Models (LLMs) within Intelligent Tutoring Systems (ITSs) has shown potential to enhance student learning. However, generating pedagogically sound hints that address student misconceptions and adhere to specific educational objectives remains challenging. This work explores using LLMs (GPT-4o and Llama-3-8B-instruct) as teachers to generate effect…

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: Accepted at NeurIPS 2024 Workshop on Large Foundation Models for Educational Assessment (FM-Assess)

  8. arXiv:2410.19920  [pdf, other]

    cs.LG

    Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

    Authors: Mohamed Salim Aissi, Clement Romac, Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, Olivier Sigaud, Laure Soulier, Nicolas Thome

    Abstract: Reinforcement learning (RL) is a promising approach for aligning the knowledge of large language models (LLMs) with sequential decision-making tasks. However, few studies have thoroughly investigated the impact on LLM agents' capabilities of fine-tuning them with RL in a specific environment. In this paper, we propose a novel framework to analyze the sensitivity of LLMs to prompt formulations following RL…

    Submitted 29 October, 2024; v1 submitted 25 October, 2024; originally announced October 2024.

  9. arXiv:2410.12481  [pdf, other]

    cs.LG cs.AI

    SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

    Authors: Loris Gaven, Clement Romac, Thomas Carta, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: The past years have seen Large Language Models (LLMs) strive not only as generative models but also as agents solving textual sequential decision-making tasks. When facing complex environments where their zero-shot abilities are insufficient, recent work showed online Reinforcement Learning (RL) could be used for the LLM agent to discover and learn efficient strategies interactively. However, most…

    Submitted 16 October, 2024; originally announced October 2024.

  10. arXiv:2408.14352  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    LogProber: Disentangling confidence from contamination in LLM responses

    Authors: Nicolas Yax, Pierre-Yves Oudeyer, Stefano Palminteri

    Abstract: In machine learning, contamination refers to situations where testing data leak into the training set. The issue is particularly relevant for the evaluation of the performance of Large Language Models (LLMs), which are generally trained on gargantuan, and generally opaque, corpora of text scraped from the world wide web. Developing tools to detect contamination is therefore crucial to be able to f…

    Submitted 20 June, 2025; v1 submitted 26 August, 2024; originally announced August 2024.

  11. arXiv:2407.05377  [pdf, other]

    cs.AI

    Collective Innovation in Groups of Large Language Models

    Authors: Eleni Nisioti, Sebastian Risi, Ida Momennejad, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: Human culture relies on collective innovation: our ability to continuously explore how existing elements in our environment can be combined to create new ones. Language is hypothesized to play a key role in human culture, driving individual cognitive capacities and shaping communication. Yet the majority of models of collective innovation assign no cognitive capacities or language abilities to age…

    Submitted 7 July, 2024; originally announced July 2024.

  12. arXiv:2407.04503  [pdf, ps, other]

    physics.soc-ph cs.AI cs.MA

    When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings

    Authors: Jérémy Perez, Grgur Kovač, Corentin Léger, Cédric Colas, Gaia Molinaro, Maxime Derex, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: As large language models (LLMs) start interacting with each other and generating an increasing amount of text online, it becomes crucial to better understand how information is transformed as it passes from one LLM to the next. While significant research has examined individual LLM behaviors, existing studies have largely overlooked the collective behaviors and information distortions arising from…

    Submitted 2 June, 2025; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Code available at https://github.com/jeremyperez2/TelephoneGameLLM. Companion website with a Data Explorer tool at https://sites.google.com/view/telephone-game-llm

    MSC Class: 68T50 ACM Class: I.2.7

  13. arXiv:2404.04671  [pdf, other]

    cs.CL cs.LG q-bio.PE

    PhyloLM : Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks

    Authors: Nicolas Yax, Pierre-Yves Oudeyer, Stefano Palminteri

    Abstract: This paper introduces PhyloLM, a method adapting phylogenetic algorithms to Large Language Models (LLMs) to explore whether and how they relate to each other and to predict their performance characteristics. Our method calculates a phylogenetic distance metric based on the similarity of LLMs' output. The resulting metric is then used to construct dendrograms, which satisfactorily capture known re…

    Submitted 16 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

  14. arXiv:2403.08882  [pdf, other]

    cs.MA cs.AI q-bio.PE

    Cultural evolution in populations of Large Language Models

    Authors: Jérémy Perez, Corentin Léger, Marcela Ovando-Tellez, Chris Foulon, Joan Dussauld, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: Research in cultural evolution aims at providing causal explanations for the change of culture over time. Over the past decades, this field has generated an important body of knowledge, using experimental, historical, and computational methods. While computational models have been very successful at generating testable hypotheses about the effects of several factors, such as population structure o…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 17 pages, 20 figures. Open-source code available at https://github.com/jeremyperez2/LLM-Culture

    MSC Class: 68T50 ACM Class: I.2.7

  15. Interactive environments for training children's curiosity through the practice of metacognitive skills: a pilot study

    Authors: Rania Abdelghani, Edith Law, Chloé Desvaux, Pierre-Yves Oudeyer, Hélène Sauzéon

    Abstract: Curiosity-driven learning has shown significant positive effects on students' learning experiences and outcomes. But despite this importance, reports show that children lack this skill, especially in formal educational settings. To address this challenge, we propose an 8-session workshop that aims to enhance children's curiosity through training a set of specific metacognitive skills we hypothesiz…

    Submitted 13 March, 2024; originally announced March 2024.

  16. Stick to your Role! Stability of Personal Values Expressed in Large Language Models

    Authors: Grgur Kovač, Rémy Portelas, Masataka Sawayama, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: The standard way to study Large Language Models (LLMs) with benchmarks or psychology questionnaires is to provide many different queries from similar minimal contexts (e.g. multiple choice questions). However, due to LLMs' highly context-dependent nature, conclusions from such minimal-context evaluations may say little about the model's behavior in deployment (where it will be exposed t…

    Submitted 28 August, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: The project website and code are available at https://sites.google.com/view/llmvaluestability . Published in PLOS ONE (https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0309114), with a shorter version at CogSci 24 (https://escholarship.org/uc/item/7w4823c6).

    MSC Class: 68T07 ACM Class: I.2.7

    Journal ref: PLOS ONE, August 2024

  17. arXiv:2402.10236  [pdf, other]

    cs.MA cs.AI cs.LG

    Discovering Sensorimotor Agency in Cellular Automata using Diversity Search

    Authors: Gautier Hamon, Mayalen Etcheverry, Bert Wang-Chak Chan, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: The research field of Artificial Life studies how life-like phenomena such as autopoiesis, agency, or self-regulation can self-organize in computer simulations. In cellular automata (CA), a key open question has been whether it is possible to find environment rules that self-organize robust "individuals" from an initial state with no prior existence of things like "bodies", "brain", "perception…

    Submitted 14 February, 2024; originally announced February 2024.

  18. arXiv:2402.01669  [pdf, other]

    cs.CY cs.AI cs.LG

    Improved Performances and Motivation in Intelligent Tutoring Systems: Combining Machine Learning and Learner Choice

    Authors: Benjamin Clément, Hélène Sauzéon, Didier Roy, Pierre-Yves Oudeyer

    Abstract: Large class sizes challenge personalized learning in schools, prompting the use of educational technologies such as intelligent tutoring systems. To address this, we present an AI-driven personalization system, called ZPDES, based on the Learning Progress Hypothesis - modeling curiosity-driven learning - and multi-armed bandit techniques. It sequences exercises that maximize learning progress for…

    Submitted 5 March, 2025; v1 submitted 16 January, 2024; originally announced February 2024.

    ACM Class: I.2.1; I.2.6

  19. arXiv:2312.00455  [pdf]

    cs.AI cs.LG nlin.CG

    Meta-Diversity Search in Complex Systems, A Recipe for Artificial Open-Endedness ?

    Authors: Mayalen Etcheverry, Bert Wang-Chak Chan, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Can we build an artificial system that would be able to generate endless surprises if run "forever" in Minecraft? While there is not a single path toward solving that grand challenge, this article presents what we believe to be some working ingredients for the endless generation of novel increasingly complex artifacts in Minecraft. Our framework for an open-ended system includes two components: a…

    Submitted 1 December, 2023; originally announced December 2023.

  20. Machine Culture

    Authors: Levin Brinkmann, Fabian Baumann, Jean-François Bonnefon, Maxime Derex, Thomas F. Müller, Anne-Marie Nussberger, Agnieszka Czaplicka, Alberto Acerbi, Thomas L. Griffiths, Joseph Henrich, Joel Z. Leibo, Richard McElreath, Pierre-Yves Oudeyer, Jonathan Stray, Iyad Rahwan

    Abstract: The ability of humans to create and disseminate culture is often credited as the single most important factor of our success as a species. In this Perspective, we explore the notion of machine culture, culture mediated or generated by machines. We argue that intelligent machines simultaneously transform the cultural evolutionary processes of variation, transmission, and selection. Recommender algo…

    Submitted 22 November, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

    Journal ref: Nat Hum Behav 7, 1855-1868 (2023)

  21. arXiv:2311.00344  [pdf, other]

    cs.AI

    A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

    Authors: Olivier Sigaud, Gianluca Baldassarre, Cedric Colas, Stephane Doncieux, Richard Duro, Pierre-Yves Oudeyer, Nicolas Perrin-Gilbert, Vieri Giuliano Santucci

    Abstract: A lot of recent machine learning research papers have "open-ended learning" in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what distinguishes open-ended learning from related concepts such as continual learning, lifelong learning or autotelic learning. In this paper, we contribute…

    Submitted 7 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  22. arXiv:2310.10692  [pdf, other]

    cs.LG cs.AI

    ACES: Generating Diverse Programming Puzzles with Autotelic Generative Models

    Authors: Julien Pourcel, Cédric Colas, Gaia Molinaro, Pierre-Yves Oudeyer, Laetitia Teodorescu

    Abstract: The ability to invent novel and interesting problems is a remarkable feature of human intelligence that drives innovation, art, and science. We propose a method that aims to automate this process by harnessing the power of state-of-the-art generative models to produce a diversity of challenging yet solvable problems, here in the context of Python programming puzzles. Inspired by the intrinsically…

    Submitted 29 May, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  23. arXiv:2310.03192  [pdf, other]

    cs.CY

    Generative AI in the Classroom: Can Students Remain Active Learners?

    Authors: Rania Abdelghani, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Generative Artificial Intelligence (GAI) can be seen as a double-edged weapon in education. Indeed, it may provide personalized, interactive and empowering pedagogical sequences that could favor students' intrinsic motivation, active engagement and help them have more control over their learning. But at the same time, other GAI properties such as the lack of uncertainty signalling even in cases of…

    Submitted 10 November, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

  24. arXiv:2307.08452  [pdf, other]

    q-bio.BM cs.LG cs.PL

    SBMLtoODEjax: Efficient Simulation and Optimization of Biological Network Models in JAX

    Authors: Mayalen Etcheverry, Michael Levin, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Advances in bioengineering and biomedicine demand a deep understanding of the dynamic behavior of biological systems, ranging from protein pathways to complex cellular processes. Biological networks like gene regulatory networks and protein pathways are key drivers of embryogenesis and physiological processes. Comprehending their diverse behaviors is essential for tackling diseases, including canc…

    Submitted 29 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  25. arXiv:2307.07871  [pdf, other]

    cs.AI cs.LG

    The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents

    Authors: Grgur Kovač, Rémy Portelas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Developmental psychologists have long established the importance of socio-cognitive abilities in human intelligence. These abilities enable us to enter, participate and benefit from human culture. AI research on social interactive agents mostly concerns the emergence of culture in a multi-agent setting (often without a strong grounding in developmental psychology). We argue that AI research should…

    Submitted 23 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Preprint; see v1 for a shorter version (accepted at the "Workshop on Theory-of-Mind" at ICML 2023). See project website for demo and code: https://sites.google.com/view/socialai-school

    MSC Class: 68T07 ACM Class: I.2.0

  26. arXiv:2307.07870  [pdf, other]

    cs.CL cs.AI cs.LG

    Large Language Models as Superpositions of Cultural Perspectives

    Authors: Grgur Kovač, Masataka Sawayama, Rémy Portelas, Cédric Colas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Large Language Models (LLMs) are often misleadingly recognized as having a personality or a set of values. We argue that an LLM can be seen as a superposition of perspectives with different values and personality traits. LLMs exhibit context-dependent values and personality traits that change based on the induced perspective (as opposed to humans, who tend to have more coherent values and personal…

    Submitted 7 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Preprint

    MSC Class: 68T07 ACM Class: I.2.7

  27. arXiv:2305.12487  [pdf, other]

    cs.AI cs.CL cs.LG

    Augmenting Autotelic Agents with Large Language Models

    Authors: Cédric Colas, Laetitia Teodorescu, Pierre-Yves Oudeyer, Xingdi Yuan, Marc-Alexandre Côté

    Abstract: Humans learn to master open-ended repertoires of skills by imagining and practicing their own goals. This autotelic learning process, literally the pursuit of self-generated (auto) goals (telos), becomes more and more open-ended as the goals become more diverse, abstract and creative. The resulting exploration of the space of possible skills is supported by an inter-individual exploration: goal re…

    Submitted 21 May, 2023; originally announced May 2023.

  28. arXiv:2304.10548  [pdf, other]

    cs.CL cs.AI cs.HC

    Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

    Authors: Ziang Xiao, Xingdi Yuan, Q. Vera Liao, Rania Abdelghani, Pierre-Yves Oudeyer

    Abstract: Qualitative analysis of textual contents unpacks rich and valuable information by assigning labels to the data. However, this process is often labor-intensive, particularly when working with large datasets. While recent AI-based tools demonstrate utility, researchers may not have readily available AI resources and expertise, let alone be challenged by the limited generalizability of those task-spe…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 28th International Conference on Intelligent User Interfaces (IUI '23 Companion), March 27--31, 2023, Sydney, NSW, Australia

  29. arXiv:2302.05244  [pdf, other]

    cs.AI cs.CL cs.LG

    A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld

    Authors: Laetitia Teodorescu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer

    Abstract: Building open-ended agents that can autonomously discover a diversity of behaviours is one of the long-standing goals of artificial intelligence. This challenge can be studied in the framework of autotelic RL agents, i.e. agents that learn by selecting and pursuing their own goals, self-organizing a learning curriculum. Recent work identified language as a key dimension of autotelic learning, in p…

    Submitted 24 February, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: In review at ICML 2023

  30. arXiv:2302.02662  [pdf, other]

    cs.LG

    Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

    Authors: Thomas Carta, Clément Romac, Thomas Wolf, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Recent works successfully leveraged Large Language Models' (LLM) abilities to capture abstract knowledge about the world's physics to solve decision-making problems. Yet, the alignment between LLMs' knowledge and the environment can be wrong and limit functional competence due to lack of grounding. In this paper, we study an approach (named GLAM) to achieve this alignment through functional grounding:…

    Submitted 17 October, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: PMLR 202 (2023):3676-3713

  31. arXiv:2212.07906  [pdf, other]

    cs.NE cs.AI nlin.CG

    Flow-Lenia: Towards open-ended evolution in cellular automata through mass conservation and parameter localization

    Authors: Erwan Plantec, Gautier Hamon, Mayalen Etcheverry, Pierre-Yves Oudeyer, Clément Moulin-Frier, Bert Wang-Chak Chan

    Abstract: The design of complex self-organising systems producing life-like phenomena, such as the open-ended evolution of virtual creatures, is one of the main goals of artificial life. Lenia, a family of cellular automata (CA) generalizing Conway's Game of Life to continuous space, time and states, has attracted a lot of attention because of the wide diversity of self-organizing patterns it can generate.…

    Submitted 24 March, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  32. GPT-3-driven pedagogical agents for training children's curious question-asking skills

    Authors: Rania Abdelghani, Yen-Hsiang Wang, Xingdi Yuan, Tong Wang, Pauline Lucas, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: In order to train children's ability to ask curiosity-driven questions, previous research has explored designing specific exercises relying on providing semantic and linguistic cues to help formulate such questions. But despite showing pedagogical efficiency, this method is still limited as it relies on generating the said cues by hand, which can be a very costly process. In this context, we propo…

    Submitted 30 May, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  33. arXiv:2210.06468  [pdf, other]

    cs.AI cs.CL cs.LG

    Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor Communication

    Authors: Tristan Karch, Yoann Lemesle, Romain Laroche, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: In this paper, we investigate whether artificial agents can develop a shared language in an ecological setting where communication relies on a sensory-motor channel. To this end, we introduce the Graphical Referential Game (GREG) where a speaker must produce a graphical utterance to name a visual referent object while a listener has to select the corresponding object among distractor referents, gi…

    Submitted 14 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  34. arXiv:2209.11000  [pdf, other]

    cs.CL

    Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation

    Authors: Xingdi Yuan, Tong Wang, Yen-Hsiang Wang, Emery Fine, Rania Abdelghani, Pauline Lucas, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Large Language Models (LLMs) have in recent years demonstrated impressive prowess in natural language generation. A common practice to improve generation diversity is to sample multiple outputs from the model. However, there is no simple and robust way of selecting the best output from these stochastic samples. As a case study framed in the context of question generation, we propose two prompt-b…

    Submitted 22 September, 2022; originally announced September 2022.

  35. arXiv:2207.04118  [pdf, other]

    cs.AI

    Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents

    Authors: Laetitia Teodorescu, Eric Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer

    Abstract: In this extended abstract we discuss the opportunities and challenges of studying intrinsically-motivated agents for exploration in textual environments. We argue that there is important synergy between text environments and autonomous agents. We identify key properties of text worlds that make them suitable for exploration by autonomous agents, namely, depth, breadth, progress niches and the ease…

    Submitted 8 July, 2022; originally announced July 2022.

  36. arXiv:2206.09674  [pdf, other]

    cs.CL cs.AI cs.LG

    EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL

    Authors: Thomas Carta, Pierre-Yves Oudeyer, Olivier Sigaud, Sylvain Lamprier

    Abstract: Reinforcement learning (RL) in long horizon and sparse reward tasks is notoriously difficult and requires a lot of training steps. A standard solution to speed up the process is to leverage additional reward signals, shaping it to better guide the learning process. In the context of language-conditioned RL, the abstraction and generalisation properties of the language input provide opportunities f…

    Submitted 13 October, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: 24 pages, 16 figures, 5 tables

  37. arXiv:2206.05060  [pdf, other]

    cs.AI cs.MA cs.SI

    Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS

    Authors: Eleni Nisioti, Mateo Mahaut, Pierre-Yves Oudeyer, Ida Momennejad, Clément Moulin-Frier

    Abstract: Human culture relies on innovation: our ability to continuously explore how existing elements can be combined to create new ones. Innovation is not solitary, it relies on collective search and accumulation. Reinforcement learning (RL) approaches commonly assume that fully-connected groups are best suited for innovation. However, human laboratory and field studies have shown that hierarchical innov…

    Submitted 18 November, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  38. Language and Culture Internalisation for Human-Like Autotelic AI

    Authors: Cédric Colas, Tristan Karch, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Building autonomous agents able to grow open-ended repertoires of skills across their lives is a fundamental goal of artificial intelligence (AI). A promising developmental approach recommends the design of intrinsically motivated agents that learn new skills by generating and pursuing their own goals - autotelic agents. But despite recent progress, existing algorithms still show serious limitatio…

    Submitted 16 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Nature Machine Intelligence 4, 1068-1076 (2022)

  39. arXiv:2205.06111  [pdf, other]

    cs.AI cs.CL

    Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language

    Authors: Iou-Jen Liu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer, Alexander G. Schwing

    Abstract: To solve difficult tasks, humans ask questions to acquire knowledge from external sources. In contrast, classical reinforcement learning agents lack such an ability and often resort to exploratory behavior. This is exacerbated as few present-day environments support querying for knowledge. In order to study how agents can be taught to query external knowledge via language, we first introduce two n…

    Submitted 3 July, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: ICML 2022; Project page: https://ioujenliu.github.io/AFK/

  40. Conversational agents for fostering curiosity-driven learning in children

    Authors: Rania Abdelghani, Pierre-Yves Oudeyer, Edith Law, Catherine de Vulpillières, Hélène Sauzéon

    Abstract: Curiosity is an important factor that favors independent and individualized learning in children. Research suggests that it is also a competence that can be fostered by training specific metacognitive skills and information-searching behaviors. In this light, we develop a conversational agent that helps children generate curiosity-driven questions, and encourages their use to lead autonomous explo…

    Submitted 12 April, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  41. arXiv:2201.11014  [pdf, other]

    cs.CV cs.CL cs.LG

    Language-biased image classification: evaluation based on semantic representations

    Authors: Yoann Lemesle, Masataka Sawayama, Guillermo Valle-Perez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Humans show language-biased image recognition for a word-embedded image, known as picture-word interference. Such interference depends on hierarchical semantic categories and reflects that human language processing highly interacts with visual processing. Similar to humans, recent artificial models jointly trained on texts and images, e.g., OpenAI CLIP, show language-biased image classification. E…

    Submitted 12 March, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: Accepted at ICLR 2022

  42. arXiv:2112.07342  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning to Guide and to Be Guided in the Architect-Builder Problem

    Authors: Paul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer

    Abstract: We are interested in interactive agents that learn to coordinate, namely, a $builder$ -- which performs actions but ignores the goal of the task, i.e. has no access to rewards -- and an $architect$ which guides the builder towards the goal of the task. We define and explore a formal setting where artificial agents are equipped with mechanisms that allow them to simultaneously learn a task while at…

    Submitted 11 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: International Conference on Learning Representations (2022)

  43. Identifying Functions and Behaviours of Social Robots during Learning Activities: Teachers' Perspective

    Authors: Jessy Ceha, Edith Law, Dana Kulić, Pierre-Yves Oudeyer, Didier Roy

    Abstract: With advances in artificial intelligence, research is increasingly exploring the potential functions that social robots can play in education. As teachers are a critical stakeholder in the use and application of educational technologies, we conducted a study to understand teachers' perspectives on how a social robot could support a variety of learning activities in the classroom. Through interview…

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: This is a preprint of an article published in The International Journal of Social Robotics. The final authenticated version is available online at: https://doi.org/10.1007/s12369-021-00820-7

  44. arXiv:2107.00956  [pdf, other]

    cs.LG cs.AI cs.CL

    SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents

    Authors: Grgur Kovač, Rémy Portelas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Building embodied autonomous agents capable of participating in social interactions with humans is one of the main challenges in AI. Within the Deep Reinforcement Learning (DRL) field, this objective motivated multiple works on embodied language use. However, current approaches focus on language as a communication tool in very simplified and non-diverse social situations: the "naturalness" of lang…

    Submitted 1 September, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: under review. This paper extends and generalizes work in arXiv:2104.13207

  45. arXiv:2106.14421  [pdf, other]

    cs.LG

    Causal Reinforcement Learning using Observational and Interventional Data

    Authors: Maxime Gasse, Damien Grasset, Guillaume Gaudron, Pierre-Yves Oudeyer

    Abstract: Learning efficiently a causal model of the environment is a key challenge of model-based RL agents operating in POMDPs. We consider here a scenario where the learning agent has the ability to collect online experiences through direct interactions with the environment (interventional data), but has also access to a large collection of offline experiences, obtained by observing another agent interac…

    Submitted 28 June, 2021; originally announced June 2021.

  46. arXiv:2106.13871  [pdf, other]

    cs.SD cs.GR cs.LG eess.AS

    Transflower: probabilistic autoregressive dance generation with multimodal attention

    Authors: Guillermo Valle-Pérez, Gustav Eje Henter, Jonas Beskow, André Holzapfel, Pierre-Yves Oudeyer, Simon Alexanderson

    Abstract: Dance requires skillful composition of complex movements that follow rhythmic, tonal and timbral features of music. Formally, generating dance conditioned on a piece of music can be expressed as a problem of modelling a high-dimensional continuous motion signal, conditioned on an audio signal. In this work we make two contributions to tackle this problem. First, we present a novel probabilistic au…

    Submitted 11 June, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Article presented at SIGGRAPH Asia 2021, and published in ACM Transactions on Graphics

  47. arXiv:2106.08858  [pdf, other]

    cs.AI cs.CL cs.LG

    Grounding Spatio-Temporal Language with Transformers

    Authors: Tristan Karch, Laetitia Teodorescu, Katja Hofmann, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Language is an interface to the outside world. In order for embodied agents to use it, language must be grounded in other, sensorimotor modalities. While there is an extended literature studying how machines can learn grounded language, the topic of how to learn spatio-temporal linguistic concepts is still largely uncharted. To make progress in this direction, we here introduce a novel spatio-temp…

    Submitted 11 October, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Contains main article and supplementaries

    Journal ref: NeurIPS 2021

  48. Towards Teachable Autotelic Agents

    Authors: Olivier Sigaud, Ahmed Akakzia, Hugo Caselles-Dupré, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani

    Abstract: Autonomous discovery and direct instruction are two distinct sources of learning in children but education sciences demonstrate that mixed approaches such as assisted discovery or guided play result in improved skill acquisition. In the field of Artificial Intelligence, these extremes respectively map to autonomous agents learning from their own signals and interactive learning agents fully taught…

    Submitted 20 March, 2023; v1 submitted 25 May, 2021; originally announced May 2021.

    Journal ref: Sigaud, O., Akakzia, A., Caselles-Dupré, H., Colas, C., Oudeyer, P. Y., & Chetouani, M. (2022). Towards Teachable Autotelic Agents. IEEE Transactions on Cognitive and Developmental Systems

  49. arXiv:2104.13207  [pdf, other]

    cs.LG cs.AI

    SocialAI 0.1: Towards a Benchmark to Stimulate Research on Socio-Cognitive Abilities in Deep Reinforcement Learning Agents

    Authors: Grgur Kovač, Rémy Portelas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Building embodied autonomous agents capable of participating in social interactions with humans is one of the main challenges in AI. This problem motivated many research directions on embodied language use. Current approaches focus on language as a communication tool in very simplified and non diverse social situations: the "naturalness" of language is reduced to the concept of high vocabulary siz…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted at NAACL ViGIL Workshop 2021

  50. arXiv:2103.09815  [pdf, other]

    cs.LG

    TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL

    Authors: Clément Romac, Rémy Portelas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Training autonomous agents able to generalize to multiple tasks is a key target of Deep Reinforcement Learning (DRL) research. In parallel to improving DRL algorithms themselves, Automatic Curriculum Learning (ACL) study how teacher algorithms can train DRL agents more efficiently by adapting task selection to their evolving abilities. While multiple standard benchmarks exist to compare DRL agents…

    Submitted 9 June, 2021; v1 submitted 17 March, 2021; originally announced March 2021.