Skip to main content

Showing 1–20 of 20 results for author: Henaff, M

.
  1. arXiv:2504.14151  [pdf, other

    cs.CV cs.AI cs.RO

    Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

    Authors: Sergio Arnaud, Paul McVay, Ada Martin, Arjun Majumdar, Krishna Murthy Jatavallabhula, Phillip Thomas, Ruslan Partsey, Daniel Dugas, Abha Gejji, Alexander Sax, Vincent-Pierre Berges, Mikael Henaff, Ayush Jain, Ang Cao, Ishita Prasad, Mrinal Kalakrishnan, Michael Rabbat, Nicolas Ballas, Mido Assran, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier

    Abstract: We present LOCATE 3D, a model for localizing objects in 3D scenes from referring expressions like "the small coffee table between the sofa and the lamp." LOCATE 3D sets a new state-of-the-art on standard referential grounding benchmarks and showcases robust generalization capabilities. Notably, LOCATE 3D operates directly on sensor observation streams (posed RGB-D frames), enabling real-world depl… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    ACM Class: I.2.10; I.2.6; I.2.9; I.3.7; I.4.6; I.4.8

  2. arXiv:2501.13928  [pdf, other

    cs.CV cs.AI cs.GR cs.RO

    Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

    Authors: Jianing Yang, Alexander Sax, Kevin J. Liang, Mikael Henaff, Hao Tang, Ang Cao, Joyce Chai, Franziska Meier, Matt Feiszli

    Abstract: Multi-view 3D reconstruction remains a core challenge in computer vision, particularly in applications requiring accurate and scalable representations across diverse perspectives. Current leading methods such as DUSt3R employ a fundamentally pairwise approach, processing images in pairs and necessitating costly global alignment procedures to reconstruct from multiple views. In this work, we propos… ▽ More

    Submitted 19 March, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: CVPR 2025. Project website: https://fast3r-3d.github.io/

  3. arXiv:2412.08542  [pdf, other

    cs.AI cs.CL cs.LG

    MaestroMotif: Skill Design from Artificial Intelligence Feedback

    Authors: Martin Klissarov, Mikael Henaff, Roberta Raileanu, Shagun Sodhani, Pascal Vincent, Amy Zhang, Pierre-Luc Bacon, Doina Precup, Marlos C. Machado, Pierluca D'Oro

    Abstract: Describing skills in natural language has the potential to provide an accessible way to inject human knowledge about decision-making into an AI system. We present MaestroMotif, a method for AI-assisted skill design, which yields high-performing and adaptable agents. MaestroMotif leverages the capabilities of Large Language Models (LLMs) to effectively create and reuse skills. It first uses an LLM'… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  4. arXiv:2410.23022  [pdf, other

    cs.LG cs.AI cs.CL cs.RO

    Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback

    Authors: Qinqing Zheng, Mikael Henaff, Amy Zhang, Aditya Grover, Brandon Amos

    Abstract: Automatically synthesizing dense rewards from natural language descriptions is a promising paradigm in reinforcement learning (RL), with applications to sparse reward problems, open-ended exploration, and hierarchical skill design. Recent works have made promising steps by exploiting the prior knowledge of large language models (LLMs). However, these approaches suffer from important limitations: t… ▽ More

    Submitted 17 December, 2024; v1 submitted 30 October, 2024; originally announced October 2024.

  5. arXiv:2312.03801  [pdf, other

    cs.LG cs.AI

    Generalization to New Sequential Decision Making Tasks with In-Context Learning

    Authors: Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu

    Abstract: Training autonomous agents that can learn new tasks from only a handful of demonstrations is a long-standing problem in machine learning. Recently, transformers have been shown to learn new language or vision tasks without any weight updates from only a few examples, also referred to as in-context learning. However, the sequential decision making setting poses additional challenges having a lower… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  6. arXiv:2310.00166  [pdf, other

    cs.AI cs.LG

    Motif: Intrinsic Motivation from Artificial Intelligence Feedback

    Authors: Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff

    Abstract: Exploring rich environments and evaluating one's actions without prior knowledge is immensely challenging. In this paper, we propose Motif, a general method to interface such prior knowledge from a Large Language Model (LLM) with an agent. Motif is based on the idea of grounding LLMs for decision-making without requiring them to interact with the environment: it elicits preferences from an LLM ove… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: The first two authors equally contributed - order decided by coin flip

  7. arXiv:2306.03236  [pdf, other

    cs.AI

    A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

    Authors: Mikael Henaff, Minqi Jiang, Roberta Raileanu

    Abstract: Exploration in environments which differ across episodes has received increasing attention in recent years. Current methods use some combination of global novelty bonuses, computed using the agent's entire training experience, and \textit{episodic novelty bonuses}, computed using only experience from the current episode. However, the use of these two types of bonuses has been ad-hoc and poorly und… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  8. arXiv:2210.06518  [pdf, other

    cs.LG cs.AI cs.RO

    Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories

    Authors: Qinqing Zheng, Mikael Henaff, Brandon Amos, Aditya Grover

    Abstract: Natural agents can effectively learn from multiple data sources that differ in size, quality, and types of measurements. We study this heterogeneity in the context of offline reinforcement learning (RL) by introducing a new, practically motivated semi-supervised setting. Here, an agent has access to two sets of trajectories: labelled trajectories containing state, action and reward triplets at eve… ▽ More

    Submitted 22 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: ICML 2023. Code: https://github.com/facebookresearch/ssorl/

  9. arXiv:2210.05805  [pdf, other

    cs.LG cs.AI

    Exploration via Elliptical Episodic Bonuses

    Authors: Mikael Henaff, Roberta Raileanu, Minqi Jiang, Tim Rocktäschel

    Abstract: In recent years, a number of reinforcement learning (RL) methods have been proposed to explore complex environments which differ across episodes. In this work, we show that the effectiveness of these methods critically relies on a count-based episodic term in their exploration bonus. As a result, despite their success in relatively simple, noise-free settings, these methods fall short in more real… ▽ More

    Submitted 4 January, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  10. arXiv:2007.08459  [pdf, other

    cs.LG cs.AI stat.ML

    PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning

    Authors: Alekh Agarwal, Mikael Henaff, Sham Kakade, Wen Sun

    Abstract: Direct policy gradient methods for reinforcement learning are a successful approach for a variety of reasons: they are model free, they directly optimize the performance metric of interest, and they allow for richly parameterized policies. Their primary drawback is that, by being local in nature, they fail to adequately explore the environment. In contrast, while model-based approaches and Q-learn… ▽ More

    Submitted 13 August, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

  11. arXiv:1911.05815  [pdf, other

    cs.LG stat.ML

    Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning

    Authors: Dipendra Misra, Mikael Henaff, Akshay Krishnamurthy, John Langford

    Abstract: We present an algorithm, HOMER, for exploration and reinforcement learning in rich observation environments that are summarizable by an unknown latent state space. The algorithm interleaves representation learning to identify a new notion of kinematic state abstraction with strategic exploration to reach new states using the learned abstraction. The algorithm provably explores the environment with… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

  12. arXiv:1911.00617  [pdf, other

    cs.LG cs.AI stat.ML

    Explicit Explore-Exploit Algorithms in Continuous State Spaces

    Authors: Mikael Henaff

    Abstract: We present a new model-based algorithm for reinforcement learning (RL) which consists of explicit exploration and exploitation phases, and is applicable in large or infinite state spaces. The algorithm maintains a set of dynamics models consistent with current experience and explores by finding policies which induce high disagreement between their state predictions. It then exploits using the refi… ▽ More

    Submitted 2 December, 2019; v1 submitted 1 November, 2019; originally announced November 2019.

  13. arXiv:1901.02705  [pdf, other

    cs.LG cs.AI stat.ML

    Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic

    Authors: Mikael Henaff, Alfredo Canziani, Yann LeCun

    Abstract: Learning a policy using only observational data is challenging because the distribution of states it induces at execution time may differ from the distribution observed during training. We propose to train a policy by unrolling a learned model of the environment dynamics over multiple time steps while explicitly penalizing two costs: the original cost the policy seeks to optimize, and an uncertain… ▽ More

    Submitted 7 January, 2019; originally announced January 2019.

  14. arXiv:1711.04994  [pdf, other

    cs.AI

    Prediction Under Uncertainty with Error-Encoding Networks

    Authors: Mikael Henaff, Junbo Zhao, Yann LeCun

    Abstract: In this work we introduce a new framework for performing temporal predictions in the presence of uncertainty. It is based on a simple idea of disentangling components of the future state which are predictable from those which are inherently unpredictable, and encoding the unpredictable components into a low-dimensional latent variable which is fed into a forward model. Our method uses a supervised… ▽ More

    Submitted 30 November, 2017; v1 submitted 14 November, 2017; originally announced November 2017.

  15. arXiv:1705.07177  [pdf, other

    cs.AI

    Model-Based Planning with Discrete and Continuous Actions

    Authors: Mikael Henaff, William F. Whitney, Yann LeCun

    Abstract: Action planning using learned and differentiable forward models of the world is a general approach which has a number of desirable properties, including improved sample complexity over model-free RL methods, reuse of learned models across different tasks, and the ability to perform efficient gradient-based optimization in continuous action spaces. However, this approach does not apply straightforw… ▽ More

    Submitted 4 April, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

  16. arXiv:1612.03969  [pdf, ps, other

    cs.CL

    Tracking the World State with Recurrent Entity Networks

    Authors: Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann LeCun

    Abstract: We introduce a new model, the Recurrent Entity Network (EntNet). It is equipped with a dynamic long-term memory which allows it to maintain and update a representation of the state of the world as it receives new data. For language understanding tasks, it can reason on-the-fly as it reads text, not just when it is required to answer a question or respond as is the case for a Memory Network (Sukhba… ▽ More

    Submitted 10 May, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Journal ref: ICLR 2017

  17. arXiv:1602.06662  [pdf, other

    cs.NE cs.AI cs.LG stat.ML

    Recurrent Orthogonal Networks and Long-Memory Tasks

    Authors: Mikael Henaff, Arthur Szlam, Yann LeCun

    Abstract: Although RNNs have been shown to be powerful tools for processing sequential data, finding architectures or optimization strategies that allow them to model very long term dependencies is still an active area of research. In this work, we carefully analyze two synthetic datasets originally outlined in (Hochreiter and Schmidhuber, 1997) which are used to evaluate the ability of RNNs to store inform… ▽ More

    Submitted 15 March, 2017; v1 submitted 22 February, 2016; originally announced February 2016.

  18. arXiv:1506.05163  [pdf, other

    cs.LG cs.CV cs.NE

    Deep Convolutional Networks on Graph-Structured Data

    Authors: Mikael Henaff, Joan Bruna, Yann LeCun

    Abstract: Deep Learning's recent successes have mostly relied on Convolutional Networks, which exploit fundamental statistical properties of images, sounds and video data: the local stationarity and multi-scale compositional structure, that allows expressing long range interactions in terms of shorter, localized interactions. However, there exist other important examples, such as text documents or bioinform… ▽ More

    Submitted 16 June, 2015; originally announced June 2015.

  19. arXiv:1412.0233  [pdf, other

    cs.LG

    The Loss Surfaces of Multilayer Networks

    Authors: Anna Choromanska, Mikael Henaff, Michael Mathieu, Gérard Ben Arous, Yann LeCun

    Abstract: We study the connection between the highly non-convex loss function of a simple model of the fully-connected feed-forward neural network and the Hamiltonian of the spherical spin-glass model under the assumptions of: i) variable independence, ii) redundancy in network parametrization, and iii) uniformity. These assumptions enable us to explain the complexity of the fully decoupled neural network t… ▽ More

    Submitted 21 January, 2015; v1 submitted 30 November, 2014; originally announced December 2014.

  20. arXiv:1312.5851  [pdf, other

    cs.CV cs.LG cs.NE

    Fast Training of Convolutional Networks through FFTs

    Authors: Michael Mathieu, Mikael Henaff, Yann LeCun

    Abstract: Convolutional networks are one of the most widely employed architectures in computer vision and machine learning. In order to leverage their ability to learn complex functions, large amounts of data are required for training. Training a large convolutional network to produce state-of-the-art results can take weeks, even when using modern GPUs. Producing labels using a trained network can also be c… ▽ More

    Submitted 6 March, 2014; v1 submitted 20 December, 2013; originally announced December 2013.