Skip to main content

Showing 1–15 of 15 results for author: Fantacci, C

.
  1. arXiv:2503.20020  [pdf, other

    cs.RO

    Gemini Robotics: Bringing AI into the Physical World

    Authors: Gemini Robotics Team, Saminda Abeyruwan, Joshua Ainslie, Jean-Baptiste Alayrac, Montserrat Gonzalez Arenas, Travis Armstrong, Ashwin Balakrishna, Robert Baruch, Maria Bauza, Michiel Blokzijl, Steven Bohez, Konstantinos Bousmalis, Anthony Brohan, Thomas Buschmann, Arunkumar Byravan, Serkan Cabi, Ken Caluwaerts, Federico Casarini, Oscar Chang, Jose Enrique Chen, Xi Chen, Hao-Tien Lewis Chiang, Krzysztof Choromanski, David D'Ambrosio, Sudeep Dasari , et al. (93 additional authors not shown)

    Abstract: Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. This report introduces a new family of AI models purposefully designed for robotics and built upon the foundation of Gemini 2.0. We present Gemini Robotics, an advanced Vision-Lang… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  2. arXiv:2401.08525  [pdf, other

    cs.AI cs.CV cs.LG cs.RO

    GATS: Gather-Attend-Scatter

    Authors: Konrad Zolna, Serkan Cabi, Yutian Chen, Eric Lau, Claudio Fantacci, Jurgis Pasukonis, Jost Tobias Springenberg, Sergio Gomez Colmenarejo

    Abstract: As the AI community increasingly adopts large-scale models, it is crucial to develop general and flexible tools to integrate them. We introduce Gather-Attend-Scatter (GATS), a novel module that enables seamless combination of pretrained foundation models, both trainable and frozen, into larger multimodal networks. GATS empowers AI systems to process and generate information across multiple modalit… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  3. arXiv:2306.11706  [pdf, other

    cs.RO cs.LG

    RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

    Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz , et al. (14 additional authors not shown)

    Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de… ▽ More

    Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Transactions on Machine Learning Research (12/2023)

  4. arXiv:2306.09800  [pdf, other

    cs.LG cs.RO

    $\pi2\text{vec}$: Policy Representations with Successor Features

    Authors: Gianluca Scarpellini, Ksenia Konyushkova, Claudio Fantacci, Tom Le Paine, Yutian Chen, Misha Denil

    Abstract: This paper describes $\pi2\text{vec}$, a method for representing behaviors of black box policies as feature vectors. The policy representations capture how the statistics of foundation model features change in response to the policy behavior in a task agnostic way, and can be trained from offline data, allowing them to be used in offline policy selection. This work provides a key piece of a recipe… ▽ More

    Submitted 24 January, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted paper at ICLR2024

  5. arXiv:2304.06600  [pdf, other

    cs.LG cs.CV cs.RO

    Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation

    Authors: Mohit Sharma, Claudio Fantacci, Yuxiang Zhou, Skanda Koppula, Nicolas Heess, Jon Scholz, Yusuf Aytar

    Abstract: Recent works have shown that large models pretrained on common visual learning tasks can provide useful representations for a wide range of specialized perception problems, as well as a variety of robotic manipulation tasks. While prior work on robotic manipulation has predominantly used frozen pretrained features, we demonstrate that in robotics this approach can fail to reach optimal performance… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: ICLR'23, Project page see https://sites.google.com/view/robo-adapters/

  6. arXiv:2211.13743  [pdf, other

    cs.LG cs.AI cs.RO

    SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

    Authors: Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller

    Abstract: The ability to effectively reuse prior knowledge is a key requirement when building general and flexible Reinforcement Learning (RL) agents. Skill reuse is one of the most common approaches, but current methods have considerable limitations.For example, fine-tuning an existing policy frequently fails, as the policy can degrade rapidly early in training. In a similar vein, distillation of expert be… ▽ More

    Submitted 11 January, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  7. arXiv:2110.06192  [pdf, other

    cs.RO cs.LG

    Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

    Authors: Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

    Abstract: We study the problem of robotic stacking with objects of complex geometry. We propose a challenging and diverse set of such objects that was carefully designed to require strategies beyond a simple "pick-and-place" solution. Our method is a reinforcement learning (RL) approach combined with vision-based interactive policy distillation and simulation-to-reality transfer. Our learned policies can ef… ▽ More

    Submitted 3 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: CoRL 2021. Video: https://dpmd.ai/robotics-stacking-YT . Blog: https://dpmd.ai/robotics-stacking . Code: https://github.com/deepmind/rgb_stacking

  8. In Situ Translational Hand-Eye Calibration of Laser Profile Sensors using Arbitrary Objects

    Authors: Prajval Kumar Murali, Ines Sorrentino, Angelo Rendiniello, Claudio Fantacci, Enrico Villagrossi, Andrea Polo, Alessandro Ardesi, Marco Maggiali, Lorenzo Natale, Daniele Pucci, Silvio Traversaro

    Abstract: Hand-eye calibration of laser profile sensors is the process of extracting the homogeneous transformation between the laser profile sensor frame and the end-effector frame of a robot in order to express the data extracted by the sensor in the robot's global coordinate system. For laser profile scanners this is a challenging procedure, as they provide data only in two dimensions and state-of-the-ar… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: The first two authors contributed equally to this work. Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2021

    Journal ref: IEEE International Conference on Robotics and Automation, 2021

  9. arXiv:1710.04465  [pdf, other

    cs.RO eess.SY stat.CO

    Markerless visual servoing on unknown objects for humanoid robot platforms

    Authors: Claudio Fantacci, Giulia Vezzani, Ugo Pattacini, Vadim Tikhanoff, Lorenzo Natale

    Abstract: To precisely reach for an object with a humanoid robot, it is of central importance to have good knowledge of both end-effector, object pose and shape. In this work we propose a framework for markerless visual servoing on unknown objects, which is divided in four main parts: I) a least-squares minimization problem is formulated to find the volume of the object graspable by the robot's hand using i… ▽ More

    Submitted 12 October, 2017; originally announced October 2017.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2018

  10. Visual end-effector tracking using a 3D model-aided particle filter for humanoid robot platforms

    Authors: Claudio Fantacci, Ugo Pattacini, Vadim Tikhanoff, Lorenzo Natale

    Abstract: This paper addresses recursive markerless estimation of a robot's end-effector using visual observations from its cameras. The problem is formulated into the Bayesian framework and addressed using Sequential Monte Carlo (SMC) filtering. We use a 3D rendering engine and Computer Aided Design (CAD) schematics of the robot to virtually create images from the robot's camera viewpoints. These images ar… ▽ More

    Submitted 4 August, 2017; v1 submitted 14 March, 2017; originally announced March 2017.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017

  11. arXiv:1602.03945  [pdf, ps, other

    eess.SY

    An Overview of Particle Methods for Random Finite Set Models

    Authors: Branko Ristic, Michael Beard, Claudio Fantacci

    Abstract: This overview paper describes the particle methods developed for the implementation of the a class of Bayes filters formulated using the random finite set formalism. It is primarily intended for the readership already familiar with the particle methods in the context of the standard Bayes filter. The focus in on the Bernoulli particle filter, the probability hypothesis density (PHD) particle filte… ▽ More

    Submitted 11 February, 2016; originally announced February 2016.

    Comments: 50 pages including 6 figures in Information Fusion, 2016

  12. arXiv:1508.04158  [pdf, other

    stat.ME eess.SY

    Distributed multi-object tracking over sensor networks: a random finite set approach

    Authors: Claudio Fantacci

    Abstract: The aim of the present dissertation is to address distributed tracking over a network of heterogeneous and geographically dispersed nodes (or agents) with sensing, communication and processing capabilities. Tracking is carried out in the Bayesian framework and its extension to a distributed context is made possible via an information-theoretic approach to data fusion which exploits consensus algor… ▽ More

    Submitted 12 July, 2015; originally announced August 2015.

    Comments: Ph.D. thesis of Claudio Fantacci, Università di Firenze, Dipartimento di Ingegneria dell'Informazione (DINFO), Florence, Italy Successfully defended on the 5th of March 2015

  13. arXiv:1501.01579  [pdf, other

    eess.SY stat.CO

    Consensus Labeled Random Finite Set Filtering for Distributed Multi-Object Tracking

    Authors: C. Fantacci, B. -N. Vo, B. -T. Vo, G. Battistelli, L. Chisci

    Abstract: This paper addresses distributed multi-object tracking over a network of heterogeneous and geographically dispersed nodes with sensing, communication and processing capabilities. The main contribution is an approach to distributed multi-object estimation based on labeled Random Finite Sets (RFSs) and dynamic Bayesian inference, which enables the development of two novel consensus tracking filters,… ▽ More

    Submitted 9 June, 2016; v1 submitted 7 January, 2015; originally announced January 2015.

  14. The Marginalized $δ$-GLMB Filter

    Authors: C. Fantacci, B. -T. Vo, F. Papi, B. -N. Vo

    Abstract: The multi-target Bayes filter proposed by Mahler is a principled solution to recursive Bayesian tracking based on RFS or FISST. The $δ$-GLMB filter is an exact closed form solution to the multi-target Bayes recursion which yields joint state and label or trajectory estimates in the presence of clutter, missed detections and association uncertainty. Due to presence of explicit data associations in… ▽ More

    Submitted 6 April, 2017; v1 submitted 5 January, 2015; originally announced January 2015.

    Comments: With the consent of the authors, part of this work has been reworded and published in the IEEE Signal Processing Letters, vol. 23, no. 6, pp. 863-867, with the title "Scalable Multisensor Multitarget Tracking Using the Marginalized δ-GLMB Density". DOI: 10.1109/LSP.2016.2557078. IEEEXplore link: http://ieeexplore.ieee.org/abstract/document/7457284/

  15. Generalized Labeled Multi-Bernoulli Approximation of Multi-Object Densities

    Authors: Francesco Papi, Ba-Ngu Vo, Ba-Tuong Vo, Claudio Fantacci, Michael Beard

    Abstract: In multi-object inference, the multi-object probability density captures the uncertainty in the number and the states of the objects as well as the statistical dependence between the objects. Exact computation of the multi-object density is generally intractable and tractable implementations usually require statistical independence assumptions between objects. In this paper we propose a tractable… ▽ More

    Submitted 6 July, 2015; v1 submitted 17 December, 2014; originally announced December 2014.