Showing 1–8 of 8 results for author: Milano, F

  1. arXiv:2407.12207

    cs.CV cs.RO

    NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models

    Authors: Francesco Milano, Jen Jen Chung, Hermann Blum, Roland Siegwart, Lionel Ott

    Abstract: State-of-the-art approaches for 6D object pose estimation assume the availability of CAD models and require the user to manually set up physically-based rendering (PBR) pipelines for synthetic training data generation. Both factors limit the application of these methods in real-world scenarios. In this work, we present a pipeline that does not require CAD models and allows training a state-of-the-…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted by the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024. 8 pages, 4 figures, 5 tables

  2. arXiv:2311.02734

    cs.CV cs.RO

    ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification

    Authors: Nicolas Gorlo, Kenneth Blomqvist, Francesco Milano, Roland Siegwart

    Abstract: Most object-level mapping systems in use today make use of an upstream learned object instance segmentation model. If we want to teach them about a new object or segmentation class, we need to build a large dataset and retrain the system. To build spatial AI systems that can quickly be taught about new objects, we need to effectively solve the problem of single-shot object detection, instance segm…

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 8 pages, 6 figures, to be published in IEEE WACV 2024

  3. arXiv:2309.05448

    cs.CV cs.AI cs.CL

    Panoptic Vision-Language Feature Fields

    Authors: Haoran Chen, Kenneth Blomqvist, Francesco Milano, Roland Siegwart

    Abstract: Recently, methods have been proposed for 3D open-vocabulary semantic segmentation. Such methods are able to segment scenes into arbitrary classes based on text descriptions provided during runtime. In this paper, we propose to the best of our knowledge the first algorithm for open-vocabulary panoptic segmentation in 3D scenes. Our algorithm, Panoptic Vision-Language Feature Fields (PVLFF), learns…

    Submitted 18 January, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: This work has been accepted by IEEE Robotics and Automation Letters

  4. arXiv:2303.10962

    cs.RO cs.CV

    Neural Implicit Vision-Language Feature Fields

    Authors: Kenneth Blomqvist, Francesco Milano, Jen Jen Chung, Lionel Ott, Roland Siegwart

    Abstract: Recently, groundbreaking results have been presented on open-vocabulary semantic image segmentation. Such methods segment each pixel in an image into arbitrary categories provided at run-time in the form of text prompts, as opposed to a fixed set of classes defined at training time. In this work, we present a zero-shot volumetric open-vocabulary semantic scene segmentation method. Our method build…

    Submitted 20 March, 2023; originally announced March 2023.

  5. arXiv:2211.13969

    cs.CV cs.RO

    Unsupervised Continual Semantic Adaptation through Neural Rendering

    Authors: Zhizheng Liu, Francesco Milano, Jonas Frey, Roland Siegwart, Hermann Blum, Cesar Cadena

    Abstract: An increasing amount of applications rely on data-driven models that are deployed for perception tasks across a sequence of scenes. Due to the mismatch between training and deployment data, adapting the model on the new scenes is often crucial to obtain good performance. In this work, we study continual multi-scene adaptation for the task of semantic segmentation, assuming that no ground-truth lab…

    Submitted 24 March, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted by the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023. Zhizheng Liu and Francesco Milano share first authorship. Hermann Blum and Cesar Cadena share senior authorship. 18 pages, 8 figures, 9 tables

  6. Continual Adaptation of Semantic Segmentation using Complementary 2D-3D Data Representations

    Authors: Jonas Frey, Hermann Blum, Francesco Milano, Roland Siegwart, Cesar Cadena

    Abstract: Semantic segmentation networks are usually pre-trained once and not updated during deployment. As a consequence, misclassifications commonly occur if the distribution of the training data deviates from the one encountered during the robot's operation. We propose to mitigate this problem by adapting the neural network to the robot's environment during deployment, without any need for external super…

    Submitted 20 August, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted for IEEE Robotics and Automation Letters (RA-L 2022)

    Report number: 9874976

    Journal ref: IEEE Robotics and Automation Letters 2022

  7. arXiv:2105.01595

    cs.RO cs.CV

    Self-Improving Semantic Perception for Indoor Localisation

    Authors: Hermann Blum, Francesco Milano, René Zurbrügg, Roland Siegwart, Cesar Cadena, Abel Gawel

    Abstract: We propose a novel robotic system that can improve its perception during deployment. Contrary to the established approach of learning semantics from large datasets and deploying fixed models, we propose a framework in which semantic models are continuously updated on the robot to adapt to the deployment environments. By combining continual learning with self-supervision, our robotic system learns…

    Submitted 15 September, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: A summary video can be accessed at https://youtu.be/awsynhkkFpk

    Journal ref: CoRL 2021 https://openreview.net/forum?id=X2KJq-S11BC

  8. arXiv:2010.12455

    cs.CV cs.CG cs.LG

    Primal-Dual Mesh Convolutional Neural Networks

    Authors: Francesco Milano, Antonio Loquercio, Antoni Rosinol, Davide Scaramuzza, Luca Carlone

    Abstract: Recent works in geometric deep learning have introduced neural networks that allow performing inference tasks on three-dimensional geometric data by defining convolution, and sometimes pooling, operations on triangle meshes. These methods, however, either consider the input mesh as a graph, and do not exploit specific geometric properties of meshes for feature aggregation and downsampling, or are…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted to the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada. Code available at: https://github.com/MIT-SPARK/PD-MeshNet

    Journal ref: 34th Conference on Neural Information Processing Systems (NeurIPS 2020)