Skip to main content

Showing 1–8 of 8 results for author: Rizzoli, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.11891  [pdf, other

    cs.CV

    From Open-Vocabulary to Vocabulary-Free Semantic Segmentation

    Authors: Klara Reichard, Giulia Rizzoli, Stefano Gasperini, Lukas Hoyer, Pietro Zanuttigh, Nassir Navab, Federico Tombari

    Abstract: Open-vocabulary semantic segmentation enables models to identify novel object categories beyond their training data. While this flexibility represents a significant advancement, current approaches still rely on manually specified class names as input, creating an inherent bottleneck in real-world applications. This work proposes a Vocabulary-Free Semantic Segmentation pipeline, eliminating the nee… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: Submitted to: Pattern Recognition Letters, Klara Reichard and Giulia Rizzoli equally contributed to this work

  2. arXiv:2407.13363  [pdf, other

    cs.CV

    Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation

    Authors: Chang Liu, Giulia Rizzoli, Pietro Zanuttigh, Fu Li, Yi Niu

    Abstract: Current weakly-supervised incremental learning for semantic segmentation (WILSS) approaches only consider replacing pixel-level annotations with image-level labels, while the training images are still from well-designed datasets. In this work, we argue that widely available web images can also be considered for the learning of new classes. To achieve this, firstly we introduce a strategy to select… ▽ More

    Submitted 3 September, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  3. arXiv:2403.13762  [pdf, other

    cs.CV

    When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather

    Authors: Giulia Rizzoli, Matteo Caligiuri, Donald Shenaj, Francesco Barbato, Pietro Zanuttigh

    Abstract: In Federated Learning (FL), multiple clients collaboratively train a global model without sharing private data. In semantic segmentation, the Federated source Free Domain Adaptation (FFreeDA) setting is of particular interest, where clients undergo unsupervised training after supervised pretraining at the server side. While few recent works address FL for autonomous vehicles, intrinsic real-world… ▽ More

    Submitted 20 September, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: WACV 2025, 10 pages manuscript, 6 pages supplemental material

  4. arXiv:2309.10479  [pdf, other

    cs.CV

    RECALL+: Adversarial Web-based Replay for Continual Learning in Semantic Segmentation

    Authors: Chang Liu, Giulia Rizzoli, Francesco Barbato, Andrea Maracani, Marco Toldo, Umberto Michieli, Yi Niu, Pietro Zanuttigh

    Abstract: Catastrophic forgetting of previous knowledge is a critical issue in continual learning typically handled through various regularization strategies. However, existing methods struggle especially when several incremental steps are performed. In this paper, we extend our previous approach (RECALL) and tackle forgetting by exploiting unsupervised web-crawled data to retrieve examples of old classes f… ▽ More

    Submitted 16 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  5. arXiv:2308.10491  [pdf, other

    cs.CV

    SynDrone -- Multi-modal UAV Dataset for Urban Scenarios

    Authors: Giulia Rizzoli, Francesco Barbato, Matteo Caligiuri, Pietro Zanuttigh

    Abstract: The development of computer vision algorithms for Unmanned Aerial Vehicles (UAVs) imagery heavily relies on the availability of annotated high-resolution aerial data. However, the scarcity of large-scale real datasets with pixel-level annotations poses a significant challenge to researchers as the limited number of images in existing datasets hinders the effectiveness of deep learning models that… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV Workshops, downloadable dataset with CC-BY license, 8 pages, 4 figures, 8 tables

  6. arXiv:2305.14269  [pdf, other

    cs.CV cs.MM

    Source-Free Domain Adaptation for RGB-D Semantic Segmentation with Vision Transformers

    Authors: Giulia Rizzoli, Donald Shenaj, Pietro Zanuttigh

    Abstract: With the increasing availability of depth sensors, multimodal frameworks that combine color information with depth data are gaining interest. However, ground truth data for semantic segmentation is burdensome to provide, thus making domain adaptation a significant research area. Yet most domain adaptation methods are not able to effectively handle multimodal data. Specifically, we address the chal… ▽ More

    Submitted 6 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: WACV 2024, 2nd Workshop on Pretraining (WACVW)

  7. arXiv:2212.10428  [pdf, other

    cs.CV

    HouseCat6D -- A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios

    Authors: HyunJun Jung, Guangyao Zhai, Shun-Cheng Wu, Patrick Ruhkamp, Hannah Schieber, Giulia Rizzoli, Pengyuan Wang, Hongcheng Zhao, Lorenzo Garattoni, Sven Meier, Daniel Roth, Nassir Navab, Benjamin Busam

    Abstract: Estimating 6D object poses is a major challenge in 3D computer vision. Building on successful instance-level approaches, research is shifting towards category-level pose estimation for practical applications. Current category-level datasets, however, fall short in annotation quality and pose variety. Addressing this, we introduce HouseCat6D, a new category-level 6D pose dataset. It features 1) mul… ▽ More

    Submitted 1 December, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  8. arXiv:2211.04188  [pdf, other

    cs.CV

    DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks

    Authors: Francesco Barbato, Giulia Rizzoli, Pietro Zanuttigh

    Abstract: Most approaches for semantic segmentation use only information from color cameras to parse the scenes, yet recent advancements show that using depth data allows to further improve performances. In this work, we focus on transformer-based deep learning architectures, that have achieved state-of-the-art performances on the segmentation task, and we propose to employ depth information by embedding it… ▽ More

    Submitted 27 March, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted at ICASSP 2023