Skip to main content

Showing 1–9 of 9 results for author: Piccinelli, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.16591  [pdf, other

    cs.CV

    UniK3D: Universal Camera Monocular 3D Estimation

    Authors: Luigi Piccinelli, Christos Sakaridis, Mattia Segu, Yung-Hsu Yang, Siyuan Li, Wim Abbeloos, Luc Van Gool

    Abstract: Monocular 3D estimation is crucial for visual perception. However, current methods fall short by relying on oversimplified assumptions, such as pinhole camera models or rectified images. These limitations severely restrict their general applicability, causing poor performance in real-world scenarios with fisheye or panoramic images and resulting in substantial context loss. To address this, we pre… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  2. arXiv:2502.20110  [pdf, other

    cs.CV

    UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

    Authors: Luigi Piccinelli, Christos Sakaridis, Yung-Hsu Yang, Mattia Segu, Siyuan Li, Wim Abbeloos, Luc Van Gool

    Abstract: Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepthV2, capable… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2403.18913

  3. arXiv:2410.01806  [pdf, other

    cs.CV cs.AI

    Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking

    Authors: Mattia Segu, Luigi Piccinelli, Siyuan Li, Yung-Hsu Yang, Bernt Schiele, Luc Van Gool

    Abstract: Multiple object tracking in complex scenarios - such as coordinated dance performances, team sports, or dynamic animal groups - presents unique challenges. In these settings, objects frequently move in coordinated patterns, occlude each other, and exhibit long-term dependencies in their trajectories. However, it remains a key open research question on how to model long-range dependencies within tr… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  4. arXiv:2409.17221  [pdf, other

    cs.CV

    Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs

    Authors: Mattia Segu, Luigi Piccinelli, Siyuan Li, Luc Van Gool, Fisher Yu, Bernt Schiele

    Abstract: The supervision of state-of-the-art multiple object tracking (MOT) methods requires enormous annotation efforts to provide bounding boxes for all frames of all videos, and instance IDs to associate them through time. To this end, we introduce Walker, the first self-supervised tracker that learns from videos with sparse bounding box annotations, and no tracking labels. First, we design a quasi-dens… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: ECCV 2024

  5. arXiv:2409.11235  [pdf, other

    cs.CV

    SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

    Authors: Siyuan Li, Lei Ke, Yung-Hsu Yang, Luigi Piccinelli, Mattia Segù, Martin Danelljan, Luc Van Gool

    Abstract: Open-vocabulary Multiple Object Tracking (MOT) aims to generalize trackers to novel categories not in the training set. Currently, the best-performing methods are mainly based on pure appearance matching. Due to the complexity of motion patterns in the large-vocabulary scenarios and unstable classification of the novel objects, the motion and semantics cues are either ignored or applied based on h… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: ECCV2024

  6. arXiv:2406.04221  [pdf, other

    cs.CV

    Matching Anything by Segmenting Anything

    Authors: Siyuan Li, Lei Ke, Martin Danelljan, Luigi Piccinelli, Mattia Segu, Luc Van Gool, Fisher Yu

    Abstract: The robust association of the same objects across video frames in complex scenes is crucial for many applications, especially Multiple Object Tracking (MOT). Current methods predominantly rely on labeled domain-specific video datasets, which limits the cross-domain generalization of learned similarity embeddings. We propose MASA, a novel method for robust instance association learning, capable of… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Highlight. code at: https://github.com/siyuanliii/masa

  7. arXiv:2403.18913  [pdf, other

    cs.CV

    UniDepth: Universal Monocular Metric Depth Estimation

    Authors: Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu, Siyuan Li, Luc Van Gool, Fisher Yu

    Abstract: Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepth, capable o… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  8. arXiv:2304.06334  [pdf, other

    cs.CV

    iDisc: Internal Discretization for Monocular Depth Estimation

    Authors: Luigi Piccinelli, Christos Sakaridis, Fisher Yu

    Abstract: Monocular depth estimation is fundamental for 3D scene understanding and downstream applications. However, even under the supervised setup, it is still challenging and ill-posed due to the lack of full geometric constraints. Although a scene can consist of millions of pixels, there are fewer high-level patterns. We propose iDisc to learn those patterns with internal discretized representations. Th… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR 2023

  9. arXiv:2302.01287  [pdf, other

    cs.CV cs.AI

    Multi-scale Feature Alignment for Continual Learning of Unlabeled Domains

    Authors: Kevin Thandiackal, Luigi Piccinelli, Pushpak Pati, Orcun Goksel

    Abstract: Methods for unsupervised domain adaptation (UDA) help to improve the performance of deep neural networks on unseen domains without any labeled data. Especially in medical disciplines such as histopathology, this is crucial since large datasets with detailed annotations are scarce. While the majority of existing UDA methods focus on the adaptation from a labeled source to a single unlabeled target… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.