Showing 1–3 of 3 results for author: Sapienza, D

  1. arXiv:2304.08230  [pdf, other]

    cs.CV cs.AI

    Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose Estimation

    Authors: Elena Govi, Davide Sapienza, Carmelo Scribano, Tobia Poppi, Giorgia Franchini, Paola Ardón, Micaela Verucchi, Marko Bertogna

    Abstract: In recent years, there has been a growing trend of using data-driven methods in industrial settings. These kinds of methods often process video images or parts, therefore the integrity of such images is crucial. Sometimes datasets, e.g. consisting of images, can be sophisticated for various reasons. It becomes critical to understand how the manipulation of video and images can impact the effective…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 17 pages, 10 figures, submitted to EURASIP Journal on Image and Video Processing

    ACM Class: I.2.10; I.4.0

  2. arXiv:2302.06821  [pdf, other]

    cs.RO cs.CV

    Model-Based Underwater 6D Pose Estimation from RGB

    Authors: Davide Sapienza, Elena Govi, Sara Aldhaheri, Marko Bertogna, Eloy Roura, Èric Pairet, Micaela Verucchi, Paola Ardón

    Abstract: Object pose estimation underwater allows an autonomous system to perform tracking and intervention tasks. Nonetheless, underwater target pose estimation is remarkably challenging due to, among many factors, limited visibility, light scattering, cluttered environments, and constantly varying water conditions. An approach is to employ sonar or laser sensing to acquire 3D data, however, the data is n…

    Submitted 15 September, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Under RA-L Submission

  3. arXiv:2106.10153  [pdf, other]

    cs.CV

    All You Can Embed: Natural Language based Vehicle Retrieval with Spatio-Temporal Transformers

    Authors: Carmelo Scribano, Davide Sapienza, Giorgia Franchini, Micaela Verucchi, Marko Bertogna

    Abstract: Combining Natural Language with Vision represents a unique and interesting challenge in the domain of Artificial Intelligence. The AI City Challenge Track 5 for Natural Language-Based Vehicle Retrieval focuses on the problem of combining visual and textual information, applied to a smart-city use case. In this paper, we present All You Can Embed (AYCE), a modular solution to correlate single-vehic…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: CVPR 2021 AI CITY CHALLENGE Natural Language-Based Vehicle Retrieval