Skip to main content

Showing 1–4 of 4 results for author: Alberti, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.10101  [pdf, other

    cs.CV

    Domain Generalization through Audio-Visual Relative Norm Alignment in First Person Action Recognition

    Authors: Mirco Planamente, Chiara Plizzari, Emanuele Alberti, Barbara Caputo

    Abstract: First person action recognition is becoming an increasingly researched area thanks to the rising popularity of wearable cameras. This is bringing to light cross-domain issues that are yet to be addressed in this context. Indeed, the information extracted from learned representations suffers from an intrinsic "environmental bias". This strongly affects the ability to generalize to unseen scenarios,… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV 2022. arXiv admin note: substantial text overlap with arXiv:2106.01689

  2. arXiv:2107.00337  [pdf, other

    cs.CV

    PoliTO-IIT Submission to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

    Authors: Chiara Plizzari, Mirco Planamente, Emanuele Alberti, Barbara Caputo

    Abstract: In this report, we describe the technical details of our submission to the EPIC-Kitchens-100 Unsupervised Domain Adaptation (UDA) Challenge in Action Recognition. To tackle the domain-shift which exists under the UDA setting, we first exploited a recent Domain Generalization (DG) technique, called Relative Norm Alignment (RNA). It consists in designing a model able to generalize well to any unseen… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 3rd place in the 2021 EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

  3. arXiv:2106.01689  [pdf, other

    cs.CV

    Cross-Domain First Person Audio-Visual Action Recognition through Relative Norm Alignment

    Authors: Mirco Planamente, Chiara Plizzari, Emanuele Alberti, Barbara Caputo

    Abstract: First person action recognition is an increasingly researched topic because of the growing popularity of wearable cameras. This is bringing to light cross-domain issues that are yet to be addressed in this context. Indeed, the information extracted from learned representations suffers from an intrinsic environmental bias. This strongly affects the ability to generalize to unseen scenarios, limitin… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 11 pages, 7 figures

  4. IDDA: a large-scale multi-domain dataset for autonomous driving

    Authors: Emanuele Alberti, Antonio Tavera, Carlo Masone, Barbara Caputo

    Abstract: Semantic segmentation is key in autonomous driving. Using deep visual learning architectures is not trivial in this context, because of the challenges in creating suitable large scale annotated datasets. This issue has been traditionally circumvented through the use of synthetic datasets, that have become a popular resource in this field. They have been released with the need to develop semantic s… ▽ More

    Submitted 22 October, 2021; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted at IROS 2020 and RA-L. Download at: https://idda-dataset.github.io/home/

    Journal ref: IEEE Robotics and Automation Letters, vol. 5, no. 4, pp. 5526-5533, Oct. 2020