Skip to main content

Showing 1–1 of 1 results for author: Bosetti, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16412  [pdf, ps, other

    cs.CV

    Text-Enhanced Zero-Shot Action Recognition: A training-free approach

    Authors: Massimo Bosetti, Shibingfeng Zhang, Benedetta Liberatori, Giacomo Zara, Elisa Ricci, Paolo Rota

    Abstract: Vision-language models (VLMs) have demonstrated remarkable performance across various visual tasks, leveraging joint learning of visual and textual representations. While these models excel in zero-shot image tasks, their application to zero-shot video action recognition (ZSVAR) remains challenging due to the dynamic and temporal nature of actions. Existing methods for ZS-VAR typically require ext… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: accepted to ICPR 2024