Skip to main content

Showing 1–5 of 5 results for author: Ziaeetabar, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15192  [pdf, ps, other

    cs.CV

    Leveraging Foundation Models for Multimodal Graph-Based Action Recognition

    Authors: Fatemeh Ziaeetabar, Florentin Wörgötter

    Abstract: Foundation models have ushered in a new era for multimodal video understanding by enabling the extraction of rich spatiotemporal and semantic representations. In this work, we introduce a novel graph-based framework that integrates a vision-language foundation, leveraging VideoMAE for dynamic visual encoding and BERT for contextual textual embedding, to address the challenge of recognizing fine-gr… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2311.07285  [pdf, other

    cs.CV

    Multi Sentence Description of Complex Manipulation Action Videos

    Authors: Fatemeh Ziaeetabar, Reza Safabakhsh, Saeedeh Momtazi, Minija Tamosiunaite, Florentin Wörgötter

    Abstract: Automatic video description requires the generation of natural language statements about the actions, events, and objects in the video. An important human trait, when we describe a video, is that we are able to do this with variable levels of detail. Different from this, existing approaches for automatic video descriptions are mostly focused on single sentence generation at a fixed level of detail… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  3. arXiv:2310.00670  [pdf, other

    cs.CV

    A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos

    Authors: Fatemeh Ziaeetabar, Reza Safabakhsh, Saeedeh Momtazi, Minija Tamosiunaite, Florentin Wörgötter

    Abstract: Nuanced understanding and the generation of detailed descriptive content for (bimanual) manipulation actions in videos is important for disciplines such as robotics, human-computer interaction, and video content analysis. This study describes a novel method, integrating graph based modeling with layered hierarchical attention mechanisms, resulting in higher precision and better comprehensiveness o… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  4. arXiv:2004.10518  [pdf, other

    cs.AI cs.CV

    Human and Machine Action Prediction Independent of Object Information

    Authors: Fatemeh Ziaeetabar, Jennifer Pomp, Stefan Pfeiffer, Nadiya El-Sourani, Ricarda I. Schubotz, Minija Tamosiunaite, Florentin Wörgötter

    Abstract: Predicting other people's action is key to successful social interactions, enabling us to adjust our own behavior to the consequence of the others' future actions. Studies on action recognition have focused on the importance of individual visual features of objects involved in an action and its context. Humans, however, recognize actions on unknown objects or even when objects are imagined (pantom… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: This paper includes 31 pages, 11 figures and 1 table

  5. Action Prediction in Humans and Robots

    Authors: Florentin Wörgötter, Fatemeh Ziaeetabar, Stefan Pfeiffer, Osman Kaya, Tomas Kulvicius, Minija Tamosiunaite

    Abstract: Efficient action prediction is of central importance for the fluent workflow between humans and equally so for human-robot interaction. To achieve prediction, actions can be encoded by a series of events, where every event corresponds to a change in a (static or dynamic) relation between some of the objects in a scene. Manipulation actions and others can be uniquely encoded this way and only, on a… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

    Journal ref: Scientific Reports, 2020