Skip to main content

Showing 1–5 of 5 results for author: Rodin, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05787  [pdf, ps, other

    cs.CV

    EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs

    Authors: Ivan Rodin, Tz-Ying Wu, Kyle Min, Sharath Nittur Sridhar, Antonino Furnari, Subarna Tripathi, Giovanni Maria Farinella

    Abstract: We introduce EASG-Bench, a question-answering benchmark for egocentric videos where the question-answering pairs are created from spatio-temporally grounded dynamic scene graphs capturing intricate relationships among actors, actions, and objects. We propose a systematic evaluation framework and evaluate several language-only and video large language models (video-LLMs) on this benchmark. We obser… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2405.02770  [pdf, other

    cs.LG

    PhilHumans: Benchmarking Machine Learning for Personal Health

    Authors: Vadim Liventsev, Vivek Kumar, Allmin Pradhap Singh Susaiyah, Zixiu Wu, Ivan Rodin, Asfand Yaar, Simone Balloccu, Marharyta Beraziuk, Sebastiano Battiato, Giovanni Maria Farinella, Aki Härmä, Rim Helaoui, Milan Petkovic, Diego Reforgiato Recupero, Ehud Reiter, Daniele Riboni, Raymond Sterling

    Abstract: The use of machine learning in Healthcare has the potential to improve patient outcomes as well as broaden the reach and affordability of Healthcare. The history of other application areas indicates that strong benchmarks are essential for the development of intelligent systems. We present Personal Health Interfaces Leveraging HUman-MAchine Natural interactions (PhilHumans), a holistic suite of be… ▽ More

    Submitted 16 May, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

  3. arXiv:2312.03391  [pdf, other

    cs.CV

    Action Scene Graphs for Long-Form Understanding of Egocentric Videos

    Authors: Ivan Rodin, Antonino Furnari, Kyle Min, Subarna Tripathi, Giovanni Maria Farinella

    Abstract: We present Egocentric Action Scene Graphs (EASGs), a new representation for long-form understanding of egocentric videos. EASGs extend standard manually-annotated representations of egocentric videos, such as verb-noun action labels, by providing a temporally evolving graph-based description of the actions performed by the camera wearer, including interacted objects, their relationships, and how a… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  4. arXiv:2202.04132  [pdf, other

    cs.CV

    Untrimmed Action Anticipation

    Authors: Ivan Rodin, Antonino Furnari, Dimitrios Mavroeidis, Giovanni Maria Farinella

    Abstract: Egocentric action anticipation consists in predicting a future action the camera wearer will perform from egocentric video. While the task has recently attracted the attention of the research community, current approaches assume that the input videos are "trimmed", meaning that a short video sequence is sampled a fixed time before the beginning of the action. We argue that, despite the recent adva… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  5. arXiv:2107.13411  [pdf, other

    cs.CV

    Predicting the Future from First Person (Egocentric) Vision: A Survey

    Authors: Ivan Rodin, Antonino Furnari, Dimitrios Mavroedis, Giovanni Maria Farinella

    Abstract: Egocentric videos can bring a lot of information about how humans perceive the world and interact with the environment, which can be beneficial for the analysis of human behaviour. The research in egocentric video analysis is developing rapidly thanks to the increasing availability of wearable devices and the opportunities offered by new large-scale egocentric datasets. As computer vision techniqu… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: Computer Vision and Image Understanding, 2021