Skip to main content

Showing 1–5 of 5 results for author: Papoutsakis, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.17356  [pdf, other

    cs.CV

    A vision-based framework for human behavior understanding in industrial assembly lines

    Authors: Konstantinos Papoutsakis, Nikolaos Bakalos, Konstantinos Fragkoulis, Athena Zacharia, Georgia Kapetadimitri, Maria Pateraki

    Abstract: This paper introduces a vision-based framework for capturing and understanding human behavior in industrial assembly lines, focusing on car door manufacturing. The framework leverages advanced computer vision techniques to estimate workers' locations and 3D poses and analyze work postures, actions, and task progress. A key contribution is the introduction of the CarDA dataset, which contains domai… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  2. arXiv:2405.12789  [pdf, other

    cs.CV

    Anticipating Object State Changes in Long Procedural Videos

    Authors: Victoria Manousaki, Konstantinos Bacharidis, Filippos Gouidis, Konstantinos Papoutsakis, Dimitris Plexousakis, Antonis Argyros

    Abstract: In this work, we introduce (a) the new problem of anticipating object state changes in images and videos during procedural activities, (b) new curated annotation data for object state change classification based on the Ego4D dataset, and (c) the first method for addressing this challenging problem. Solutions to this new task have important implications in vision-based scene understanding, automate… ▽ More

    Submitted 2 December, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2403.12151  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Fusing Domain-Specific Content from Large Language Models into Knowledge Graphs for Enhanced Zero Shot Object State Classification

    Authors: Filippos Gouidis, Katerina Papantoniou, Konstantinos Papoutsakis, Theodore Patkos, Antonis Argyros, Dimitris Plexousakis

    Abstract: Domain-specific knowledge can significantly contribute to addressing a wide variety of vision tasks. However, the generation of such knowledge entails considerable human labor and time costs. This study investigates the potential of Large Language Models (LLMs) in generating and providing domain-specific information through semantic embeddings. To achieve this, an LLM is integrated into a pipeline… ▽ More

    Submitted 11 December, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at the AAAI-MAKE 2024

    Journal ref: Proceedings of the AAAI Spring Symposium, 2024, pages 115-124

  4. Recognizing Unseen States of Unknown Objects by Leveraging Knowledge Graphs

    Authors: Filipos Gouidis, Konstantinos Papoutsakis, Theodore Patkos, Antonis Argyros, Dimitris Plexousakis

    Abstract: We investigate the problem of Object State Classification (OSC) as a zero-shot learning problem. Specifically, we propose the first Object-agnostic State Classification (OaSC) method that infers the state of a certain object without relying on the knowledge or the estimation of the object class. In that direction, we capitalize on Knowledge Graphs (KGs) for structuring and organizing knowledge, wh… ▽ More

    Submitted 16 June, 2025; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: This is the authors' version of the paper published at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025. The definitive version is available at: https://openaccess.thecvf.com/content/WACV2025/html/Gouidis_Recognizing_Unseen_States_of_Unknown_Objects_by_Leveraging_Knowledge_Graphs_WACV_2025_paper.html

    Journal ref: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 8637-8648

  5. arXiv:2209.05194  [pdf, other

    cs.CV

    Graphing the Future: Activity and Next Active Object Prediction using Graph-based Activity Representations

    Authors: Victoria Manousaki, Konstantinos Papoutsakis, Antonis Argyros

    Abstract: We present a novel approach for the visual prediction of human-object interactions in videos. Rather than forecasting the human and object motion or the future hand-object contact points, we aim at predicting (a)the class of the on-going human-object interaction and (b) the class(es) of the next active object(s) (NAOs), i.e., the object(s) that will be involved in the interaction in the near futur… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 13 pages, Conference: In Advances in Visual Computing (ISVC 2022), Springer, San Diego, USA, October 2022