Skip to main content

Showing 1–8 of 8 results for author: Aspandi, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.06872  [pdf, other

    cs.CV cs.AI

    Multi-Modal interpretable automatic video captioning

    Authors: Antoine Hanna-Asaad, Decky Aspandi, Titus Zaharia

    Abstract: Video captioning aims to describe video contents using natural language format that involves understanding and interpreting scenes, actions and events that occurs simultaneously on the view. Current approaches have mainly concentrated on visual cues, often neglecting the rich information available from other important modality of audio information, including their inter-dependencies. In this work,… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

  2. arXiv:2406.12440  [pdf, other

    cs.CV cs.HC

    Deep self-supervised learning with visualisation for automatic gesture recognition

    Authors: Fabien Allemand, Alessio Mazzela, Jun Villette, Decky Aspandi, Titus Zaharia

    Abstract: Gesture is an important mean of non-verbal communication, with visual modality allows human to convey information during interaction, facilitating peoples and human-machine interactions. However, it is considered difficult to automatically recognise gestures. In this work, we explore three different means to recognise hand signs using deep learning: supervised learning based methods, self-supervis… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Student research project with company collaboration

  3. arXiv:2211.08074  [pdf, other

    cs.CV cs.HC

    Predicting Eye Gaze Location on Websites

    Authors: Ciheng Zhang, Decky Aspandi, Steffen Staab

    Abstract: World-wide-web, with the website and webpage as the main interface, facilitates the dissemination of important information. Hence it is crucial to optimize them for better user interaction, which is primarily done by analyzing users' behavior, especially users' eye-gaze locations. However, gathering these data is still considered to be labor and time intensive. In this work, we enable the developm… ▽ More

    Submitted 6 January, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  4. arXiv:2201.03638  [pdf, other

    cs.HC cs.CY

    User Interaction Analysis through Contrasting Websites Experience

    Authors: Decky Aspandi, Sarah Doosdal, Victor Ülger, Lukas Gillich, Steffen Staab

    Abstract: Current advance of internet allows rapid dissemination of information, accelerating the progress on wide spectrum of society. This has been done mainly through the use of website interface with inherent unique human interactions. In this regards the usability analysis becomes a central part to improve the human interactions. However, This analysis has not yet quantitatively been evaluated through… ▽ More

    Submitted 12 January, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: The papers authors list have been updated according to the specific author requests

  5. arXiv:2104.12345  [pdf, other

    cs.CV cs.HC

    Machine Learning-based Lie Detector applied to a Novel Annotated Game Dataset

    Authors: Nuria Rodriguez-Diaz, Decky Aspandi, Federico Sukno, Xavier Binefa

    Abstract: Lie detection is considered a concern for everyone in their day to day life given its impact on human interactions. Thus, people normally pay attention to both what their interlocutors are saying and also to their visual appearances, including faces, to try to find any signs that indicate whether the person is telling the truth or not. While automatic lie detection may help us to understand this l… ▽ More

    Submitted 30 June, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

  6. An Enhanced Adversarial Network with Combined Latent Features for Spatio-Temporal Facial Affect Estimation in the Wild

    Authors: Decky Aspandi, Federico Sukno, Björn Schuller, Xavier Binefa

    Abstract: Affective Computing has recently attracted the attention of the research community, due to its numerous applications in diverse areas. In this context, the emergence of video-based data allows to enrich the widely used spatial features with the inclusion of temporal information. However, such spatio-temporal modelling often results in very high-dimensional feature spaces and large volumes of data,… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Accepted Version on VISAPP 2021

  7. arXiv:2002.00883  [pdf, other

    cs.CV cs.LG

    Adversarial-based neural networks for affect estimations in the wild

    Authors: Decky Aspandi, Adria Mallol-Ragolta, Björn Schuller, Xavier Binefa

    Abstract: There is a growing interest in affective computing research nowadays given its crucial role in bridging humans with computers. This progress has been recently accelerated due to the emergence of bigger data. One recent advance in this field is the use of adversarial learning to improve model learning through augmented samples. However, the use of latent features, which is feasible through adversar… ▽ More

    Submitted 9 February, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: Paper for FG 2020 Affect Challenge https://ibug.doc.ic.ac.uk/resources/fg-2020-competition-affective-behavior-analysis/

  8. arXiv:1912.04711  [pdf, other

    cs.CV cs.HC

    End-to-end facial and physiological model for Affective Computing and applications

    Authors: Joaquim Comas, Decky Aspandi, Xavier Binefa

    Abstract: In recent years, Affective Computing and its applications have become a fast-growing research topic. Furthermore, the rise of Deep Learning has introduced significant improvements in the emotion recognition system compared to classical methods. In this work, we propose a multi-modal emotion recognition model based on deep learning techniques using the combination of peripheral physiological signal… ▽ More

    Submitted 20 January, 2020; v1 submitted 10 December, 2019; originally announced December 2019.