Showing 1–2 of 2 results for author: Naufil, M

  1. arXiv:2309.06462  [pdf, other]

    cs.CV

    Action Segmentation Using 2D Skeleton Heatmaps and Multi-Modality Fusion

    Authors: Syed Waleed Hyder, Muhammad Usama, Anas Zafar, Muhammad Naufil, Fawad Javed Fateh, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran

    Abstract: This paper presents a 2D skeleton-based action segmentation method with applications in fine-grained human activity recognition. In contrast with state-of-the-art methods which directly take sequences of 3D skeleton coordinates as inputs and apply Graph Convolutional Networks (GCNs) for spatiotemporal feature learning, our main idea is to use sequences of 2D skeleton heatmaps as inputs and employ…

    Submitted 25 April, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024

  2. arXiv:2305.19478  [pdf, other]

    cs.CV

    Permutation-Aware Action Segmentation via Unsupervised Frame-to-Segment Alignment

    Authors: Quoc-Huy Tran, Ahmed Mehmood, Muhammad Ahmed, Muhammad Naufil, Anas Zafar, Andrey Konin, M. Zeeshan Zia

    Abstract: This paper presents an unsupervised transformer-based framework for temporal activity segmentation which leverages not only frame-level cues but also segment-level cues. This is in contrast with previous methods which often rely on frame-level information only. Our approach begins with a frame-level prediction module which estimates framewise action classes via a transformer encoder. The frame-lev…

    Submitted 26 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to WACV 2024