Skip to main content

Showing 1–2 of 2 results for author: Hesham, S A S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.13552  [pdf, ps, other

    cs.CV

    A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects

    Authors: Guohuan Xie, Syed Ariff Syed Hesham, Wenya Guo, Bing Li, Ming-Ming Cheng, Guolei Sun, Yun Liu

    Abstract: Video Scene Parsing (VSP) has emerged as a cornerstone in computer vision, facilitating the simultaneous segmentation, recognition, and tracking of diverse visual entities in dynamic scenes. In this survey, we present a holistic review of recent advances in VSP, covering a wide array of vision tasks, including Video Semantic Segmentation (VSS), Video Instance Segmentation (VIS), Video Panoptic Seg… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2503.20824  [pdf, other

    eess.IV cs.AI cs.LG

    Exploiting Temporal State Space Sharing for Video Semantic Segmentation

    Authors: Syed Ariff Syed Hesham, Yun Liu, Guolei Sun, Henghui Ding, Jing Yang, Ender Konukoglu, Xue Geng, Xudong Jiang

    Abstract: Video semantic segmentation (VSS) plays a vital role in understanding the temporal evolution of scenes. Traditional methods often segment videos frame-by-frame or in a short temporal window, leading to limited temporal context, redundant computations, and heavy memory requirements. To this end, we introduce a Temporal Video State Space Sharing (TV3S) architecture to leverage Mamba state space mode… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025