Skip to main content

Showing 1–3 of 3 results for author: Qiuhong, K

.
  1. arXiv:2201.02494  [pdf, other

    cs.CV cs.CL

    Progressive Video Summarization via Multimodal Self-supervised Learning

    Authors: Li Haopeng, Ke Qiuhong, Gong Mingming, Tom Drummond

    Abstract: Modern video summarization methods are based on deep neural networks that require a large amount of annotated data for training. However, existing datasets for video summarization are small-scale, easily leading to over-fitting of the deep models. Considering that the annotation of large-scale datasets is time-consuming, we propose a multimodal self-supervised learning framework to obtain semantic… ▽ More

    Submitted 19 October, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  2. Video Joint Modelling Based on Hierarchical Transformer for Co-summarization

    Authors: Li Haopeng, Ke Qiuhong, Gong Mingming, Zhang Rui

    Abstract: Video summarization aims to automatically generate a summary (storyboard or video skim) of a video, which can facilitate large-scale video retrieval and browsing. Most of the existing methods perform video summarization on individual videos, which neglects the correlations among similar videos. Such correlations, however, are also informative for video understanding and video summarization. To add… ▽ More

    Submitted 29 June, 2022; v1 submitted 26 December, 2021; originally announced December 2021.

  3. arXiv:2112.10057  [pdf, other

    cs.CV

    Precondition and Effect Reasoning for Action Recognition

    Authors: Yoo Hongsang, Li Haopeng, Ke Qiuhong, Liu Liangchen, Zhang Rui

    Abstract: Human action recognition has drawn a lot of attention in the recent years due to the research and application significance. Most existing works on action recognition focus on learning effective spatial-temporal features from videos, but neglect the strong causal relationship among the precondition, action and effect. Such relationships are also crucial to the accuracy of action recognition. In thi… ▽ More

    Submitted 13 November, 2022; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: The paper is under consideration at Computer Vision and Image Understanding