Enhancing Blind Video Quality Assessment with Rich Quality-aware Features

Sun, Wei; Wu, Haoning; Zhang, Zicheng; Jia, Jun; Zhang, Zhichao; Cao, Linhan; Chen, Qiubo; Min, Xiongkuo; Lin, Weisi; Zhai, Guangtao


arXiv:2405.08745 (eess)

[Submitted on 14 May 2024]

Abstract: In this paper, we present a simple but effective method to enhance blind video quality assessment (BVQA) models for social media videos. Motivated by previous research that leverages pre-trained features extracted from various computer vision models as the feature representation for BVQA, we further explore rich quality-aware features from pre-trained blind image quality assessment (BIQA) and BVQA models as auxiliary features that help the BVQA model handle the complex distortions and diverse content of social media videos. Specifically, we use SimpleVQA, a BVQA model that consists of a trainable Swin Transformer-B and a fixed SlowFast, as our base model. The Swin Transformer-B and SlowFast components are responsible for extracting spatial and motion features, respectively. Then, we extract three kinds of features from Q-Align, LIQE, and FAST-VQA to capture frame-level quality-aware features, frame-level quality-aware along with scene-specific features, and spatiotemporal quality-aware features, respectively. We concatenate these features and employ a multi-layer perceptron (MLP) network to regress them into quality scores. Experimental results demonstrate that the proposed model achieves the best performance on three public social media VQA datasets. Moreover, the proposed model won first place in the CVPR NTIRE 2024 Short-form UGC Video Quality Assessment Challenge. The code is available at \url{this https URL}.
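The fusion step described in the abstract (concatenating the base-model features with the auxiliary features, then regressing a score with an MLP) can be illustrated with a minimal sketch. This is not the authors' implementation: the feature dimensions, the hidden size, the random weights, and the names `FEATURE_DIMS`, `concat_features`, `init_mlp`, and `predict_quality` are all assumptions made for illustration, and the feature vectors stand in for outputs of the real extractors.

```python
import random

# Hypothetical feature sizes; the abstract does not state the real dimensions.
FEATURE_DIMS = {
    "swin_spatial": 8,     # spatial features from the trainable Swin Transformer-B
    "slowfast_motion": 4,  # motion features from the fixed SlowFast
    "q_align": 6,          # frame-level quality-aware features
    "liqe": 5,             # frame-level quality-aware and scene-specific features
    "fast_vqa": 3,         # spatiotemporal quality-aware features
}


def concat_features(features):
    """Concatenate the per-model feature vectors in a fixed order."""
    fused = []
    for name, dim in FEATURE_DIMS.items():
        vec = features[name]
        if len(vec) != dim:
            raise ValueError(f"{name}: expected {dim} values, got {len(vec)}")
        fused.extend(vec)
    return fused


def linear(x, weights, bias):
    """Dense layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def relu(x):
    return [max(0.0, v) for v in x]


def init_mlp(in_dim, hidden_dim, seed=0):
    """Randomly initialised two-layer MLP (untrained, for shape checking only)."""
    rng = random.Random(seed)

    def matrix(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    return {
        "w1": matrix(hidden_dim, in_dim), "b1": [0.0] * hidden_dim,
        "w2": matrix(1, hidden_dim), "b2": [0.0],
    }


def predict_quality(features, params):
    """Fuse the features and regress them into a single quality score."""
    fused = concat_features(features)
    hidden = relu(linear(fused, params["w1"], params["b1"]))
    return linear(hidden, params["w2"], params["b2"])[0]


# Usage with placeholder feature values.
features = {name: [0.5] * dim for name, dim in FEATURE_DIMS.items()}
params = init_mlp(in_dim=sum(FEATURE_DIMS.values()), hidden_dim=16)
score = predict_quality(features, params)
```

In a real system the weights would be learned against subjective quality labels, and the placeholder vectors would be replaced by the pooled outputs of each pre-trained extractor.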

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2405.08745 [eess.IV]
	(or arXiv:2405.08745v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2405.08745

Submission history

From: Wei Sun
[v1] Tue, 14 May 2024 16:32:11 UTC (7,771 KB)
