ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos

Zhu, Xilei; Duan, Huiyu; Yang, Liu; Zhu, Yucheng; Min, Xiongkuo; Zhai, Guangtao; Le Callet, Patrick

arXiv:2412.20423 (cs)

[Submitted on 29 Dec 2024]

Abstract: With the rapid development of eXtended Reality (XR), egocentric spatial shooting and display technologies have further enhanced immersion and engagement for users. Assessing the quality of experience (QoE) of egocentric spatial videos is crucial to ensuring a high-quality viewing experience, yet research on this topic is still lacking. In this paper, we use the term embodied experience to highlight this more immersive experience and study a new problem: embodied perceptual quality assessment for egocentric spatial videos. Specifically, we introduce the first Egocentric Spatial Video Quality Assessment Database (ESVQAD), which comprises 600 egocentric spatial videos and their mean opinion scores (MOSs). Furthermore, we propose a novel multi-dimensional binocular feature fusion model, termed ESVQAnet, which integrates binocular spatial, motion, and semantic features to predict perceptual quality. Experimental results demonstrate that ESVQAnet outperforms 16 state-of-the-art VQA models on the embodied perceptual quality assessment task and exhibits strong generalization capability on traditional VQA tasks. The database and code will be released upon publication.

Comments:	7 pages, 3 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2412.20423 [cs.CV]
	(or arXiv:2412.20423v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.20423

Submission history

From: Xilei Zhu
[v1] Sun, 29 Dec 2024 10:13:30 UTC (1,044 KB)
