Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

Chen, Yu-Chih; Saha, Avinab; Chapiro, Alexandre; Häne, Christian; Bazin, Jean-Charles; Qiu, Bo; Zanetti, Stefano; Katsavounidis, Ioannis; Bovik, Alan C.

doi:10.1109/tip.2024.3468881

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2408.07041 (eess)

[Submitted on 13 Aug 2024 (v1), last revised 2 Oct 2024 (this version, v2)]

Title:Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

Authors:Yu-Chih Chen, Avinab Saha, Alexandre Chapiro, Christian Häne, Jean-Charles Bazin, Bo Qiu, Stefano Zanetti, Ioannis Katsavounidis, Alan C. Bovik

View PDF HTML (experimental)

Abstract:We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions. We also study the ability of video quality models to predict human judgments. As streaming human avatar videos in VR or AR become increasingly common, the need for more advanced human avatar video compression protocols will be required to address the tradeoffs between faithfully transmitting high-quality visual representations while adjusting to changeable bandwidth scenarios. During transmission over the internet, the perceived quality of compressed human avatar videos can be severely impaired by visual artifacts. To optimize trade-offs between perceptual quality and data volume in practical workflows, video quality assessment (VQA) models are essential tools. However, there are very few VQA algorithms developed specifically to analyze human body avatar videos, due, at least in part, to the dearth of appropriate and comprehensive datasets of adequate size. Towards filling this gap, we introduce the LIVE-Meta Rendered Human Avatar VQA Database, which contains 720 human avatar videos processed using 20 different combinations of encoding parameters, labeled by corresponding human perceptual quality judgments that were collected in six degrees of freedom VR headsets. To demonstrate the usefulness of this new and unique video resource, we use it to study and compare the performances of a variety of state-of-the-art Full Reference and No Reference video quality prediction models, including a new model called HoloQA. As a service to the research community, we publicly releases the metadata of the new database at this https URL.

Comments:	Accepted to IEEE Transactions on Image Processing, 2024
Subjects:	Image and Video Processing (eess.IV)
Cite as:	arXiv:2408.07041 [eess.IV]
	(or arXiv:2408.07041v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2408.07041
Related DOI:	https://doi.org/10.1109/tip.2024.3468881

Submission history

From: Yu-Chih Chen [view email]
[v1] Tue, 13 Aug 2024 17:11:54 UTC (19,206 KB)
[v2] Wed, 2 Oct 2024 21:23:19 UTC (19,206 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators