Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant Task-driven Image Coding

Fischer, Kristian; Brand, Fabian; Herglotz, Christian; Kaup, André

doi:10.1109/ICIP46576.2022.9897987

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2301.08533 (eess)

[Submitted on 20 Jan 2023]

Title:Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant Task-driven Image Coding

Authors:Kristian Fischer, Fabian Brand, Christian Herglotz, André Kaup

View PDF

Abstract:Today, visual data is often analyzed by a neural network without any human being involved, which demands for specialized codecs. For standard-compliant codec adaptations towards certain information sinks, HEVC or VVC provide the possibility of frequency-specific quantization with scaling lists. This is a well-known method for the human visual system, where scaling lists are derived from psycho-visual models. In this work, we employ scaling lists when performing VVC intra coding for neural networks as information sink. To this end, we propose a novel data-driven method to obtain optimal scaling lists for arbitrary neural networks. Experiments with Mask R-CNN as information sink reveal that coding the Cityscapes dataset with the proposed scaling lists result in peak bitrate savings of 8.9 % over VVC with constant quantization. By that, our approach also outperforms scaling lists optimized for the human visual system. The generated scaling lists can be found under this https URL.

Comments:	Originally submitted at IEEE ICIP 2022
Subjects:	Image and Video Processing (eess.IV)
ACM classes:	I.4.2
Cite as:	arXiv:2301.08533 [eess.IV]
	(or arXiv:2301.08533v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2301.08533
Journal reference:	ICIP2022
Related DOI:	https://doi.org/10.1109/ICIP46576.2022.9897987

Submission history

From: Kristian Fischer [view email]
[v1] Fri, 20 Jan 2023 12:30:09 UTC (21,788 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant Task-driven Image Coding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant Task-driven Image Coding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators