High-Frequency Enhanced Hybrid Neural Representation for Video Compression

Yu, Li; Li, Zhihui; Xiao, Jimin; Gabbouj, Moncef

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.06685 (cs)

[Submitted on 11 Nov 2024 (v1), last revised 30 Apr 2025 (this version, v2)]

Title:High-Frequency Enhanced Hybrid Neural Representation for Video Compression

Authors:Li Yu, Zhihui Li, Jimin Xiao, Moncef Gabbouj

View PDF HTML (experimental)

Abstract:Neural Representations for Videos (NeRV) have simplified the video codec process and achieved swift decoding speeds by encoding video content into a neural network, presenting a promising solution for video compression. However, existing work overlooks the crucial issue that videos reconstructed by these methods lack high-frequency details. To address this problem, this paper introduces a High-Frequency Enhanced Hybrid Neural Representation Network. Our method focuses on leveraging high-frequency information to improve the synthesis of fine details by the network. Specifically, we design a wavelet high-frequency encoder that incorporates Wavelet Frequency Decomposer (WFD) blocks to generate high-frequency feature embeddings. Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder. Finally, with the refined Harmonic decoder block and a Dynamic Weighted Frequency Loss, we further reduce the potential loss of high-frequency information. Experiments on the Bunny and UVG datasets demonstrate that our method outperforms other methods, showing notable improvements in detail preservation and compression performance.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
Cite as:	arXiv:2411.06685 [cs.CV]
	(or arXiv:2411.06685v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.06685

Submission history

From: Zhihui Li [view email]
[v1] Mon, 11 Nov 2024 03:04:46 UTC (3,362 KB)
[v2] Wed, 30 Apr 2025 02:50:26 UTC (3,325 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:High-Frequency Enhanced Hybrid Neural Representation for Video Compression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:High-Frequency Enhanced Hybrid Neural Representation for Video Compression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators