Rethinking Generative Human Video Coding with Implicit Motion Transformation

Chen, Bolin; Liao, Ru-Ling; Chen, Jie; Ye, Yan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2506.10453 (cs)

[Submitted on 12 Jun 2025]

Title:Rethinking Generative Human Video Coding with Implicit Motion Transformation

Authors:Bolin Chen, Ru-Ling Liao, Jie Chen, Yan Ye

View PDF HTML (experimental)

Abstract:Beyond traditional hybrid-based video codec, generative video codec could achieve promising compression performance by evolving high-dimensional signals into compact feature representations for bitstream compactness at the encoder side and developing explicit motion fields as intermediate supervision for high-quality reconstruction at the decoder side. This paradigm has achieved significant success in face video compression. However, compared to facial videos, human body videos pose greater challenges due to their more complex and diverse motion patterns, i.e., when using explicit motion guidance for Generative Human Video Coding (GHVC), the reconstruction results could suffer severe distortions and inaccurate motion. As such, this paper highlights the limitations of explicit motion-based approaches for human body video compression and investigates the GHVC performance improvement with the aid of Implicit Motion Transformation, namely IMT. In particular, we propose to characterize complex human body signal into compact visual features and transform these features into implicit motion guidance for signal reconstruction. Experimental results demonstrate the effectiveness of the proposed IMT paradigm, which can facilitate GHVC to achieve high-efficiency compression and high-fidelity synthesis.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2506.10453 [cs.CV]
	(or arXiv:2506.10453v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2506.10453

Submission history

From: Bolin Chen [view email]
[v1] Thu, 12 Jun 2025 07:58:18 UTC (23,256 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Rethinking Generative Human Video Coding with Implicit Motion Transformation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Rethinking Generative Human Video Coding with Implicit Motion Transformation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators