Revisiting Learning-based Video Motion Magnification for Real-time Processing

Ha, Hyunwoo; Hyun-Bin, Oh; Jun-Seong, Kim; Byung-Ki, Kwon; Sung-Bin, Kim; Tran, Linh-Tam; Kim, Ji-Yun; Bae, Sung-Ho; Oh, Tae-Hyun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.01898 (cs)

[Submitted on 4 Mar 2024]

Title:Revisiting Learning-based Video Motion Magnification for Real-time Processing

Authors:Hyunwoo Ha, Oh Hyun-Bin, Kim Jun-Seong, Kwon Byung-Ki, Kim Sung-Bin, Linh-Tam Tran, Ji-Yun Kim, Sung-Ho Bae, Tae-Hyun Oh

View PDF HTML (experimental)

Abstract:Video motion magnification is a technique to capture and amplify subtle motion in a video that is invisible to the naked eye. The deep learning-based prior work successfully demonstrates the modelling of the motion magnification problem with outstanding quality compared to conventional signal processing-based ones. However, it still lags behind real-time performance, which prevents it from being extended to various online applications. In this paper, we investigate an efficient deep learning-based motion magnification model that runs in real time for full-HD resolution videos. Due to the specified network design of the prior art, i.e. inhomogeneous architecture, the direct application of existing neural architecture search methods is complicated. Instead of automatic search, we carefully investigate the architecture module by module for its role and importance in the motion magnification task. Two key findings are 1) Reducing the spatial resolution of the latent motion representation in the decoder provides a good trade-off between computational efficiency and task quality, and 2) surprisingly, only a single linear layer and a single branch in the encoder are sufficient for the motion magnification task. Based on these findings, we introduce a real-time deep learning-based motion magnification model with4.2X fewer FLOPs and is 2.7X faster than the prior art while maintaining comparable quality.

Comments:	19 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2403.01898 [cs.CV]
	(or arXiv:2403.01898v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.01898

Submission history

From: Hyunwoo Ha [view email]
[v1] Mon, 4 Mar 2024 09:57:08 UTC (6,801 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Revisiting Learning-based Video Motion Magnification for Real-time Processing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Revisiting Learning-based Video Motion Magnification for Real-time Processing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators