ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Danier, Duolikun; Zhang, Fan; Bull, David

doi:10.1109/CVPR52688.2022.00351

arXiv:2111.15483 (cs)

[Submitted on 30 Nov 2021 (v1), last revised 30 Mar 2022 (this version, v2)]

Abstract: Video frame interpolation (VFI) is currently a very active research topic, with applications spanning computer vision, post production and video encoding. VFI can be extremely challenging, particularly in sequences containing large motions, occlusions or dynamic textures, where existing approaches fail to offer perceptually robust interpolation performance. In this context, we present a novel deep learning-based VFI method, ST-MFNet, based on a Spatio-Temporal Multi-Flow architecture. ST-MFNet employs a new multi-scale multi-flow predictor to estimate many-to-one intermediate flows, which are combined with conventional one-to-one optical flows to capture both large and complex motions. In order to enhance interpolation performance for various textures, a 3D CNN is also employed to model the content dynamics over an extended temporal window. Moreover, ST-MFNet has been trained within an ST-GAN framework, which was originally developed for texture synthesis, with the aim of further improving perceptual interpolation quality. Our approach has been comprehensively evaluated -- compared with fourteen state-of-the-art VFI algorithms -- clearly demonstrating that ST-MFNet consistently outperforms these benchmarks on varied and representative test datasets, with significant gains of up to 1.09 dB in PSNR for cases including large motions and dynamic textures. Project page: this https URL.

Comments:	Accepted in CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2111.15483 [cs.CV]
	(or arXiv:2111.15483v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2111.15483
Related DOI:	https://doi.org/10.1109/CVPR52688.2022.00351

Submission history

From: Duolikun Danier
[v1] Tue, 30 Nov 2021 15:18:46 UTC (807 KB)
[v2] Wed, 30 Mar 2022 10:24:27 UTC (866 KB)
