Self-supervised Learning for Video Correspondence Flow

Lai, Zihang; Xie, Weidi

Computer Science > Computer Vision and Pattern Recognition

arXiv:1905.00875 (cs)

[Submitted on 2 May 2019 (v1), last revised 27 Jul 2019 (this version, v5)]

Title:Self-supervised Learning for Video Correspondence Flow

Authors:Zihang Lai, Weidi Xie

View PDF

Abstract:The objective of this paper is self-supervised learning of feature embeddings that are suitable for matching correspondences along the videos, which we term correspondence flow. By leveraging the natural spatial-temporal coherence in videos, we propose to train a ``pointer'' that reconstructs a target frame by copying pixels from a reference frame.
We make the following contributions: First, we introduce a simple information bottleneck that forces the model to learn robust features for correspondence matching, and prevent it from learning trivial solutions, \eg matching based on low-level colour information. Second, to tackle the challenges from tracker drifting, due to complex object deformations, illumination changes and occlusions, we propose to train a recursive model over long temporal windows with scheduled sampling and cycle consistency. Third, we achieve state-of-the-art performance on DAVIS 2017 video segmentation and JHMDB keypoint tracking tasks, outperforming all previous self-supervised learning approaches by a significant margin. Fourth, in order to shed light on the potential of self-supervised learning on the task of video correspondence flow, we probe the upper bound by training on additional data, \ie more diverse videos, further demonstrating significant improvements on video segmentation.

Comments:	BMVC2019 (Oral Presentation)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1905.00875 [cs.CV]
	(or arXiv:1905.00875v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.00875

Submission history

From: Weidi Xie [view email]
[v1] Thu, 2 May 2019 17:45:16 UTC (7,307 KB)
[v2] Thu, 9 May 2019 21:55:38 UTC (7,307 KB)
[v3] Sat, 6 Jul 2019 11:43:28 UTC (5,028 KB)
[v4] Sat, 20 Jul 2019 21:59:59 UTC (6,093 KB)
[v5] Sat, 27 Jul 2019 22:59:37 UTC (6,093 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-supervised Learning for Video Correspondence Flow

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-supervised Learning for Video Correspondence Flow

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators