Copy-Pasting Coherent Depth Regions Improves Contrastive Learning for Urban-Scene Segmentation

Zeng, Liang; Lengyel, Attila; Tömen, Nergis; van Gemert, Jan

arXiv:2211.14074 (cs)

[Submitted on 25 Nov 2022]

Abstract: In this work, we leverage estimated depth to boost self-supervised contrastive learning for segmentation of urban scenes, where unlabeled videos are readily available for training self-supervised depth estimation. We argue that the semantics of a coherent group of pixels in 3D space is self-contained and invariant to the contexts in which they appear. We group coherent, semantically related pixels into coherent depth regions given their estimated depth and use copy-paste to synthetically vary their contexts. In this way, cross-context correspondences are built in contrastive learning and a context-invariant representation is learned. For unsupervised semantic segmentation of urban scenes, our method surpasses the previous state-of-the-art baseline by +7.14% in mIoU on Cityscapes and +6.65% on KITTI. For fine-tuning on Cityscapes and KITTI segmentation, our method is competitive with existing models, yet we do not need to pre-train on ImageNet or COCO, and we are also more computationally efficient. Our code is available on this https URL
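
The copy-paste idea in the abstract can be illustrated with a toy sketch. The abstract does not say how coherent depth regions are actually extracted, so the fixed depth interval used here is only a stand-in assumption, and the helper names depth_region_mask and copy_paste are hypothetical, not taken from the authors' code. Images are small nested lists so the example runs with the Python standard library alone.

```python
def depth_region_mask(depth, lo, hi):
    """Boolean mask of pixels whose estimated depth lies in [lo, hi).

    Assumption: a single depth interval stands in for a 'coherent depth
    region'; the paper's actual grouping may differ.
    """
    return [[lo <= d < hi for d in row] for row in depth]


def copy_paste(source, target, mask):
    """Copy source pixels onto target wherever the mask is True."""
    return [
        [s if m else t for s, t, m in zip(src_row, tgt_row, mask_row)]
        for src_row, tgt_row, mask_row in zip(source, target, mask)
    ]


# Toy 3x4 depth map (arbitrary units): a near object in the top-left corner.
depth = [
    [2.0, 2.1, 9.0, 9.5],
    [2.2, 2.0, 9.1, 9.3],
    [5.0, 5.1, 5.2, 9.0],
]
source = [[1] * 4 for _ in range(3)]  # image the region is copied from
target = [[0] * 4 for _ in range(3)]  # image providing the new context

mask = depth_region_mask(depth, 1.5, 3.0)
pasted = copy_paste(source, target, mask)
```

In a contrastive setup, the pixels under the mask in the source image and in the pasted image would be treated as corresponding positives, since they show the same region in two different contexts.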

Comments:	BMVC 2022 Best Student Paper Award (Honourable Mention)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.14074 [cs.CV]
	(or arXiv:2211.14074v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.14074

Submission history

From: Liang Zeng
[v1] Fri, 25 Nov 2022 12:52:08 UTC (22,195 KB)
