Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation

Pissas, Theodoros; Ravasio, Claudio S.; Da Cruz, Lyndon; Bergeles, Christos

Computer Science > Computer Vision and Pattern Recognition

arXiv:2203.13409 (cs)

[Submitted on 25 Mar 2022 (v1), last revised 19 Jul 2022 (this version, v2)]

Title:Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation

Authors:Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles

View PDF

Abstract:This work considers supervised contrastive learning for semantic segmentation. We apply contrastive learning to enhance the discriminative power of the multi-scale features extracted by semantic segmentation networks. Our key methodological insight is to leverage samples from the feature spaces emanating from multiple stages of a model's encoder itself requiring neither data augmentation nor online memory banks to obtain a diverse set of samples. To allow for such an extension we introduce an efficient and effective sampling process, that enables applying contrastive losses over the encoder's features at multiple scales. Furthermore, by first mapping the encoder's multi-scale representations to a common feature space, we instantiate a novel form of supervised local-global constraint by introducing cross-scale contrastive learning linking high-resolution local features to low-resolution global features. Combined, our multi-scale and cross-scale contrastive losses boost performance of various models (DeepLabV3, HRNet, OCRNet, UPerNet) with both CNN and Transformer backbones, when evaluated on 4 diverse datasets from natural (Cityscapes, PascalContext, ADE20K) but also surgical (CaDIS) domains. Our code is available at this https URL. datasets from natural (Cityscapes, PascalContext, ADE20K) but also surgical (CaDIS) domains.

Comments:	to appear at ECCV 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.13409 [cs.CV]
	(or arXiv:2203.13409v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.13409

Submission history

From: Theodoros Pissas [view email]
[v1] Fri, 25 Mar 2022 01:24:24 UTC (9,905 KB)
[v2] Tue, 19 Jul 2022 21:51:22 UTC (14,696 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators