Multi-domain semantic segmentation with pyramidal fusion

Bevandić, Petra; Oršić, Marin; Grubišić, Ivan; Šarić, Josip; Šegvić, Siniša

arXiv:2009.01636 (cs)

[Submitted on 2 Sep 2020 (v1), last revised 7 Oct 2021 (this version, v5)]

Abstract: We present our submission to the semantic segmentation contest of the Robust Vision Challenge held at ECCV 2020. The contest requires submitting the same model to seven benchmarks from three different domains. Our approach is based on the SwiftNet architecture with pyramidal fusion. We address inconsistent taxonomies with a single-level 193-dimensional softmax output. We strive to train with large batches in order to stabilize optimization of a hard recognition problem, and to favour smooth evolution of batchnorm statistics. We achieve this by implementing a custom backward step through log-sum-prob loss, and by using small crops before freezing the population statistics. Our model ranks first on the RVC semantic segmentation challenge as well as on the WildDash 2 leaderboard. This suggests that pyramidal fusion is competitive not only for efficient inference with lightweight backbones, but also in large-scale setups for multi-domain application.
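As an illustration only, the log-sum-prob loss mentioned in the abstract can be sketched in NumPy. The abstract does not spell out the formulation, so this sketch assumes that each dataset label corresponds to a set of compatible classes in the shared softmax output (193 classes in the paper) and that the loss is the negative log of the softmax probability summed over that set. The names `log_sum_prob_loss`, `log_sum_prob_grad`, and `class_mask` are hypothetical, and this is not the authors' implementation.

```python
import numpy as np


def _logsumexp(x):
    """Row-wise log(sum(exp(x))) computed in a numerically stable way."""
    m = x.max(axis=1, keepdims=True)
    return m[:, 0] + np.log(np.exp(x - m).sum(axis=1))


def log_sum_prob_loss(logits, class_mask):
    """Mean of -log(sum of softmax probabilities over the allowed classes).

    logits:     (N, C) float array of scores over the shared classes.
    class_mask: (N, C) boolean array; True marks the classes assumed to be
                compatible with each sample's dataset label. Every row must
                contain at least one True entry.
    """
    masked = np.where(class_mask, logits, -np.inf)
    return (_logsumexp(logits) - _logsumexp(masked)).mean()


def log_sum_prob_grad(logits, class_mask):
    """Analytic gradient of the loss above with respect to the logits.

    It equals the full softmax minus the softmax renormalised over the
    allowed classes, divided by the number of samples.
    """
    masked = np.where(class_mask, logits, -np.inf)
    p = np.exp(logits - _logsumexp(logits)[:, None])
    q = np.exp(masked - _logsumexp(masked)[:, None])
    return (p - q) / logits.shape[0]
```

When each row of `class_mask` selects exactly one class, the loss reduces to ordinary cross-entropy. Writing the gradient in closed form, as above, avoids storing intermediate per-class probabilities for automatic differentiation, which is one plausible reason a custom backward step would help with large batches.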

Comments:	2 pages, 2 tables, no figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2009.01636 [cs.CV]
	(or arXiv:2009.01636v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2009.01636

Submission history

From: Marin Oršić
[v1] Wed, 2 Sep 2020 08:37:14 UTC (46 KB)
[v2] Tue, 8 Sep 2020 12:06:23 UTC (46 KB)
[v3] Wed, 16 Sep 2020 07:30:29 UTC (46 KB)
[v4] Wed, 20 Jan 2021 11:29:47 UTC (12,786 KB)
[v5] Thu, 7 Oct 2021 13:55:02 UTC (21,582 KB)
