Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Maheshwari, Harsh; Liu, Yen-Cheng; Kira, Zsolt

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.10756 (cs)

[Submitted on 21 Apr 2023]

Title:Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Authors:Harsh Maheshwari, Yen-Cheng Liu, Zsolt Kira

View PDF

Abstract:Using multiple spatial modalities has been proven helpful in improving semantic segmentation performance. However, there are several real-world challenges that have yet to be addressed: (a) improving label efficiency and (b) enhancing robustness in realistic scenarios where modalities are missing at the test time. To address these challenges, we first propose a simple yet efficient multi-modal fusion mechanism Linear Fusion, that performs better than the state-of-the-art multi-modal models even with limited supervision. Second, we propose M3L: Multi-modal Teacher for Masked Modality Learning, a semi-supervised framework that not only improves the multi-modal performance but also makes the model robust to the realistic missing modality scenario using unlabeled data. We create the first benchmark for semi-supervised multi-modal semantic segmentation and also report the robustness to missing modalities. Our proposal shows an absolute improvement of up to 10% on robust mIoU above the most competitive baselines. Our code is available at this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2304.10756 [cs.CV]
	(or arXiv:2304.10756v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.10756

Submission history

From: Harsh Maheshwari [view email]
[v1] Fri, 21 Apr 2023 05:52:50 UTC (11,906 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators