Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images

Yun, Junno; Akçakaya, Mehmet

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.05341 (cs)

[Submitted on 6 Dec 2024]

Title:Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images

Authors:Junno Yun, Mehmet Akçakaya

View PDF HTML (experimental)

Abstract:Infrared (IR) imaging is commonly used in various scenarios, including autonomous driving, fire safety and defense applications. Thus, semantic segmentation of such images is of great interest. However, this task faces several challenges, including data scarcity, differing contrast and input channel number compared to natural images, and emergence of classes not represented in databases in certain scenarios, such as defense applications. Few-shot segmentation (FSS) provides a framework to overcome these issues by segmenting query images using a few labeled support samples. However, existing FSS models for IR images require paired visible RGB images, which is a major limitation since acquiring such paired data is difficult or impossible in some applications. In this work, we develop new strategies for FSS of IR images by using generative modeling and fusion techniques. To this end, we propose to synthesize auxiliary data to provide additional channel information to complement the limited contrast in the IR images, as well as IR data synthesis for data augmentation. Here, the former helps the FSS model to better capture the relationship between the support and query sets, while the latter addresses the issue of data scarcity. Finally, to further improve the former aspect, we propose a novel fusion ensemble module for integrating the two different modalities. Our methods are evaluated on different IR datasets, and improve upon the state-of-the-art (SOTA) FSS models.

Comments:	Winter Conference on Applications of Computer Vision (WACV), 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2412.05341 [cs.CV]
	(or arXiv:2412.05341v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.05341

Submission history

From: Junno Yun [view email]
[v1] Fri, 6 Dec 2024 05:14:57 UTC (15,329 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators