In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification

Dimitrovski, Ivica; Kitanovski, Ivan; Simidjievski, Nikola; Kocev, Dragi

doi:10.1109/LGRS.2024.3352926


arXiv:2307.01645 (cs)

[Submitted on 4 Jul 2023 (v1), last revised 5 Feb 2024 (this version, v2)]

Title: In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification

Authors: Ivica Dimitrovski, Ivan Kitanovski, Nikola Simidjievski, Dragi Kocev


Abstract: We investigate the utility of in-domain self-supervised pre-training of vision models in the analysis of remote sensing imagery. Self-supervised learning (SSL) has emerged as a promising approach for remote sensing image classification due to its ability to exploit large amounts of unlabeled data. Unlike traditional supervised learning, SSL aims to learn representations of data without the need for explicit labels. This is achieved by formulating auxiliary tasks that can be used for pre-training models before fine-tuning them on a given downstream task. A common approach in practice to SSL pre-training is utilizing standard pre-training datasets, such as ImageNet. While relevant, such a general approach can have a sub-optimal influence on the downstream performance of models, especially on tasks from challenging domains such as remote sensing. In this paper, we analyze the effectiveness of SSL pre-training by employing the iBOT framework coupled with Vision transformers trained on Million-AID, a large and unlabeled remote sensing dataset. We present a comprehensive study of different self-supervised pre-training strategies and evaluate their effect across 14 downstream datasets with diverse properties. Our results demonstrate that leveraging large in-domain datasets for self-supervised pre-training consistently leads to improved predictive downstream performance, compared to the standard approaches found in practice.
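
The idea of an auxiliary (pretext) task mentioned in the abstract can be illustrated with a toy example: hide some patches of an unlabeled image and treat the hidden patches as the prediction target. The sketch below is only an illustration of that general idea, loosely in the spirit of masked image modeling. It is not the iBOT implementation or the authors' code, and the names `patchify` and `make_pretext_example`, the patch size, and the mask ratio are all assumptions chosen for the example.

```python
import random


def patchify(image, patch):
    """Split an H x W image (list of rows) into non-overlapping patch x patch tiles."""
    h, w = len(image), len(image[0])
    assert h % patch == 0 and w % patch == 0, "image must divide evenly into patches"
    tiles = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            tiles.append([row[left:left + patch] for row in image[top:top + patch]])
    return tiles


def make_pretext_example(image, patch=2, mask_ratio=0.5, seed=0):
    """Build one (input, target) training pair from an unlabeled image.

    Hypothetical helper: the supervision signal comes from the image itself,
    so no human-provided label is needed.
    """
    rng = random.Random(seed)
    tiles = patchify(image, patch)
    n_masked = int(len(tiles) * mask_ratio)
    masked = set(rng.sample(range(len(tiles)), n_masked))
    # Model input: masked tiles are replaced by a placeholder (None here).
    visible = [None if i in masked else t for i, t in enumerate(tiles)]
    # Prediction target: the original content of the masked tiles.
    targets = {i: tiles[i] for i in masked}
    return visible, targets


# A 4 x 4 "image" with pixel values 0..15, standing in for an unlabeled scene.
image = [[r * 4 + c for c in range(4)] for r in range(4)]
visible, targets = make_pretext_example(image)
```

A model pre-trained to fill in the masked tiles on a large unlabeled collection would then be fine-tuned on a labeled downstream dataset; that second stage is not shown here.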

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.01645 [cs.CV]
	(or arXiv:2307.01645v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.01645
Journal reference:	IEEE Geoscience and Remote Sensing Letters (2024)
Related DOI:	https://doi.org/10.1109/LGRS.2024.3352926

Submission history

From: Nikola Simidjievski
[v1] Tue, 4 Jul 2023 10:57:52 UTC (521 KB)
[v2] Mon, 5 Feb 2024 14:14:06 UTC (1,019 KB)
