Cross-View Image Retrieval -- Ground to Aerial Image Retrieval through Deep Learning

Khurshid, Numan; Hanif, Talha; Tharani, Mohbat; Taj, Murtaza

doi:10.1007/978-3-030-36711-4_19

Computer Science > Computer Vision and Pattern Recognition

arXiv:2005.00725 (cs)

[Submitted on 2 May 2020]

Title:Cross-View Image Retrieval -- Ground to Aerial Image Retrieval through Deep Learning

Authors:Numan Khurshid, Talha Hanif, Mohbat Tharani, Murtaza Taj

View PDF

Abstract:Cross-modal retrieval aims to measure the content similarity between different types of data. The idea has been previously applied to visual, text, and speech data. In this paper, we present a novel cross-modal retrieval method specifically for multi-view images, called Cross-view Image Retrieval CVIR. Our approach aims to find a feature space as well as an embedding space in which samples from street-view images are compared directly to satellite-view images (and vice-versa). For this comparison, a novel deep metric learning based solution "DeepCVIR" has been proposed. Previous cross-view image datasets are deficient in that they (1) lack class information; (2) were originally collected for cross-view image geolocalization task with coupled images; (3) do not include any images from off-street locations. To train, compare, and evaluate the performance of cross-view image retrieval, we present a new 6 class cross-view image dataset termed as CrossViewRet which comprises of images including freeway, mountain, palace, river, ship, and stadium with 700 high-resolution dual-view images for each class. Results show that the proposed DeepCVIR outperforms conventional matching approaches on the CVIR task for the given dataset and would also serve as the baseline for future research.

Comments:	International Conference on Neural Information Processing (ICONIP-2019)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2005.00725 [cs.CV]
	(or arXiv:2005.00725v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2005.00725
Related DOI:	https://doi.org/10.1007/978-3-030-36711-4_19

Submission history

From: Numan Khurshid [view email]
[v1] Sat, 2 May 2020 06:52:16 UTC (4,988 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-View Image Retrieval -- Ground to Aerial Image Retrieval through Deep Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-View Image Retrieval -- Ground to Aerial Image Retrieval through Deep Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators