SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

Revaud, Jerome; Cabon, Yohann; Brégier, Romain; Lee, JongMin; Weinzaepfel, Philippe

arXiv:2307.11702v1 (cs)

[Submitted on 21 Jul 2023 (this version), latest version 30 Nov 2023 (v3)]

Abstract: Scene coordinate regression (SCR), i.e., predicting 3D coordinates for every pixel of a given image, has recently shown promising potential. However, existing methods remain mostly scene-specific or limited to small scenes and thus hardly scale to realistic datasets. In this paper, we propose a new paradigm where a single generic SCR model is trained once and then deployed to new test scenes, regardless of their scale and without further finetuning. For a given query image, it collects inputs from off-the-shelf image retrieval techniques and Structure-from-Motion databases: a list of relevant database images with sparse pointwise 2D-3D annotations. The model is based on the transformer architecture and can take a variable number of images and sparse 2D-3D annotations as input. It is trained on a few diverse datasets and significantly outperforms other scene regression approaches, including scene-specific models, on several visual localization benchmarks. In particular, we set a new state of the art on the Cambridge localization benchmark, even outperforming feature-matching-based approaches.
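
The input described in the abstract (a query image plus a variable-length list of retrieved database images, each carrying sparse 2D-3D annotations) can be pictured with a small data-structure sketch. This is only an illustration under stated assumptions, not the authors' code: the names `DatabaseView`, `SCRInput`, `build_scr_input` and the `max_views` parameter are hypothetical, and the retrieval ranking and Structure-from-Motion annotations are assumed to be given as plain Python lists and dictionaries.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Point2D = Tuple[float, float]          # pixel location (x, y)
Point3D = Tuple[float, float, float]   # scene coordinate (X, Y, Z)


@dataclass
class DatabaseView:
    """Hypothetical container: one retrieved image and its sparse 2D-3D pairs."""
    image_id: str
    pixels: List[Point2D]
    points3d: List[Point3D]

    def __post_init__(self) -> None:
        # Every annotated pixel must be paired with exactly one 3D point.
        if len(self.pixels) != len(self.points3d):
            raise ValueError("each 2D pixel needs exactly one 3D point")


@dataclass
class SCRInput:
    """Hypothetical container: the query plus a variable number of views."""
    query_id: str
    views: List[DatabaseView]


def build_scr_input(
    query_id: str,
    ranked_ids: List[str],
    sfm_annotations: Dict[str, List[Tuple[Point2D, Point3D]]],
    max_views: int = 8,  # assumed cap, not a value taken from the paper
) -> SCRInput:
    """Keep the top-ranked retrieved images that have at least one annotation."""
    views: List[DatabaseView] = []
    for image_id in ranked_ids:
        pairs = sfm_annotations.get(image_id, [])
        if not pairs:
            continue  # skip images without any 2D-3D annotation
        pixels = [pixel for pixel, _ in pairs]
        points3d = [point for _, point in pairs]
        views.append(DatabaseView(image_id, pixels, points3d))
        if len(views) == max_views:
            break
    return SCRInput(query_id, views)
```

Because the number of views and the number of annotations per view both vary, the containers are plain lists rather than fixed-size arrays; a real model would still need to batch or pad them in some way that the abstract does not describe.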

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.11702 [cs.CV]
	(or arXiv:2307.11702v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.11702

Submission history

From: Jerome Revaud
[v1] Fri, 21 Jul 2023 16:56:36 UTC (7,423 KB)
[v2] Fri, 28 Jul 2023 10:36:58 UTC (7,423 KB)
[v3] Thu, 30 Nov 2023 11:22:53 UTC (5,076 KB)
