Learning Structure-from-Motion with Graph Attention Networks

Brynte, Lucas; Iglesias, José Pedro; Olsson, Carl; Kahl, Fredrik

arXiv:2308.15984 (cs)

[Submitted on 30 Aug 2023 (v1), last revised 18 May 2024 (this version, v3)]

Abstract: In this paper we tackle the problem of learning Structure-from-Motion (SfM) through the use of graph attention networks. SfM is a classic computer vision problem that is solved through iterative minimization of reprojection errors, referred to as Bundle Adjustment (BA), starting from a good initialization. In order to obtain a good enough initialization for BA, conventional methods rely on a sequence of sub-problems (such as pairwise pose estimation, pose averaging or triangulation) which provide an initial solution that can then be refined using BA. In this work we replace these sub-problems by learning a model that takes as input the 2D keypoints detected across multiple views, and outputs the corresponding camera poses and 3D keypoint coordinates. Our model takes advantage of graph neural networks to learn SfM-specific primitives, and we show that it can be used for fast inference of the reconstruction for new and unseen sequences. The experimental results show that the proposed model outperforms competing learning-based methods, and challenges COLMAP while having lower runtime. Our code is available at this https URL.
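
As an illustration of the reprojection error that Bundle Adjustment minimizes, the following is a minimal sketch in plain Python. It is not taken from the paper or its code: the function names `project` and `mean_reprojection_error`, the pinhole camera with unit focal length and principal point at the origin, and the toy two-camera scene are all assumptions made for this example. Given camera poses, 3D points, and observed 2D keypoints, it measures how far the projected points fall from the observations.

```python
import math


def project(R, t, X, f=1.0):
    """Project 3D point X with an assumed pinhole camera.

    R is a 3x3 rotation (nested lists), t a translation, f the focal length.
    The principal point is assumed to be at the image origin.
    """
    # Camera-frame coordinates: Xc = R @ X + t
    Xc = [sum(R[i][j] * X[j] for j in range(3)) + t[i] for i in range(3)]
    # Perspective division onto the image plane
    return (f * Xc[0] / Xc[2], f * Xc[1] / Xc[2])


def mean_reprojection_error(cameras, points, observations):
    """Mean Euclidean distance between projected and observed keypoints.

    observations maps (camera index, point index) -> observed (u, v).
    """
    total = 0.0
    for (i, j), (u, v) in observations.items():
        R, t = cameras[i]
        pu, pv = project(R, t, points[j])
        total += math.hypot(pu - u, pv - v)
    return total / len(observations)


# Hypothetical toy scene: two cameras with identity rotation, two 3D points.
I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
cameras = [(I3, [0.0, 0.0, 0.0]), (I3, [-1.0, 0.0, 0.0])]
points = [[0.0, 0.0, 4.0], [1.0, 1.0, 2.0]]

# Noise-free observations, generated by projecting the true points.
observations = {
    (i, j): project(cameras[i][0], cameras[i][1], points[j])
    for i in range(len(cameras))
    for j in range(len(points))
}

exact_error = mean_reprojection_error(cameras, points, observations)

# Perturbing one observed keypoint makes the error non-zero.
noisy = dict(observations)
u, v = noisy[(0, 0)]
noisy[(0, 0)] = (u + 0.3, v + 0.4)
noisy_error = mean_reprojection_error(cameras, points, noisy)
```

In this sketch the error is zero when the poses and points exactly explain the observations and grows as they disagree. BA iteratively adjusts poses and points to reduce an error of this kind, which is why it depends on a good initialization.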

Comments:	CVPR camera-ready updates
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2308.15984 [cs.CV]
	(or arXiv:2308.15984v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.15984

Submission history

From: Lucas Brynte
[v1] Wed, 30 Aug 2023 12:13:13 UTC (1,567 KB)
[v2] Mon, 4 Dec 2023 08:50:31 UTC (1,569 KB)
[v3] Sat, 18 May 2024 22:44:57 UTC (1,570 KB)