NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Jampani, Varun; Maninis, Kevis-Kokitsi; Engelhardt, Andreas; Karpur, Arjun; Truong, Karen; Sargent, Kyle; Popov, Stefan; Araujo, André; Martin-Brualla, Ricardo; Patel, Kaushal; Vlasic, Daniel; Ferrari, Vittorio; Makadia, Ameesh; Liu, Ce; Li, Yuanzhen; Zhou, Howard

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.09109 (cs)

[Submitted on 15 Jun 2023 (v1), last revised 13 Oct 2023 (this version, v2)]

Title:NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Authors:Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

View PDF

Abstract:Recent advances in neural reconstruction enable high-quality 3D object reconstruction from casually captured image collections. Current techniques mostly analyze their progress on relatively simple image collections where Structure-from-Motion (SfM) techniques can provide ground-truth (GT) camera poses. We note that SfM techniques tend to fail on in-the-wild image collections such as image search results with varying backgrounds and illuminations. To enable systematic research progress on 3D reconstruction from casual image captures, we propose NAVI: a new dataset of category-agnostic image collections of objects with high-quality 3D scans along with per-image 2D-3D alignments providing near-perfect GT camera parameters. These 2D-3D alignments allow us to extract accurate derivative annotations such as dense pixel correspondences, depth and segmentation maps. We demonstrate the use of NAVI image collections on different problem settings and show that NAVI enables more thorough evaluations that were not possible with existing datasets. We believe NAVI is beneficial for systematic research progress on 3D reconstruction and correspondence estimation. Project page: this https URL

Comments:	NeurIPS 2023 camera ready. Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.09109 [cs.CV]
	(or arXiv:2306.09109v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.09109

Submission history

From: Kevis-Kokitsi Maninis [view email]
[v1] Thu, 15 Jun 2023 13:11:30 UTC (7,050 KB)
[v2] Fri, 13 Oct 2023 16:12:32 UTC (16,716 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators