Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction

Reizenstein, Jeremy; Shapovalov, Roman; Henzler, Philipp; Sbordone, Luca; Labatut, Patrick; Novotny, David

arXiv:2109.00512 (cs)

[Submitted on 1 Sep 2021]

Abstract: Traditional approaches for learning 3D object categories have been predominantly trained and evaluated on synthetic datasets due to the unavailability of real 3D-annotated category-centric data. Our main goal is to facilitate advances in this field by collecting real-world data in a magnitude similar to the existing synthetic counterparts. The principal contribution of this work is thus a large-scale dataset, called Common Objects in 3D, with real multi-view images of object categories annotated with camera poses and ground truth 3D point clouds. The dataset contains a total of 1.5 million frames from nearly 19,000 videos capturing objects from 50 MS-COCO categories and, as such, it is significantly larger than alternatives both in terms of the number of categories and objects. We exploit this new dataset to conduct one of the first large-scale "in-the-wild" evaluations of several new-view-synthesis and category-centric 3D reconstruction methods. Finally, we contribute NerFormer, a novel neural rendering method that leverages the powerful Transformer to reconstruct an object given a small number of its views. The CO3D dataset is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2109.00512 [cs.CV]
	(or arXiv:2109.00512v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2109.00512
Journal reference:	International Conference on Computer Vision, 2021

Submission history

From: David Novotný
[v1] Wed, 1 Sep 2021 17:59:05 UTC (35,384 KB)
