Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes

Kassab, Karim; Schnepf, Antoine; Franceschi, Jean-Yves; Caraffa, Laurent; Vasile, Flavian; Mary, Jeremie; Comport, Andrew; Gouet-Brunet, Valérie

arXiv:2410.23742v1 (cs)

[Submitted on 31 Oct 2024 (this version), latest version 31 Jan 2025 (v2)]

Abstract: While the field of inverse graphics has been witnessing continuous growth, techniques devised thus far predominantly focus on learning individual scene representations. In contrast, learning large sets of scenes has been a considerable bottleneck in NeRF developments, as repeatedly applying inverse graphics on a sequence of scenes, though essential for various applications, remains largely prohibitive in terms of resource costs. We introduce a framework termed "scaled inverse graphics", aimed at efficiently learning large sets of scene representations, and propose a novel method to this end. It operates in two stages: (i) training a compression model on a subset of scenes, then (ii) training NeRF models on the resulting smaller representations, thereby reducing the optimization space per new scene. In practice, we compact the representation of scenes by learning NeRFs in a latent space to reduce the image resolution, and sharing information across scenes to reduce NeRF representation complexity. We experimentally show that our method presents both the lowest training time and memory footprint in scaled inverse graphics compared to other methods applied independently on each scene. Our codebase is publicly available as open-source. Our project page can be found at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.23742 [cs.CV]
	(or arXiv:2410.23742v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.23742

Submission history

From: Karim Kassab
[v1] Thu, 31 Oct 2024 08:58:00 UTC (2,485 KB)
[v2] Fri, 31 Jan 2025 11:23:37 UTC (3,021 KB)
