Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder

Schnepf, Antoine; Kassab, Karim; Franceschi, Jean-Yves; Caraffa, Laurent; Vasile, Flavian; Mary, Jeremie; Comport, Andrew; Gouet-Brunet, Valerie

arXiv:2410.22936 (cs)

[Submitted on 30 Oct 2024 (v1), last revised 24 Feb 2025 (this version, v2)]

Abstract: While pre-trained image autoencoders are increasingly utilized in computer vision, the application of inverse graphics in 2D latent spaces has been under-explored. Yet, besides reducing the training and rendering complexity, applying inverse graphics in the latent space enables a valuable interoperability with other latent-based 2D methods. The major challenge is that inverse graphics cannot be directly applied to such image latent spaces because they lack an underlying 3D geometry. In this paper, we propose an Inverse Graphics Autoencoder (IG-AE) that specifically addresses this issue. To this end, we regularize an image autoencoder with 3D-geometry by aligning its latent space with jointly trained latent 3D scenes. We utilize the trained IG-AE to bring NeRFs to the latent space with a latent NeRF training pipeline, which we implement in an open-source extension of the Nerfstudio framework, thereby unlocking latent scene learning for its supported methods. We experimentally confirm that Latent NeRFs trained with IG-AE present an improved quality compared to a standard autoencoder, all while exhibiting training and rendering accelerations with respect to NeRFs trained in the image space. Our project page can be found at this https URL.

Comments:	Accepted at ICLR 2025. Available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.22936 [cs.CV]
	(or arXiv:2410.22936v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.22936

Submission history

From: Karim Kassab
[v1] Wed, 30 Oct 2024 11:43:55 UTC (5,444 KB)
[v2] Mon, 24 Feb 2025 15:16:53 UTC (6,141 KB)
