Partial-View Object View Synthesis via Filtered Inversion

Sun, Fan-Yun; Tremblay, Jonathan; Blukis, Valts; Lin, Kevin; Xu, Danfei; Ivanovic, Boris; Karkus, Peter; Birchfield, Stan; Fox, Dieter; Zhang, Ruohan; Li, Yunzhu; Wu, Jiajun; Pavone, Marco; Haber, Nick

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.00673 (cs)

[Submitted on 3 Apr 2023 (v1), last revised 17 Aug 2024 (this version, v2)]

Title:Partial-View Object View Synthesis via Filtered Inversion

Authors:Fan-Yun Sun, Jonathan Tremblay, Valts Blukis, Kevin Lin, Danfei Xu, Boris Ivanovic, Peter Karkus, Stan Birchfield, Dieter Fox, Ruohan Zhang, Yunzhu Li, Jiajun Wu, Marco Pavone, Nick Haber

View PDF HTML (experimental)

Abstract:We propose Filtering Inversion (FINV), a learning framework and optimization process that predicts a renderable 3D object representation from one or few partial views. FINV addresses the challenge of synthesizing novel views of objects from partial observations, spanning cases where the object is not entirely in view, is partially occluded, or is only observed from similar views. To achieve this, FINV learns shape priors by training a 3D generative model. At inference, given one or more views of a novel real-world object, FINV first finds a set of latent codes for the object by inverting the generative model from multiple initial seeds. Maintaining the set of latent codes, FINV filters and resamples them after receiving each new observation, akin to particle filtering. The generator is then finetuned for each latent code on the available views in order to adapt to novel objects. We show that FINV successfully synthesizes novel views of real-world objects (e.g., chairs, tables, and cars), even if the generative prior is trained only on synthetic objects. The ability to address the sim-to-real problem allows FINV to be used for object categories without real-world datasets. FINV achieves state-of-the-art performance on multiple real-world datasets, recovers object shape and texture from partial and sparse views, is robust to occlusion, and is able to incrementally improve its representation with more observations.

Comments:	project website: this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.00673 [cs.CV]
	(or arXiv:2304.00673v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.00673

Submission history

From: Fan-Yun Sun [view email]
[v1] Mon, 3 Apr 2023 00:59:31 UTC (11,134 KB)
[v2] Sat, 17 Aug 2024 20:41:03 UTC (21,489 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Partial-View Object View Synthesis via Filtered Inversion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Partial-View Object View Synthesis via Filtered Inversion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators