Masks make discriminative models great again!

Cao, Tianshi; Rakotosaona, Marie-Julie; Poole, Ben; Tombari, Federico; Niemeyer, Michael

Computer Science > Computer Vision and Pattern Recognition

arXiv:2507.00916 (cs)

[Submitted on 1 Jul 2025]

Title:Masks make discriminative models great again!

Authors:Tianshi Cao, Marie-Julie Rakotosaona, Ben Poole, Federico Tombari, Michael Niemeyer

View PDF HTML (experimental)

Abstract:We present Image2GS, a novel approach that addresses the challenging problem of reconstructing photorealistic 3D scenes from a single image by focusing specifically on the image-to-3D lifting component of the reconstruction process. By decoupling the lifting problem (converting an image to a 3D model representing what is visible) from the completion problem (hallucinating content not present in the input), we create a more deterministic task suitable for discriminative models. Our method employs visibility masks derived from optimized 3D Gaussian splats to exclude areas not visible from the source view during training. This masked training strategy significantly improves reconstruction quality in visible regions compared to strong baselines. Notably, despite being trained only on masked regions, Image2GS remains competitive with state-of-the-art discriminative models trained on full target images when evaluated on complete scenes. Our findings highlight the fundamental struggle discriminative models face when fitting unseen regions and demonstrate the advantages of addressing image-to-3D lifting as a distinct problem with specialized techniques.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2507.00916 [cs.CV]
	(or arXiv:2507.00916v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2507.00916

Submission history

From: Tianshi Cao [view email]
[v1] Tue, 1 Jul 2025 16:22:23 UTC (12,240 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Masks make discriminative models great again!

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Masks make discriminative models great again!

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators