CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction

Shin, Jisu; Shaw, Richard; Shin, Seunghyun; Zhang, Zhensong; Jeon, Hae-Gon; Perez-Pellitero, Eduardo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2507.15748 (cs)

[Submitted on 21 Jul 2025 (v1), last revised 30 Sep 2025 (this version, v3)]

Title:CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction

Authors:Jisu Shin, Richard Shaw, Seunghyun Shin, Zhensong Zhang, Hae-Gon Jeon, Eduardo Perez-Pellitero

View PDF HTML (experimental)

Abstract:Modern camera pipelines apply extensive on-device processing, such as exposure adjustment, white balance, and color correction, which, while beneficial individually, often introduce photometric inconsistencies across views. These appearance variations violate multi-view consistency and degrade novel view synthesis. Joint optimization of scene-specific representations and per-image appearance embeddings has been proposed to address this issue, but with increased computational complexity and slower training. In this work, we propose a generalizable, feed-forward approach that predicts spatially adaptive bilateral grids to correct photometric variations in a multi-view consistent manner. Our model processes hundreds of frames in a single step, enabling efficient large-scale harmonization, and seamlessly integrates into downstream 3D reconstruction models, providing cross-scene generalization without requiring scene-specific retraining. To overcome the lack of paired data, we employ a hybrid self-supervised rendering loss leveraging 3D foundation models, improving generalization to real-world variations. Extensive experiments show that our approach outperforms or matches the reconstruction quality of existing scene-specific optimization methods with appearance modeling, without significantly affecting the training time of baseline 3D models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2507.15748 [cs.CV]
	(or arXiv:2507.15748v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2507.15748

Submission history

From: Seunghyun Shin Mr [view email]
[v1] Mon, 21 Jul 2025 16:03:58 UTC (17,241 KB)
[v2] Mon, 29 Sep 2025 16:41:52 UTC (9,020 KB)
[v3] Tue, 30 Sep 2025 15:11:14 UTC (9,021 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CHROMA: Consistent Harmonization of Multi-View Appearance via Bilateral Grid Prediction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators