LookinGood: Enhancing Performance Capture with Real-time Neural Re-Rendering

Martin-Brualla, Ricardo; Pandey, Rohit; Yang, Shuoran; Pidlypenskyi, Pavel; Taylor, Jonathan; Valentin, Julien; Khamis, Sameh; Davidson, Philip; Tkach, Anastasia; Lincoln, Peter; Kowdle, Adarsh; Rhemann, Christoph; Goldman, Dan B; Keskin, Cem; Seitz, Steve; Izadi, Shahram; Fanello, Sean

arXiv:1811.05029 (cs)

[Submitted on 12 Nov 2018]

Abstract: Motivated by augmented and virtual reality applications such as telepresence, there has been a recent focus on real-time performance capture of humans in motion. However, given the real-time constraint, these systems often suffer from artifacts in geometry and texture, such as holes and noise in the final rendering, poor lighting, and low-resolution textures. We take the novel approach of augmenting such real-time performance capture systems with a deep architecture that takes a rendering from an arbitrary viewpoint and jointly performs completion, super-resolution, and denoising of the imagery in real time. We call this approach neural (re-)rendering, and our live system "LookinGood". Our deep architecture is trained to produce high-resolution, high-quality images from a coarse rendering in real time. First, we propose a self-supervised training method that does not require manual ground-truth annotation. We contribute a specialized reconstruction error that uses semantic information to focus on relevant parts of the subject, e.g. the face. We also introduce a salient reweighing scheme of the loss function that is able to discard outliers. We specifically design the system for virtual and augmented reality headsets, where the consistency between the left and right eye plays a crucial role in the final user experience. Finally, we generate temporally stable results by explicitly minimizing the difference between two consecutive frames. We tested the proposed system in two different scenarios: one involving a single RGB-D sensor and upper-body reconstruction of an actor, the other consisting of full-body 360-degree capture. Through extensive experimentation, we demonstrate how our system generalizes across unseen sequences and subjects. The supplementary video is available at this http URL.
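The loss ingredients named in the abstract (a reconstruction error focused on semantic regions, a reweighing that discards outliers, and a penalty on the difference between consecutive frames) can be illustrated with a toy sketch. This is not the paper's formulation: the function names, the mean-residual outlier threshold, the `face_gain` and `temporal_weight` values, and the use of flat lists of grayscale pixels are all assumptions made for illustration, and the stereo-consistency term is omitted.

```python
def weighted_l1(pred, target, weights):
    """Weighted mean absolute error over two flattened images."""
    total_weight = sum(weights)
    if total_weight == 0:
        return 0.0
    return sum(w * abs(p - t) for p, t, w in zip(pred, target, weights)) / total_weight


def outlier_weights(pred, target, k=2.0):
    """Hypothetical reweighing: drop pixels whose residual exceeds k times the mean residual."""
    residuals = [abs(p - t) for p, t in zip(pred, target)]
    threshold = k * sum(residuals) / len(residuals)
    return [1.0 if r <= threshold else 0.0 for r in residuals]


def total_loss(pred, target, face_mask, prev_pred, face_gain=4.0, temporal_weight=0.1):
    """Toy combination of the terms; weights and structure are assumptions."""
    keep = outlier_weights(pred, target)
    # Pixels inside the (assumed) face mask count face_gain times as much.
    weights = [k * (1.0 + (face_gain - 1.0) * m) for k, m in zip(keep, face_mask)]
    reconstruction = weighted_l1(pred, target, weights)
    # Simplified temporal term: mean absolute change between consecutive outputs.
    temporal = weighted_l1(pred, prev_pred, [1.0] * len(pred))
    return reconstruction + temporal_weight * temporal


pred = [0.5, 0.5, 0.5, 0.9]
target = [0.5, 0.4, 0.5, 0.0]   # last pixel is a large-error outlier
face_mask = [0, 1, 0, 0]        # second pixel belongs to the "face"
print(outlier_weights(pred, target))
print(total_loss(pred, target, face_mask, prev_pred=pred))
```

In this toy input the last pixel is excluded as an outlier and the face pixel dominates the reconstruction term, which is the qualitative behavior the abstract describes.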

Comments:	The supplementary video is available at: this http URL. To be presented at SIGGRAPH Asia 2018.
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1811.05029 [cs.CV]
	(or arXiv:1811.05029v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1811.05029

Submission history

From: Ricardo Martin Brualla
[v1] Mon, 12 Nov 2018 22:51:19 UTC (7,193 KB)
