Volumetric Capture of Humans with a Single RGBD Camera via Semi-Parametric Learning

Pandey, Rohit; Tkach, Anastasia; Yang, Shuoran; Pidlypenskyi, Pavel; Taylor, Jonathan; Martin-Brualla, Ricardo; Tagliasacchi, Andrea; Papandreou, George; Davidson, Philip; Keskin, Cem; Izadi, Shahram; Fanello, Sean

arXiv:1905.12162 (cs)

[Submitted on 29 May 2019]

Abstract: Volumetric (4D) performance capture is fundamental for AR/VR content generation. Whereas previous work in 4D performance capture has shown impressive results in studio settings, the technology is still far from being accessible to a typical consumer who, at best, might own a single RGBD sensor. Thus, in this work, we propose a method to synthesize free-viewpoint renderings using a single RGBD camera. The key insight is to leverage previously seen "calibration" images of a given user to extrapolate what should be rendered in a novel viewpoint from the data available in the sensor. Given these past observations from multiple viewpoints, and the current RGBD image from a fixed view, we propose an end-to-end framework that fuses both data sources to generate novel renderings of the performer. We demonstrate that the method can produce high-fidelity images and handle extreme changes in subject pose and camera viewpoint. We also show that the system generalizes to performers not seen in the training data. We run exhaustive experiments demonstrating the effectiveness of the proposed semi-parametric model (i.e., calibration images available to the neural network) compared to other state-of-the-art machine-learned solutions. Further, we compare the method with more traditional pipelines that employ multi-view capture. We show that our framework is able to achieve compelling results with substantially less infrastructure than previously required.
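As a rough illustration of what "data available in the sensor" can contribute to a novel viewpoint, the sketch below back-projects a depth map into 3D and projects the points into a second camera. It is generic pinhole-camera geometry, not the authors' pipeline: the function name `reproject_depth`, the shared intrinsics `K`, and the transform `T_src_to_dst` are assumptions made for this example, and the abstract does not specify how the paper's network consumes the RGBD input.

```python
import numpy as np


def reproject_depth(depth, K, T_src_to_dst):
    """Back-project a depth map to 3D and project it into a second camera.

    Hypothetical helper for illustration only.
    depth:        (H, W) metric depth; 0 marks invalid pixels.
    K:            (3, 3) pinhole intrinsics, assumed shared by both cameras.
    T_src_to_dst: (4, 4) rigid transform from source to target camera frame.
    Returns (N, 2) target pixel coordinates and (N,) target depths.
    """
    h, w = depth.shape
    v, u = np.mgrid[0:h, 0:w]          # per-pixel row (v) and column (u) indices
    valid = depth > 0
    z = depth[valid]

    # Homogeneous pixel coordinates of the valid pixels, shape (3, N).
    pix = np.stack([u[valid], v[valid], np.ones_like(z)], axis=0)

    # Back-project: X = z * K^{-1} [u, v, 1]^T.
    pts_src = (np.linalg.inv(K) @ pix) * z

    # Move the points into the target camera frame.
    R, t = T_src_to_dst[:3, :3], T_src_to_dst[:3, 3:4]
    pts_dst = R @ pts_src + t

    # Keep only points in front of the target camera, then project.
    pts_dst = pts_dst[:, pts_dst[2] > 0]
    proj = K @ pts_dst
    uv = (proj[:2] / proj[2]).T
    return uv, pts_dst[2]
```

Pixels that the fixed sensor never observed leave holes after such a reprojection, which is the gap the abstract proposes to fill using the previously captured calibration images.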

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1905.12162 [cs.CV]
	(or arXiv:1905.12162v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.12162

Submission history

From: Rohit Pandey
[v1] Wed, 29 May 2019 01:29:51 UTC (7,050 KB)
