Skip to main content

Showing 1–9 of 9 results for author: Rünz, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.00615  [pdf, ps, other

    cs.CV cs.AI

    Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction

    Authors: Simon Giebenhain, Tobias Kirschstein, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: We address the 3D reconstruction of human faces from a single RGB image. To this end, we propose Pixel3DMM, a set of highly-generalized vision transformers which predict per-pixel geometric cues in order to constrain the optimization of a 3D morphable face model (3DMM). We exploit the latent features of the DINO foundation model, and introduce a tailored surface normal and uv-coordinate prediction… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: Project Website: https://simongiebenhain.github.io/pixel3dmm/ ; Video: https://www.youtube.com/watch?v=BwxwEXJwUDc

  2. arXiv:2405.19331  [pdf, other

    cs.CV cs.AI cs.GR

    NPGA: Neural Parametric Gaussian Avatars

    Authors: Simon Giebenhain, Tobias Kirschstein, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: The creation of high-fidelity, digital versions of human heads is an important stepping stone in the process of further integrating virtual components into our everyday lives. Constructing such avatars is a challenging research problem, due to a high demand for photo-realism and real-time rendering performance. In this work, we propose Neural Parametric Gaussian Avatars (NPGA), a data-driven appro… ▽ More

    Submitted 13 September, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Project Page: see https://simongiebenhain.github.io/NPGA/ ; Youtube Video: see https://youtu.be/t0S0OK7WnA4

    Journal ref: SIGGRAPH Asia 2024 Conference Papers (SA Conference Papers '24), December 3-6, 2024, Tokyo, Japan

  3. arXiv:2312.06740  [pdf, other

    cs.CV

    MonoNPHM: Dynamic Head Reconstruction from Monocular Videos

    Authors: Simon Giebenhain, Tobias Kirschstein, Markos Georgopoulos, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: We present Monocular Neural Parametric Head Models (MonoNPHM) for dynamic 3D head reconstructions from monocular RGB videos. To this end, we propose a latent appearance space that parameterizes a texture field on top of a neural parametric model. We constrain predicted color values to be correlated with the underlying geometry such that gradients from RGB effectively influence latent geometry code… ▽ More

    Submitted 29 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Project Page: see https://simongiebenhain.github.io/MonoNPHM/ ; Video: see https://youtu.be/n-wjaC3UIeE

  4. arXiv:2305.06356  [pdf, other

    cs.CV cs.GR cs.LG

    HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion

    Authors: Mustafa Işık, Martin Rünz, Markos Georgopoulos, Taras Khakhulin, Jonathan Starck, Lourdes Agapito, Matthias Nießner

    Abstract: Representing human performance at high-fidelity is an essential building block in diverse applications, such as film production, computer games or videoconferencing. To close the gap to production-level quality, we introduce HumanRF, a 4D dynamic neural scene representation that captures full-body appearance in motion from multi-view video input, and enables playback from novel, unseen viewpoints.… ▽ More

    Submitted 11 May, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Project webpage: https://synthesiaresearch.github.io/humanrf Dataset webpage: https://www.actors-hq.com/ Video: https://www.youtube.com/watch?v=OTnhiLLE7io Code: https://github.com/synthesiaresearch/humanrf

  5. arXiv:2212.02761  [pdf, other

    cs.CV

    Learning Neural Parametric Head Models

    Authors: Simon Giebenhain, Tobias Kirschstein, Markos Georgopoulos, Martin Rünz, Lourdes Agapito, Matthias Nießner

    Abstract: We propose a novel 3D morphable model for complete human heads based on hybrid neural fields. At the core of our model lies a neural parametric representation that disentangles identity and expressions in disjoint latent spaces. To this end, we capture a person's identity in a canonical space as a signed distance field (SDF), and model facial expressions with a neural deformation field. In additio… ▽ More

    Submitted 14 April, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: Project Page: https://simongiebenhain.github.io/NPHM ; Project Video: https://www.youtube.com/watch?v=0mDk2tFOJCg ; Camer-Ready Version; Added Experiments

  6. arXiv:2108.09481  [pdf, other

    cs.CV cs.RO

    DSP-SLAM: Object Oriented SLAM with Deep Shape Priors

    Authors: Jingwen Wang, Martin Rünz, Lourdes Agapito

    Abstract: We propose DSP-SLAM, an object-oriented SLAM system that builds a rich and accurate joint map of dense 3D models for foreground objects, and sparse landmark points to represent the background. DSP-SLAM takes as input the 3D point cloud reconstructed by a feature-based SLAM system and equips it with the ability to enhance its sparse map with dense reconstructions of detected objects. Objects are de… ▽ More

    Submitted 22 October, 2021; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: To be published at 3DV 2021

  7. arXiv:2005.05125  [pdf, other

    cs.CV

    FroDO: From Detections to 3D Objects

    Authors: Kejie Li, Martin Rünz, Meng Tang, Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

    Abstract: Object-oriented maps are important for scene understanding since they jointly capture geometry and semantics, allow individual instantiation and meaningful reasoning about objects. We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers object location, pose and shape in a coarse-to-fine manner. Key to FroDO is to embed object shapes in a novel le… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: To be published in CVPR 2020. The first two authors contributed equally

  8. arXiv:1804.09194  [pdf, other

    cs.CV cs.RO

    MaskFusion: Real-Time Recognition, Tracking and Reconstruction of Multiple Moving Objects

    Authors: Martin Rünz, Maud Buffier, Lourdes Agapito

    Abstract: We present MaskFusion, a real-time, object-aware, semantic and dynamic RGB-D SLAM system that goes beyond traditional systems which output a purely geometric map of a static scene. MaskFusion recognizes, segments and assigns semantic class labels to different objects in the scene, while tracking and reconstructing them even when they move independently from the camera. As an RGB-D camera scans a… ▽ More

    Submitted 22 October, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Presented at IEEE International Symposium on Mixed and Augmented Reality (ISMAR) 2018

  9. Co-Fusion: Real-time Segmentation, Tracking and Fusion of Multiple Objects

    Authors: Martin Rünz, Lourdes Agapito

    Abstract: In this paper we introduce Co-Fusion, a dense SLAM system that takes a live stream of RGB-D images as input and segments the scene into different objects (using either motion or semantic cues) while simultaneously tracking and reconstructing their 3D shape in real time. We use a multiple model fitting approach where each object can move independently from the background and still be effectively tr… ▽ More

    Submitted 20 June, 2017; originally announced June 2017.

    Comments: International Conference on Robotics and Automation (ICRA) 2017, http://visual.cs.ucl.ac.uk/pubs/cofusion, https://github.com/martinruenz/co-fusion