Skip to main content

Showing 1–50 of 60 results for author: Habermann, M

.
  1. arXiv:2506.01802  [pdf, ps, other

    cs.CV

    UMA: Ultra-detailed Human Avatars via Multi-level Surface Alignment

    Authors: Heming Zhu, Guoxing Sun, Christian Theobalt, Marc Habermann

    Abstract: Learning an animatable and clothed human avatar model with vivid dynamics and photorealistic appearance from multi-view videos is an important foundational research problem in computer graphics and vision. Fueled by recent advances in implicit representations, the quality of the animatable avatars has achieved an unprecedented level by attaching the implicit representation to drivable human templa… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: For video results, see https://youtu.be/XMNCy7J2tuc

  2. arXiv:2505.15385  [pdf, other

    cs.CV cs.GR

    EVA: Expressive Virtual Avatars from Multi-view Videos

    Authors: Hendrik Junkawitsch, Guoxing Sun, Heming Zhu, Christian Theobalt, Marc Habermann

    Abstract: With recent advancements in neural rendering and motion capture algorithms, remarkable progress has been made in photorealistic human avatar modeling, unlocking immense potential for applications in virtual reality, augmented reality, remote communication, and industries such as gaming, film, and medicine. However, existing methods fail to provide complete, faithful, and expressive control over hu… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted at SIGGRAPH 2025 Conference Track, Project page: https://vcai.mpi-inf.mpg.de/projects/EVA/

  3. arXiv:2504.12905  [pdf, other

    cs.CV

    Second-order Optimization of Gaussian Splats with Importance Sampling

    Authors: Hamza Pehlivan, Andrea Boscolo Camiletto, Lin Geng Foo, Marc Habermann, Christian Theobalt

    Abstract: 3D Gaussian Splatting (3DGS) is widely used for novel view synthesis due to its high rendering quality and fast inference time. However, 3DGS predominantly relies on first-order optimizers such as Adam, which leads to long training times. To address this limitation, we propose a novel second-order optimization strategy based on Levenberg-Marquardt (LM) and Conjugate Gradient (CG), which we specifi… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  4. arXiv:2504.07144  [pdf, other

    eess.IV

    GIGA: Generalizable Sparse Image-driven Gaussian Avatars

    Authors: Anton Zubekhin, Heming Zhu, Paulo Gotardo, Thabo Beeler, Marc Habermann, Christian Theobalt

    Abstract: Driving a high-quality and photorealistic full-body human avatar, from only a few RGB cameras, is a challenging problem that has become increasingly relevant with emerging virtual reality technologies. To democratize such technology, a promising solution may be a generalizable method that takes sparse multi-view images of an unseen person and then generates photoreal free-view renderings of such i… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 14 pages, 10 figures, project page: https://vcai.mpi-inf.mpg.de/projects/GIGA

  5. arXiv:2503.23094  [pdf, other

    cs.CV

    FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video

    Authors: Andrea Boscolo Camiletto, Jian Wang, Eduardo Alvarado, Rishabh Dabral, Thabo Beeler, Marc Habermann, Christian Theobalt

    Abstract: Egocentric motion capture with a head-mounted body-facing stereo camera is crucial for VR and AR applications but presents significant challenges such as heavy occlusions and limited annotated real-world data. Existing methods rely on synthetic pretraining and struggle to generate smooth and accurate predictions in real-world settings, particularly for lower limbs. Our work addresses these limitat… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

    Comments: Accepted at CVPR 2025

  6. arXiv:2503.19976  [pdf, other

    cs.GR cs.CV cs.LG

    Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields

    Authors: Navami Kairanda, Marc Habermann, Shanthika Naik, Christian Theobalt, Vladislav Golyanik

    Abstract: 3D reconstruction of highly deformable surfaces (e.g. cloths) from monocular RGB videos is a challenging problem, and no solution provides a consistent and accurate recovery of fine-grained surface details. To account for the ill-posed nature of the setting, existing methods use deformation models with statistical, neural, or physical priors. They also predominantly rely on nonadaptive discrete su… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: 15 pages, 12 figures and 3 tables; project page: https://4dqv.mpiinf.mpg.de/ThinShellSfT; CVPR 2025

  7. arXiv:2503.14093  [pdf, other

    q-bio.PE

    Functional Motifs in Foodwebs and Networks

    Authors: Melanie Habermann, Ashkaan K. Fahimipour, Justin D. Yeakel, Thilo Gross

    Abstract: When studying a complex system it is often useful to think of the system as a network of interacting units. One can then ask if some properties of the entire network are already explained by a small part of the network - a network motif. A famous example of an ecological motif is competitive exclusion in foodwebs, where the presence of two species competing for a shared resource precludes the exis… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 17 pages, 4 figures, supporting information: 13 pages, 1 figure

  8. arXiv:2503.12242  [pdf, other

    cs.CV

    RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance

    Authors: Yuheng Jiang, Zhehao Shen, Chengcheng Guo, Yu Hong, Zhuo Su, Yingliang Zhang, Marc Habermann, Lan Xu

    Abstract: Human-centric volumetric videos offer immersive free-viewpoint experiences, yet existing methods focus either on replaying general dynamic scenes or animating human avatars, limiting their ability to re-perform general dynamic scenes. In this paper, we present RePerformer, a novel Gaussian-based representation that unifies playback and re-performance for high-fidelity human-centric volumetric vide… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: Accepted by CVPR 2025. Project Page: https://moqiyinlun.github.io/Reperformer/

  9. arXiv:2412.13183  [pdf, other

    cs.CV

    Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures

    Authors: Guoxing Sun, Rishabh Dabral, Heming Zhu, Pascal Fua, Christian Theobalt, Marc Habermann

    Abstract: Real-time free-view human rendering from sparse-view RGB inputs is a challenging task due to the sensor scarcity and the tight time budget. To ensure efficiency, recent methods leverage 2D CNNs operating in texture space to learn rendering primitives. However, they either jointly learn geometry and appearance, or completely ignore sparse image information for geometry estimation, significantly har… ▽ More

    Submitted 14 April, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted at CVPR 2025, Project page: https://vcai.mpi-inf.mpg.de/projects/DUT/

  10. arXiv:2412.05066  [pdf, other

    cs.CV cs.GR cs.RO

    BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects

    Authors: Wanyue Zhang, Rishabh Dabral, Vladislav Golyanik, Vasileios Choutas, Eduardo Alvarado, Thabo Beeler, Marc Habermann, Christian Theobalt

    Abstract: We present BimArt, a novel generative approach for synthesizing 3D bimanual hand interactions with articulated objects. Unlike prior works, we do not rely on a reference grasp, a coarse hand trajectory, or separate modes for grasping and articulating. To achieve this, we first generate distance-based contact maps conditioned on the object trajectory with an articulation-aware feature representatio… ▽ More

    Submitted 25 March, 2025; v1 submitted 6 December, 2024; originally announced December 2024.

    Comments: CVPR2025

  11. arXiv:2411.14280  [pdf, other

    cs.CV

    EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild

    Authors: Yumeng Liu, Xiaoxiao Long, Zemin Yang, Yuan Liu, Marc Habermann, Christian Theobalt, Yuexin Ma, Wenping Wang

    Abstract: Our work aims to reconstruct hand-object interactions from a single-view image, which is a fundamental but ill-posed task. Unlike methods that reconstruct from videos, multi-view images, or predefined 3D templates, single-view reconstruction faces significant challenges due to inherent ambiguities and occlusions. These challenges are further amplified by the diverse nature of hand poses and the va… ▽ More

    Submitted 29 April, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: Project page: https://lym29.github.io/EasyHOI-page/

  12. arXiv:2411.07138  [pdf, other

    cs.CV

    Nuremberg Letterbooks: A Multi-Transcriptional Dataset of Early 15th Century Manuscripts for Document Analysis

    Authors: Martin Mayr, Julian Krenz, Katharina Neumeier, Anna Bub, Simon Bürcky, Nina Brolich, Klaus Herbers, Mechthild Habermann, Peter Fleischmann, Andreas Maier, Vincent Christlein

    Abstract: Most datasets in the field of document analysis utilize highly standardized labels, which, while simplifying specific tasks, often produce outputs that are not directly applicable to humanities research. In contrast, the Nuremberg Letterbooks dataset, which comprises historical documents from the early 15th century, addresses this gap by providing multiple types of transcriptions and accompanying… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

  13. arXiv:2411.04249  [pdf, other

    cs.CV

    PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing

    Authors: Siddharth Seth, Rishabh Dabral, Diogo Luvizon, Marc Habermann, Ming-Hsuan Yang, Christian Theobalt, Adam Kortylewski

    Abstract: Modeling a human avatar that can plausibly deform to articulations is an active area of research. We present PocoLoco -- the first template-free, point-based, pose-conditioned generative model for 3D humans in loose clothing. We motivate our work by noting that most methods require a parametric model of the human body to ground pose-dependent deformations. Consequently, they are restricted to mode… ▽ More

    Submitted 8 November, 2024; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: WACV 2025

  14. arXiv:2410.01835  [pdf, other

    cs.CV cs.GR

    EgoAvatar: Egocentric View-Driven and Photorealistic Full-body Avatars

    Authors: Jianchun Chen, Jian Wang, Yinda Zhang, Rohit Pandey, Thabo Beeler, Marc Habermann, Christian Theobalt

    Abstract: Immersive VR telepresence ideally means being able to interact and communicate with digital avatars that are indistinguishable from and precisely reflect the behaviour of their real counterparts. The core technical challenge is two fold: Creating a digital double that faithfully reflects the real human and tracking the real human solely from egocentric sensing devices that are lightweight and have… ▽ More

    Submitted 8 October, 2024; v1 submitted 22 September, 2024; originally announced October 2024.

    Comments: Project Page: https://vcai.mpi-inf.mpg.de/projects/EgoAvatar/

  15. Manifold Sampling for Differentiable Uncertainty in Radiance Fields

    Authors: Linjie Lyu, Ayush Tewari, Marc Habermann, Shunsuke Saito, Michael Zollhöfer, Thomas Leimkühler, Christian Theobalt

    Abstract: Radiance fields are powerful and, hence, popular models for representing the appearance of complex scenes. Yet, constructing them based on image observations gives rise to ambiguities and uncertainties. We propose a versatile approach for learning Gaussian radiance fields with explicit and fine-grained uncertainty estimates that impose only little additional cost compared to uncertainty-agnostic t… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: Siggraph Asia 2024 conference

  16. arXiv:2409.11951  [pdf, other

    cs.CV cs.GR

    GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations

    Authors: Kartik Teotia, Hyeongwoo Kim, Pablo Garrido, Marc Habermann, Mohamed Elgharib, Christian Theobalt

    Abstract: Real-time rendering of human head avatars is a cornerstone of many computer graphics applications, such as augmented reality, video games, and films, to name a few. Recent approaches address this challenge with computationally efficient geometry primitives in a carefully calibrated multi-view setup. Albeit producing photorealistic head renderings, it often fails to represent complex motion changes… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

    Comments: ACM Transaction on Graphics (SIGGRAPH Asia 2024); Project page: https://vcai.mpi-inf.mpg.de/projects/GaussianHeads/

  17. arXiv:2409.06231  [pdf, other

    cs.CV

    A Latent Implicit 3D Shape Model for Multiple Levels of Detail

    Authors: Benoit Guillard, Marc Habermann, Christian Theobalt, Pascal Fua

    Abstract: Implicit neural representations map a shape-specific latent code and a 3D coordinate to its corresponding signed distance (SDF) value. However, this approach only offers a single level of detail. Emulating low levels of detail can be achieved with shallow networks, but the generated shapes are typically not smooth. Alternatively, some network designs offer multiple levels of detail, but are limite… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: Published in GCPR 2024 proceedings

  18. arXiv:2408.15995  [pdf, other

    cs.CV

    TEDRA: Text-based Editing of Dynamic and Photoreal Actors

    Authors: Basavaraj Sunagad, Heming Zhu, Mohit Mendiratta, Adam Kortylewski, Christian Theobalt, Marc Habermann

    Abstract: Over the past years, significant progress has been made in creating photorealistic and drivable 3D avatars solely from videos of real humans. However, a core remaining challenge is the fine-grained and user-friendly editing of clothing styles by means of textual descriptions. To this end, we present TEDRA, the first method allowing text-based edits of an avatar, which maintains the avatar's high f… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: For project page, see this https://vcai.mpi-inf.mpg.de/projects/Tedra

  19. arXiv:2403.18820  [pdf, other

    cs.CV

    MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

    Authors: Guoxing Sun, Rishabh Dabral, Pascal Fua, Christian Theobalt, Marc Habermann

    Abstract: Faithful human performance capture and free-view rendering from sparse RGB observations is a long-standing problem in Vision and Graphics. The main challenges are the lack of observations and the inherent ambiguities of the setting, e.g. occlusions and depth ambiguity. As a result, radiance fields, which have shown great promise in capturing high-frequency appearance and geometry details in dense… ▽ More

    Submitted 24 July, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Project page: https://vcai.mpi-inf.mpg.de/projects/MetaCap/

  20. arXiv:2403.17936  [pdf, other

    cs.CV

    ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis

    Authors: Muhammad Hamza Mughal, Rishabh Dabral, Ikhsanul Habibie, Lucia Donatelli, Marc Habermann, Christian Theobalt

    Abstract: Gestures play a key role in human communication. Recent methods for co-speech gesture generation, while managing to generate beat-aligned motions, struggle generating gestures that are semantically aligned with the utterance. Compared to beat gestures that align naturally to the audio signal, semantically coherent gestures require modeling the complex interactions between the language and human mo… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CVPR 2024. Project Page: https://vcai.mpi-inf.mpg.de/projects/ConvoFusion/

  21. arXiv:2312.11587  [pdf, other

    cs.CV

    Relightable Neural Actor with Intrinsic Decomposition and Pose Control

    Authors: Diogo Luvizon, Vladislav Golyanik, Adam Kortylewski, Marc Habermann, Christian Theobalt

    Abstract: Creating a controllable and relightable digital avatar from multi-view video with fixed illumination is a very challenging problem since humans are highly articulated, creating pose-dependent appearance effects, and skin as well as clothing require space-varying BRDF modeling. Existing works on creating animatible avatars either do not focus on relighting at all, require controlled illumination se… ▽ More

    Submitted 26 July, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to ECCV 2024. Project page: https://vcai.mpi-inf.mpg.de/projects/RNA/

  22. arXiv:2312.07423  [pdf, other

    cs.CV

    Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras

    Authors: Ashwath Shetty, Marc Habermann, Guoxing Sun, Diogo Luvizon, Vladislav Golyanik, Christian Theobalt

    Abstract: We present the first approach to render highly realistic free-viewpoint videos of a human actor in general apparel, from sparse multi-view recording to display, in real-time at an unprecedented 4K resolution. At inference, our method only requires four camera views of the moving actor and the respective 3D skeletal pose. It handles actors in wide clothing, and reproduces even fine-scale dynamic de… ▽ More

    Submitted 25 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Project page: https://vcai.mpi-inf.mpg.de/projects/holochar/ 8 pages, 2 tables and 8 figures; presented at Computer Vision and Pattern Recognition (CVPR) 2024

  23. arXiv:2312.05941  [pdf, other

    cs.CV

    ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering

    Authors: Haokai Pang, Heming Zhu, Adam Kortylewski, Christian Theobalt, Marc Habermann

    Abstract: Real-time rendering of photorealistic and controllable human avatars stands as a cornerstone in Computer Vision and Graphics. While recent advances in neural implicit rendering have unlocked unprecedented photorealism for digital avatars, real-time performance has mostly been demonstrated for static scenes only. To address this, we propose ASH, an animatable Gaussian splatting approach for photore… ▽ More

    Submitted 15 April, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: For project page, see https://vcai.mpi-inf.mpg.de/projects/ash/

  24. arXiv:2312.05161  [pdf, other

    cs.CV

    TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis

    Authors: Heming Zhu, Fangneng Zhan, Christian Theobalt, Marc Habermann

    Abstract: Creating controllable, photorealistic, and geometrically detailed digital doubles of real humans solely from video data is a key challenge in Computer Graphics and Vision, especially when real-time performance is required. Recent methods attach a neural radiance field (NeRF) to an articulated structure, e.g., a body model or a skeleton, to map points into a pose canonical space while conditioning… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  25. arXiv:2311.17050  [pdf, other

    cs.CV cs.GR

    Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models

    Authors: Zhengming Yu, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, Yuan Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang

    Abstract: We present Surf-D, a novel method for generating high-quality 3D shapes as Surfaces with arbitrary topologies using Diffusion models. Previous methods explored shape generation with different representations and they suffer from limited topologies and poor geometry details. To generate high-quality surfaces of arbitrary topologies, we use the Unsigned Distance Field (UDF) as our surface representa… ▽ More

    Submitted 24 July, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted to ECCV 2024. Project Page: https://yzmblog.github.io/projects/SurfD/

  26. arXiv:2310.15008  [pdf, other

    cs.CV

    Wonder3D: Single Image to 3D using Cross-Domain Diffusion

    Authors: Xiaoxiao Long, Yuan-Chen Guo, Cheng Lin, Yuan Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang

    Abstract: In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images.Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry from 2D diffusion priors, but they typically suffer from time-consuming per-shape optimization and inconsistent geometry. In contrast, certain works directly pro… ▽ More

    Submitted 8 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Project page: https://www.xxlong.site/Wonder3D/

  27. arXiv:2310.11449  [pdf, other

    cs.CV cs.GR cs.LG

    DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis

    Authors: Youngjoong Kwon, Lingjie Liu, Henry Fuchs, Marc Habermann, Christian Theobalt

    Abstract: Generating controllable and photorealistic digital human avatars is a long-standing and important problem in Vision and Graphics. Recent methods have shown great progress in terms of either photorealism or inference speed while the combination of the two desired properties still remains unsolved. To this end, we propose a novel method, called DELIFFAS, which parameterizes the appearance of the hum… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  28. Discovering Fatigued Movements for Virtual Character Animation

    Authors: Noshaba Cheema, Rui Xu, Nam Hee Kim, Perttu Hämäläinen, Vladislav Golyanik, Marc Habermann, Christian Theobalt, Philipp Slusallek

    Abstract: Virtual character animation and movement synthesis have advanced rapidly during recent years, especially through a combination of extensive motion capture datasets and machine learning. A remaining challenge is interactively simulating characters that fatigue when performing extended motions, which is indispensable for the realism of generated animations. However, capturing such movements is probl… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 16 pages, 22 figures. To be published in ACM SIGGRAPH Asia Conference Papers 2023. ACM ISBN 979-8-4007-0315-7/23/12

    ACM Class: I.3.7

    Journal ref: ACM SIGGRAPH Asia Conference Papers 2023

  29. Diffusion Posterior Illumination for Ambiguity-aware Inverse Rendering

    Authors: Linjie Lyu, Ayush Tewari, Marc Habermann, Shunsuke Saito, Michael Zollhöfer, Thomas Leimkühler, Christian Theobalt

    Abstract: Inverse rendering, the process of inferring scene properties from images, is a challenging inverse problem. The task is ill-posed, as many different scene configurations can give rise to the same image. Most existing solutions incorporate priors into the inverse-rendering pipeline to encourage plausible solutions, but they do not consider the inherent ambiguities and the multi-modal distribution o… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: SIGGRAPH Asia 2023

  30. arXiv:2308.12970  [pdf, other

    cs.GR cs.LG

    NeuralClothSim: Neural Deformation Fields Meet the Thin Shell Theory

    Authors: Navami Kairanda, Marc Habermann, Christian Theobalt, Vladislav Golyanik

    Abstract: Despite existing 3D cloth simulators producing realistic results, they predominantly operate on discrete surface representations (e.g. points and meshes) with a fixed spatial resolution, which often leads to large memory consumption and resolution-dependent simulations. Moreover, back-propagating gradients through the existing solvers is difficult, and they cannot be easily integrated into modern… ▽ More

    Submitted 7 November, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 33 pages, 23 figures and 3 tables; project page: https://4dqv.mpi-inf.mpg.de/NeuralClothSim/

  31. arXiv:2308.12969  [pdf, other

    cs.CV

    ROAM: Robust and Object-Aware Motion Generation Using Neural Pose Descriptors

    Authors: Wanyue Zhang, Rishabh Dabral, Thomas Leimkühler, Vladislav Golyanik, Marc Habermann, Christian Theobalt

    Abstract: Existing automatic approaches for 3D virtual character motion synthesis supporting scene interactions do not generalise well to new objects outside training distributions, even when trained on extensive motion capture datasets with diverse objects and annotated interactions. This paper addresses this limitation and shows that robustness and generalisation to novel scene objects in 3D object-aware… ▽ More

    Submitted 15 February, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 14 pages, 11 figures; project page: https://vcai.mpi-inf.mpg.de/projects/ROAM/

    Journal ref: International Conference on 3D Vision 2024

  32. arXiv:2307.00842  [pdf, other

    cs.CV

    VINECS: Video-based Neural Character Skinning

    Authors: Zhouyingcheng Liao, Vladislav Golyanik, Marc Habermann, Christian Theobalt

    Abstract: Rigging and skinning clothed human avatars is a challenging task and traditionally requires a lot of manual work and expertise. Recent methods addressing it either generalize across different characters or focus on capturing the dynamics of a single character observed under different pose configurations. However, the former methods typically predict solely static skinning weights, which perform po… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  33. arXiv:2305.01599  [pdf, other

    cs.CV cs.GR

    EgoLocate: Real-time Motion Capture, Localization, and Mapping with Sparse Body-mounted Sensors

    Authors: Xinyu Yi, Yuxiao Zhou, Marc Habermann, Vladislav Golyanik, Shaohua Pan, Christian Theobalt, Feng Xu

    Abstract: Human and environment sensing are two important topics in Computer Vision and Graphics. Human motion is often captured by inertial sensors, while the environment is mostly reconstructed using cameras. We integrate the two techniques together in EgoLocate, a system that simultaneously performs human motion capture (mocap), localization, and mapping in real time from sparse body-mounted sensors, inc… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted by SIGGRAPH 2023. Project page: https://xinyu-yi.github.io/EgoLocate/

  34. arXiv:2302.07672  [pdf, other

    cs.GR

    LiveHand: Real-time and Photorealistic Neural Hand Rendering

    Authors: Akshay Mundra, Mallikarjun B R, Jiayi Wang, Marc Habermann, Christian Theobalt, Mohamed Elgharib

    Abstract: The human hand is the main medium through which we interact with our surroundings, making its digitization an important problem. While there are several works modeling the geometry of hands, little attention has been paid to capturing photo-realistic appearance. Moreover, for applications in extended reality and gaming, real-time rendering is critical. We present the first neural-implicit approach… ▽ More

    Submitted 20 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Project page: https://vcai.mpi-inf.mpg.de/projects/LiveHand/ | Accepted at ICCV '23 | 11 pages, 7 figures

  35. arXiv:2301.05175  [pdf, other

    cs.CV

    Scene-Aware 3D Multi-Human Motion Capture from a Single Camera

    Authors: Diogo Luvizon, Marc Habermann, Vladislav Golyanik, Adam Kortylewski, Christian Theobalt

    Abstract: In this work, we consider the problem of estimating the 3D position of multiple humans in a scene as well as their body shape and articulation from a single RGB video recorded with a static camera. In contrast to expensive marker-based or multi-view systems, our lightweight setup is ideal for private users as it enables an affordable 3D motion capture that is easy to install and does not require e… ▽ More

    Submitted 27 March, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted to Eurographics 2023. See also github: https://github.com/dluvizon/scene-aware-3d-multi-human project page: https://vcai.mpi-inf.mpg.de/projects/scene-aware-3d-multi-human/

  36. arXiv:2212.05231  [pdf, other

    cs.CV cs.GR

    NeuS2: Fast Learning of Neural Implicit Surfaces for Multi-view Reconstruction

    Authors: Yiming Wang, Qin Han, Marc Habermann, Kostas Daniilidis, Christian Theobalt, Lingjie Liu

    Abstract: Recent methods for neural surface representation and rendering, for example NeuS, have demonstrated the remarkably high-quality reconstruction of static scenes. However, the training of NeuS takes an extremely long time (8 hours), which makes it almost impossible to apply them to dynamic scenes with thousands of frames. We propose a fast neural surface reconstruction approach, called NeuS2, which… ▽ More

    Submitted 16 November, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

    Comments: ICCV 2023

  37. arXiv:2210.15664  [pdf, other

    cs.CV cs.GR

    State of the Art in Dense Monocular Non-Rigid 3D Reconstruction

    Authors: Edith Tretschk, Navami Kairanda, Mallikarjun B R, Rishabh Dabral, Adam Kortylewski, Bernhard Egger, Marc Habermann, Pascal Fua, Christian Theobalt, Vladislav Golyanik

    Abstract: 3D reconstruction of deformable (or non-rigid) scenes from a set of monocular 2D image observations is a long-standing and actively researched area of computer vision and graphics. It is an ill-posed inverse problem, since -- without additional prior assumptions -- it permits infinitely many solutions leading to accurate projection to the input 2D images. Non-rigid reconstruction is a foundational… ▽ More

    Submitted 24 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 36 pages, 18 figures, 3 tables; State-of-the-Art Report at EUROGRAPHICS 2023

    Journal ref: Computer Graphics Forum, 2023

  38. arXiv:2210.12003  [pdf, other

    cs.CV

    HDHumans: A Hybrid Approach for High-fidelity Digital Humans

    Authors: Marc Habermann, Lingjie Liu, Weipeng Xu, Gerard Pons-Moll, Michael Zollhoefer, Christian Theobalt

    Abstract: Photo-real digital human avatars are of enormous importance in graphics, as they enable immersive communication over the globe, improve gaming and entertainment experiences, and can be particularly beneficial for AR and VR settings. However, current avatar generation approaches either fall short in high-fidelity novel view synthesis, generalization to novel motions, reproduction of loose clothing,… ▽ More

    Submitted 14 July, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  39. arXiv:2210.05665  [pdf, other

    cs.CV cs.AI cs.GR

    HiFECap: Monocular High-Fidelity and Expressive Capture of Human Performances

    Authors: Yue Jiang, Marc Habermann, Vladislav Golyanik, Christian Theobalt

    Abstract: Monocular 3D human performance capture is indispensable for many applications in computer graphics and vision for enabling immersive experiences. However, detailed capture of humans requires tracking of multiple aspects, including the skeletal pose, the dynamic surface, which includes clothing, hand gestures as well as facial expressions. No existing monocular method allows joint tracking of all t… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Got accepted by BMVC2022

  40. arXiv:2207.13607  [pdf, other

    cs.CV

    Neural Radiance Transfer Fields for Relightable Novel-view Synthesis with Global Illumination

    Authors: Linjie Lyu, Ayush Tewari, Thomas Leimkuehler, Marc Habermann, Christian Theobalt

    Abstract: Given a set of images of a scene, the re-rendering of this scene from novel views and lighting conditions is an important and challenging problem in Computer Vision and Graphics. On the one hand, most existing works in Computer Vision usually impose many assumptions regarding the image formation process, e.g. direct illumination and predefined materials, to make scene parameter estimation tractabl… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  41. arXiv:2206.08368  [pdf, other

    cs.CV

    Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation Model

    Authors: Erik C. M. Johnson, Marc Habermann, Soshi Shimada, Vladislav Golyanik, Christian Theobalt

    Abstract: Capturing general deforming scenes from monocular RGB video is crucial for many computer graphics and vision applications. However, current approaches suffer from drawbacks such as struggling with large scene deformations, inaccurate shape completion or requiring 2D point tracks. In contrast, our method, Ub4D, handles large deformations, performs shape completion in occluded regions, and can opera… ▽ More

    Submitted 4 May, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 26 pages, 17 figures, 8 tables

  42. arXiv:2205.12947  [pdf, other

    math.SG math.AG

    Homological Berglund-Hübsch-Henningson mirror symmetry for curve singularities

    Authors: Matthew Habermann

    Abstract: In this article, we establish homological Berglund--Hübsch mirror symmetry for curve singularities where the A--model incorporates equivariance, otherwise known as homological Berglund--Hübsch--Henningson mirror symmetry, including for certain deformations of categories. More precisely, we prove a conjecture of Futaki and Ueda in arXiv:1004.0078 which posits that the equivariance in the A-model ca… ▽ More

    Submitted 25 March, 2025; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: 48 pages, 12 figures. Comments welcome. V2 minor updates. V3 name change, expanded and clarified arguments and added section on deformations. To appear in Mathematical Proceedings of the Cambridge Philosophical Society

    MSC Class: 53D37; 16G50

  43. arXiv:2204.05090  [pdf, other

    physics.flu-dyn physics.app-ph

    Direct 3D observation and unraveling of electroconvection phenomena during concentration polarization at ion-exchange membranes

    Authors: Felix Stockmeier, Michael Schatz, Malte Habermann, John Linkhorst, Ali Mani, Matthias Wessling

    Abstract: A decade ago, two-dimensional microscopic flow visualization proved the theoretically predicted existence of electroconvection roles as well as their decisive role in destabilizing the concentration polarization layer at ion-selective fluid/membrane interfaces. Electroconvection induces chaotic flow vortices injecting volume having bulk concentration into the ion-depleted diffusion layer at the in… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

  44. arXiv:2203.08528  [pdf, other

    cs.GR

    Physical Inertial Poser (PIP): Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors

    Authors: Xinyu Yi, Yuxiao Zhou, Marc Habermann, Soshi Shimada, Vladislav Golyanik, Christian Theobalt, Feng Xu

    Abstract: Motion capture from sparse inertial sensors has shown great potential compared to image-based approaches since occlusions do not lead to a reduced tracking quality and the recording space is not restricted to be within the viewing frustum of the camera. However, capturing the motion and global position only from a sparse set of inertial sensors is inherently ambiguous and challenging. In consequen… ▽ More

    Submitted 16 March, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022 with 3 strong accepts. Project page: https://xinyu-yi.github.io/PIP/

  45. arXiv:2111.10563  [pdf, other

    cs.CV

    A Deeper Look into DeepCap

    Authors: Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

    Abstract: Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality. Many previous performance capture approaches either required expensive multi-view setups or did not recover dense space-time coherent geometry with frame-to-frame correspondences. We propose a novel deep learning approach for monocular dense human perfor… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2003.08325

  46. arXiv:2107.02407  [pdf, other

    cs.CV

    NRST: Non-rigid Surface Tracking from Monocular Video

    Authors: Marc Habermann, Weipeng Xu, Helge Rhodin, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

    Abstract: We propose an efficient method for non-rigid surface tracking from monocular RGB videos. Given a video and a template mesh, our algorithm sequentially registers the template non-rigidly to each frame. We formulate the per-frame registration as an optimization problem that includes a novel texture term specifically tailored towards tracking objects with uniform texture but fine-scale structure, suc… ▽ More

    Submitted 12 July, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

  47. arXiv:2106.02019  [pdf, other

    cs.CV cs.GR cs.LG

    Neural Actor: Neural Free-view Synthesis of Human Actors with Pose Control

    Authors: Lingjie Liu, Marc Habermann, Viktor Rudnev, Kripasindhu Sarkar, Jiatao Gu, Christian Theobalt

    Abstract: We propose Neural Actor (NA), a new method for high-quality synthesis of humans from arbitrary viewpoints and under arbitrary controllable poses. Our method is built upon recent neural scene representation and rendering works which learn representations of geometry and appearance from only 2D images. While existing works demonstrated compelling rendering of static scenes and playback of dynamic sc… ▽ More

    Submitted 4 January, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

  48. Real-time Deep Dynamic Characters

    Authors: Marc Habermann, Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

    Abstract: We propose a deep videorealistic 3D human character model displaying highly realistic shape, motion, and dynamic appearance learned in a new weakly supervised way from multi-view imagery. In contrast to previous work, our controllable 3D character displays dynamics, e.g., the swing of the skirt, dependent on skeletal body motion in an efficient data-driven way, without requiring complex physics si… ▽ More

    Submitted 31 August, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Journal ref: ACM Transactions on Graphics (SIGGRAPH 2021)

  49. arXiv:2104.00359  [pdf, other

    cs.CV

    Efficient and Differentiable Shadow Computation for Inverse Problems

    Authors: Linjie Lyu, Marc Habermann, Lingjie Liu, Mallikarjun B R, Ayush Tewari, Christian Theobalt

    Abstract: Differentiable rendering has received increasing interest for image-based inverse problems. It can benefit traditional optimization-based solutions to inverse problems, but also allows for self-supervision of learning-based approaches for which training data with ground truth annotation is hard to obtain. However, existing differentiable renderers either do not model visibility of the light source… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

  50. arXiv:2101.12178  [pdf, other

    math.AG math.SG

    Homological mirror symmetry for nodal stacky curves

    Authors: Matthew Habermann

    Abstract: In this paper, we establish homological mirror symmetry where the A-model is a finite quotient of the Milnor fibre of an invertible curve singularity, proving a conjecture of Lekili and Ueda from arXiv:1806.04345 in this dimension. Our strategy is to view the B--model as a cycle of stacky projective lines and generalise the approach of Lekili and Polishchuk in arXiv:1705.06023 to allow the irreduc… ▽ More

    Submitted 8 November, 2023; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 27 pages plus appendix, 9 figures. Comments welcome. V4 moved section on root stacks to appendix. To appear in Mathematical Research Letters

    MSC Class: 53D37