
Showing 1–10 of 10 results for author: Guzov, V

Searching in archive cs.
  1. arXiv:2412.15664

    cs.CV

    SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control

    Authors: Xiaohan Zhang, Sebastian Starke, Vladimir Guzov, Zhensong Zhang, Eduardo Pérez Pellitero, Gerard Pons-Moll

    Abstract: Synthesizing natural human motion that adapts to complex environments while allowing creative control remains a fundamental challenge in motion synthesis. Existing models often fall short, either by assuming flat terrain or lacking the ability to control motion semantics through text. To address these limitations, we introduce SCENIC, a diffusion model designed to generate human motion that adapts…

    Submitted 20 December, 2024; originally announced December 2024.

  2. arXiv:2410.17858

    cs.CV cs.GR

    Blendify -- Python rendering framework for Blender

    Authors: Vladimir Guzov, Ilya A. Petrov, Gerard Pons-Moll

    Abstract: With the rapid growth of the volume of research fields like computer vision and computer graphics, researchers require effective and user-friendly rendering tools to visualize results. While advanced tools like Blender offer powerful capabilities, they also require a significant effort to master. This technical report introduces Blendify, a lightweight Python-based framework that seamlessly integr…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: Project page: https://virtualhumans.mpi-inf.mpg.de/blendify/

  3. arXiv:2409.18127

    cs.CV

    EgoLM: Multi-Modal Language Model of Egocentric Motions

    Authors: Fangzhou Hong, Vladimir Guzov, Hyo Jin Kim, Yuting Ye, Richard Newcombe, Ziwei Liu, Lingni Ma

    Abstract: As the prevalence of wearable devices, learning egocentric motions becomes essential to develop contextual AI. In this work, we present EgoLM, a versatile framework that tracks and understands egocentric motions from multi-modal inputs, e.g., egocentric videos and motion sensors. EgoLM exploits rich contexts for the disambiguation of egomotion tracking and understanding, which are ill-posed under…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Project Page: https://hongfz16.github.io/projects/EgoLM

  4. arXiv:2409.13426

    cs.CV

    HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device

    Authors: Vladimir Guzov, Yifeng Jiang, Fangzhou Hong, Gerard Pons-Moll, Richard Newcombe, C. Karen Liu, Yuting Ye, Lingni Ma

    Abstract: This paper investigates the generation of realistic full-body human motion using a single head-mounted device with an outward-facing color camera and the ability to perform visual SLAM. To address the ambiguity of this setup, we present HMD^2, a novel system that balances motion reconstruction and generation. From a reconstruction standpoint, it aims to maximally utilize the camera streams to prod…

    Submitted 2 March, 2025; v1 submitted 20 September, 2024; originally announced September 2024.

    Comments: International Conference on 3D Vision 2025 (3DV 2025)

  5. arXiv:2406.09905

    cs.CV cs.GR

    Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild

    Authors: Lingni Ma, Yuting Ye, Fangzhou Hong, Vladimir Guzov, Yifeng Jiang, Rowan Postyeni, Luis Pesqueira, Alexander Gamino, Vijay Baiyya, Hyo Jin Kim, Kevin Bailey, David Soriano Fosas, C. Karen Liu, Ziwei Liu, Jakob Engel, Renzo De Nardi, Richard Newcombe

    Abstract: We introduce Nymeria - a large-scale, diverse, richly annotated human motion dataset collected in the wild with multiple multimodal egocentric devices. The dataset comes with a) full-body ground-truth motion; b) multiple multimodal egocentric data from Project Aria devices with videos, eye tracking, IMUs and etc; and c) a third-person perspective by an additional observer. All devices are precisel…

    Submitted 19 September, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2403.11237

    cs.CV cs.RO

    FORCE: Physics-aware Human-object Interaction

    Authors: Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Ilya Petrov, Vladimir Guzov, Helisa Dhamo, Eduardo Pérez-Pellitero, Gerard Pons-Moll

    Abstract: Interactions between human and objects are influenced not only by the object's pose and shape, but also by physical attributes such as object mass and surface friction. They introduce important motion nuances that are essential for diversity and realism. Despite advancements in recent human-object interaction methods, this aspect has been overlooked. Generating nuanced human motion presents two ch…

    Submitted 20 December, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: 24 pages, 9 figures

  7. arXiv:2205.02830

    cs.CV

    Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion

    Authors: Vladimir Guzov, Julian Chibane, Riccardo Marin, Yannan He, Yunus Saracoglu, Torsten Sattler, Gerard Pons-Moll

    Abstract: Our world is not static and humans naturally cause changes in their environments through interactions, e.g., opening doors or moving furniture. Modeling changes caused by humans is essential for building digital twins, e.g., in the context of shared physical-virtual spaces (metaverses) and robotics. In order for widespread adoption of such emerging applications, the sensor setup used to capture th…

    Submitted 18 March, 2024; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: International Conference on 3D Vision 2024 (3DV'24)

  8. arXiv:2205.00541

    cs.CV

    COUCH: Towards Controllable Human-Chair Interactions

    Authors: Xiaohan Zhang, Bharat Lal Bhatnagar, Vladimir Guzov, Sebastian Starke, Gerard Pons-Moll

    Abstract: Humans interact with an object in many different ways by making contact at different locations, creating a highly complex motion space that can be difficult to learn, particularly when synthesizing such human interactions in a controllable manner. Existing works on synthesizing human scene interaction focus on the high-level control of action but do not consider the fine-grained control of motion.…

    Submitted 1 May, 2022; originally announced May 2022.

  9. arXiv:2204.10850

    cs.CV

    Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation

    Authors: Verica Lazova, Vladimir Guzov, Kyle Olszewski, Sergey Tulyakov, Gerard Pons-Moll

    Abstract: We present a novel method for performing flexible, 3D-aware image content manipulation while enabling high-quality novel view synthesis. While NeRF-based approaches are effective for novel view synthesis, such models memorize the radiance for every point in a scene within a neural network. Since these models are scene-specific and lack a 3D scene representation, classical editing such as shape man…

    Submitted 22 April, 2022; originally announced April 2022.

  10. arXiv:2103.17265

    cs.CV

    Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors

    Authors: Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll

    Abstract: We introduce (HPS) Human POSEitioning System, a method to recover the full 3D pose of a human registered with a 3D scan of the surrounding environment using wearable sensors. Using IMUs attached at the body limbs and a head mounted camera looking outwards, HPS fuses camera based self-localization with IMU-based human body tracking. The former provides drift-free but noisy position and orientation…

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)