Skip to main content

Showing 1–14 of 14 results for author: Mihajlovic, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.06390  [pdf, other

    cs.CV

    SplatFormer: Point Transformer for Robust 3D Gaussian Splatting

    Authors: Yutong Chen, Marko Mihajlovic, Xiyi Chen, Yiming Wang, Sergey Prokudin, Siyu Tang

    Abstract: 3D Gaussian Splatting (3DGS) has recently transformed photorealistic reconstruction, achieving high visual fidelity and real-time performance. However, rendering quality significantly deteriorates when test views deviate from the camera angles used during training, posing a major challenge for applications in immersive free-viewpoint rendering and navigation. In this work, we conduct a comprehensi… ▽ More

    Submitted 10 March, 2025; v1 submitted 10 November, 2024; originally announced November 2024.

    Comments: ICLR 2025

  2. arXiv:2410.05050  [pdf, other

    cs.LG cs.AI stat.ML

    FreSh: Frequency Shifting for Accelerated Neural Representation Learning

    Authors: Adam Kania, Marko Mihajlovic, Sergey Prokudin, Jacek Tabor, Przemysław Spurek

    Abstract: Implicit Neural Representations (INRs) have recently gained attention as a powerful approach for continuously representing signals such as images, videos, and 3D shapes using multilayer perceptrons (MLPs). However, MLPs are known to exhibit a low-frequency bias, limiting their ability to capture high-frequency details accurately. This limitation is typically addressed by incorporating high-frequen… ▽ More

    Submitted 8 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Code at https://github.com/gmum/FreSh/

  3. arXiv:2409.20140  [pdf, other

    cs.CV cs.GR

    RISE-SDF: a Relightable Information-Shared Signed Distance Field for Glossy Object Inverse Rendering

    Authors: Deheng Zhang, Jingyu Wang, Shaofei Wang, Marko Mihajlovic, Sergey Prokudin, Hendrik P. A. Lensch, Siyu Tang

    Abstract: In this paper, we propose a novel end-to-end relightable neural inverse rendering system that achieves high-quality reconstruction of geometry and material properties, thus enabling high-quality relighting. The cornerstone of our method is a two-stage approach for learning a better factorization of scene parameters. In the first stage, we develop a reflection-aware radiance field using a neural si… ▽ More

    Submitted 10 October, 2024; v1 submitted 30 September, 2024; originally announced September 2024.

    Comments: https://dehezhang2.github.io/RISE-SDF/

  4. arXiv:2409.11211  [pdf, other

    cs.CV

    SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

    Authors: Marko Mihajlovic, Sergey Prokudin, Siyu Tang, Robert Maier, Federica Bogo, Tony Tung, Edmond Boyer

    Abstract: Digitizing 3D static scenes and 4D dynamic events from multi-view images has long been a challenge in computer vision and graphics. Recently, 3D Gaussian Splatting (3DGS) has emerged as a practical and scalable reconstruction method, gaining popularity due to its impressive reconstruction quality, real-time rendering capabilities, and compatibility with widely used visualization tools. However, th… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: ECCV 2024 paper. The project page and code are available at https://markomih.github.io/SplatFields/

  5. arXiv:2406.03625  [pdf, other

    cs.CV cs.AI

    Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories

    Authors: Yan Zhang, Sergey Prokudin, Marko Mihajlovic, Qianli Ma, Siyu Tang

    Abstract: Understanding the dynamics of generic 3D scenes is fundamentally challenging in computer vision, essential in enhancing applications related to scene reconstruction, motion tracking, and avatar creation. In this work, we address the task as the problem of inferring dense, long-range motion of 3D points. By observing a set of point trajectories, we aim to learn an implicit motion field parameterize… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: cvpr24 post camera ready

  6. arXiv:2401.04728  [pdf, other

    cs.CV cs.AI

    Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

    Authors: Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tang

    Abstract: Recent advances in generative diffusion models have enabled the previously unfeasible capability of generating 3D assets from a single input image or a text prompt. In this work, we aim to enhance the quality and functionality of these models for the task of creating controllable, photorealistic human avatars. We achieve this by integrating a 3D morphable model into the state-of-the-art multi-view… ▽ More

    Submitted 2 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: [CVPR 2024] Project page: https://xiyichen.github.io/morphablediffusion/

  7. arXiv:2312.09228  [pdf, other

    cs.CV

    3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting

    Authors: Zhiyin Qian, Shaofei Wang, Marko Mihajlovic, Andreas Geiger, Siyu Tang

    Abstract: We introduce an approach that creates animatable human avatars from monocular videos using 3D Gaussian Splatting (3DGS). Existing methods based on neural radiance fields (NeRFs) achieve high-quality novel-view/novel-pose image synthesis but often require days of training, and are extremely slow at inference time. Recently, the community has explored fast grid structures for efficient training of c… ▽ More

    Submitted 4 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Project page: https://neuralbodies.github.io/3DGS-Avatar

  8. arXiv:2309.03160  [pdf, other

    cs.CV

    ResFields: Residual Neural Fields for Spatiotemporal Signals

    Authors: Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys, Siyu Tang

    Abstract: Neural fields, a category of neural networks trained to represent high-frequency signals, have gained significant attention in recent years due to their impressive performance in modeling complex 3D data, such as signed distance (SDFs) or radiance fields (NeRFs), via a single multi-layer perceptron (MLP). However, despite the power and simplicity of representing signals with an MLP, these methods… ▽ More

    Submitted 11 February, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: [ICLR 2024 Spotlight] Project and code at: https://markomih.github.io/ResFields/

  9. arXiv:2205.04992  [pdf, other

    cs.CV

    KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

    Authors: Marko Mihajlovic, Aayush Bansal, Michael Zollhoefer, Siyu Tang, Shunsuke Saito

    Abstract: Image-based volumetric humans using pixel-aligned features promise generalization to unseen poses and identities. Prior work leverages global spatial encodings and multi-view geometric consistency to reduce spatial ambiguity. However, global encodings often suffer from overfitting to the distribution of the training data, and it is difficult to learn multi-view consistent reconstruction from spars… ▽ More

    Submitted 21 July, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: To appear at ECCV 2022. The project page is available at https://markomih.github.io/KeypointNeRF

  10. arXiv:2204.06184  [pdf, other

    cs.CV

    COAP: Compositional Articulated Occupancy of People

    Authors: Marko Mihajlovic, Shunsuke Saito, Aayush Bansal, Michael Zollhoefer, Siyu Tang

    Abstract: We present a novel neural implicit representation for articulated human bodies. Compared to explicit template meshes, neural implicit body representations provide an efficient mechanism for modeling interactions with the environment, which is essential for human motion reconstruction and synthesis in 3D scenes. However, existing neural implicit bodies suffer from either poor generalization on high… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: To appear at CVPR 2022. The project page is available at https://neuralbodies.github.io/COAP/index.html

  11. arXiv:2106.11944  [pdf, other

    cs.CV

    MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images

    Authors: Shaofei Wang, Marko Mihajlovic, Qianli Ma, Andreas Geiger, Siyu Tang

    Abstract: In this paper, we aim to create generalizable and controllable neural signed distance fields (SDFs) that represent clothed humans from monocular depth observations. Recent advances in deep learning, especially neural implicit representations, have enabled human shape reconstruction and controllable avatar generation from different sensor inputs. However, to generate realistic cloth deformations fr… ▽ More

    Submitted 20 January, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 final camera-ready revision. Project page: https://neuralbodies.github.io/metavatar/

  12. arXiv:2104.06849  [pdf, other

    cs.CV cs.AI

    LEAP: Learning Articulated Occupancy of People

    Authors: Marko Mihajlovic, Yan Zhang, Michael J. Black, Siyu Tang

    Abstract: Substantial progress has been made on modeling rigid 3D objects using deep implicit representations. Yet, extending these methods to learn neural models of human shape is still in its infancy. Human bodies are complex and the key challenge is to learn a representation that generalizes such that it can express body shape deformations for unseen subjects in unseen, highly-articulated, poses. To addr… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021

  13. arXiv:2012.14240  [pdf, other

    cs.CV

    DeepSurfels: Learning Online Appearance Fusion

    Authors: Marko Mihajlovic, Silvan Weder, Marc Pollefeys, Martin R. Oswald

    Abstract: We present DeepSurfels, a novel hybrid scene representation for geometry and appearance information. DeepSurfels combines explicit and neural building blocks to jointly encode geometry and appearance information. In contrast to established representations, DeepSurfels better represents high-frequency textures, is well-suited for online updates of appearance information, and can be easily combined… ▽ More

    Submitted 30 May, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: In Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021

  14. arXiv:1911.00262  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    Finding the most similar textual documents using Case-Based Reasoning

    Authors: Marko Mihajlovic, Ning Xiong

    Abstract: In recent years, huge amounts of unstructured textual data on the Internet are a big difficulty for AI algorithms to provide the best recommendations for users and their search queries. Since the Internet became widespread, a lot of research has been done in the field of Natural Language Processing (NLP) and machine learning. Almost every solution transforms documents into Vector Space Models (VSM… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.