Skip to main content

Showing 1–11 of 11 results for author: Sandström, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10154  [pdf, other

    cs.CV cs.RO

    LoopSplat: Loop Closure by Registering 3D Gaussian Splats

    Authors: Liyuan Zhu, Yue Li, Erik Sandström, Shengyu Huang, Konrad Schindler, Iro Armeni

    Abstract: Simultaneous Localization and Mapping (SLAM) based on 3D Gaussian Splats (3DGS) has recently shown promise towards more accurate, dense 3D scene maps. However, existing 3DGS-based methods fail to address the global consistency of the scene via loop closure and/or global bundle adjustment. To this end, we propose LoopSplat, which takes RGB-D images as input and performs dense mapping with 3DGS subm… ▽ More

    Submitted 19 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: Project page: https://loopsplat.github.io/

  2. arXiv:2408.08766  [pdf, other

    cs.CV

    VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction

    Authors: Albert Gassol Puigjaner, Edoardo Mello Rella, Erik Sandström, Ajad Chhatkuli, Luc Van Gool

    Abstract: Implicit surfaces via neural radiance fields (NeRF) have shown surprising accuracy in surface reconstruction. Despite their success in reconstructing richly textured surfaces, existing methods struggle with planar regions with weak textures, which account for the majority of indoor scenes. In this paper, we address indoor dense surface reconstruction by revisiting key aspects of NeRF in order to u… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 15 pages

  3. arXiv:2405.16544  [pdf, other

    cs.CV

    Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians

    Authors: Erik Sandström, Keisuke Tateno, Michael Oechsle, Michael Niemeyer, Luc Van Gool, Martin R. Oswald, Federico Tombari

    Abstract: 3D Gaussian Splatting has emerged as a powerful representation of geometry and appearance for RGB-only dense Simultaneous Localization and Mapping (SLAM), as it provides a compact dense map representation while enabling efficient and high-quality map rendering. However, existing methods show significantly worse reconstruction quality than competing methods using other 3D representations, e.g. neur… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 21 pages

  4. arXiv:2403.19549  [pdf, other

    cs.CV cs.RO

    GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM

    Authors: Ganlin Zhang, Erik Sandström, Youmin Zhang, Manthan Patel, Luc Van Gool, Martin R. Oswald

    Abstract: Recent advancements in RGB-only dense Simultaneous Localization and Mapping (SLAM) have predominantly utilized grid-based neural implicit encodings and/or struggle to efficiently realize global map and pose consistency. To this end, we propose an efficient RGB-only dense SLAM system using a flexible neural point cloud scene representation that adapts to keyframe poses and depth updates, without ne… ▽ More

    Submitted 27 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  5. arXiv:2402.13255  [pdf, other

    cs.CV cs.RO

    How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey

    Authors: Fabio Tosi, Youmin Zhang, Ziren Gong, Erik Sandström, Stefano Mattoccia, Martin R. Oswald, Matteo Poggi

    Abstract: Over the past two decades, research in the field of Simultaneous Localization and Mapping (SLAM) has undergone a significant evolution, highlighting its critical role in enabling autonomous exploration of unknown environments. This evolution ranges from hand-crafted methods, through the era of deep learning, to more recent developments focused on Neural Radiance Fields (NeRFs) and 3D Gaussian Spla… ▽ More

    Submitted 27 March, 2025; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Updated to November 2024

  6. arXiv:2402.09944  [pdf, other

    cs.CV

    Loopy-SLAM: Dense Neural SLAM with Loop Closures

    Authors: Lorenzo Liso, Erik Sandström, Vladimir Yugay, Luc Van Gool, Martin R. Oswald

    Abstract: Neural RGBD SLAM techniques have shown promise in dense Simultaneous Localization And Mapping (SLAM), yet face challenges such as error accumulation during camera tracking resulting in distorted maps. In response, we introduce Loopy-SLAM that globally optimizes poses and the dense 3D model. We use frame-to-model tracking using a data-driven point-based submap generation method and trigger loop clo… ▽ More

    Submitted 10 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  7. arXiv:2306.11048  [pdf, other

    cs.CV

    UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM

    Authors: Erik Sandström, Kevin Ta, Luc Van Gool, Martin R. Oswald

    Abstract: We present an uncertainty learning framework for dense neural simultaneous localization and mapping (SLAM). Estimating pixel-wise uncertainties for the depth input of dense SLAM methods allows re-weighing the tracking and mapping losses towards image regions that contain more suitable information that is more reliable for SLAM. To this end, we propose an online framework for sensor uncertainty est… ▽ More

    Submitted 6 September, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: ICCV 2023 Workshop. 20 pages, 9 figures

  8. arXiv:2304.04278  [pdf, other

    cs.CV

    Point-SLAM: Dense Neural Point Cloud-based SLAM

    Authors: Erik Sandström, Yue Li, Luc Van Gool, Martin R. Oswald

    Abstract: We propose a dense neural simultaneous localization and mapping (SLAM) approach for monocular RGBD input which anchors the features of a neural scene representation in a point cloud that is iteratively generated in an input-dependent data-driven manner. We demonstrate that both tracking and mapping can be performed with the same point-based neural scene representation by minimizing an RGBD-based r… ▽ More

    Submitted 12 September, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: ICCV 2023. 18 Pages, 12 Figures

  9. arXiv:2204.03353  [pdf, other

    cs.CV

    Learning Online Multi-Sensor Depth Fusion

    Authors: Erik Sandström, Martin R. Oswald, Suryansh Kumar, Silvan Weder, Fisher Yu, Cristian Sminchisescu, Luc Van Gool

    Abstract: Many hand-held or mixed reality devices are used with a single sensor for 3D reconstruction, although they often comprise multiple sensors. Multi-sensor depth fusion is able to substantially improve the robustness and accuracy of 3D reconstruction methods, but existing techniques are not robust enough to handle sensors which operate with diverse value ranges as well as noise and outlier statistics… ▽ More

    Submitted 21 September, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Accepted to ECCV 2022. 31 pages, 17 figures, 15 Tables

  10. arXiv:2108.10992  [pdf, other

    cs.CV

    OOWL500: Overcoming Dataset Collection Bias in the Wild

    Authors: Brandon Leung, Chih-Hui Ho, Amir Persekian, David Orozco, Yen Chang, Erik Sandstrom, Bo Liu, Nuno Vasconcelos

    Abstract: The hypothesis that image datasets gathered online "in the wild" can produce biased object recognizers, e.g. preferring professional photography or certain viewing angles, is studied. A new "in the lab" data collection infrastructure is proposed consisting of a drone which captures images as it circles around objects. Crucially, the control provided by this setup and the natural camera shake inher… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

  11. arXiv:2108.05246  [pdf, other

    cs.CV

    A Real-Time Online Learning Framework for Joint 3D Reconstruction and Semantic Segmentation of Indoor Scenes

    Authors: Davide Menini, Suryansh Kumar, Martin R. Oswald, Erik Sandstrom, Cristian Sminchisescu, Luc Van Gool

    Abstract: This paper presents a real-time online vision framework to jointly recover an indoor scene's 3D structure and semantic label. Given noisy depth maps, a camera trajectory, and 2D semantic labels at train time, the proposed deep neural network based approach learns to fuse the depth over frames with suitable semantic labels in the scene space. Our approach exploits the joint volumetric representatio… ▽ More

    Submitted 28 December, 2021; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at IEEE Robotics and Automation Letters (RA-L), 2022. Draft info: 9 pages, 5 figures, 4 tables