Showing 1–15 of 15 results for author: Schönberger, J L

  1. arXiv:2504.20040

    cs.CV cs.RO

    MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion

    Authors: Zador Pataki, Paul-Edouard Sarlin, Johannes L. Schönberger, Marc Pollefeys

    Abstract: While Structure-from-Motion (SfM) has seen much progress over the years, state-of-the-art systems are prone to failure when facing extreme viewpoint changes in low-overlap, low-parallax or high-symmetry scenarios. Because capturing images that avoid these pitfalls is challenging, this severely limits the wider use of SfM, especially by non-expert users. We overcome these limitations by augmenting…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: CVPR 2025

  2. arXiv:2409.19811

    cs.CV

    Robust Incremental Structure-from-Motion with Hybrid Features

    Authors: Shaohui Liu, Yidan Gao, Tianyi Zhang, Rémi Pautrat, Johannes L. Schönberger, Viktor Larsson, Marc Pollefeys

    Abstract: Structure-from-Motion (SfM) has become a ubiquitous tool for camera calibration and scene reconstruction with many downstream applications in computer vision and beyond. While the state-of-the-art SfM pipelines have reached a high level of maturity in well-textured and well-configured scenes over the last decades, they still fall short of robustly solving the SfM problem in challenging scenarios.…

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: 40 pages, 16 figures, 9 tables. To appear in ECCV 2024

  3. arXiv:2407.20219

    cs.CV

    Global Structure-from-Motion Revisited

    Authors: Linfei Pan, Dániel Baráth, Marc Pollefeys, Johannes L. Schönberger

    Abstract: Recovering 3D structure and camera motion from images has been a long-standing focus of computer vision research and is known as Structure-from-Motion (SfM). Solutions to this problem are categorized into incremental and global approaches. Until now, the most popular systems follow the incremental paradigm due to its superior accuracy and robustness, while global approaches are drastically more sc…

    Submitted 22 September, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  4. arXiv:2311.18068

    cs.CV

    ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction

    Authors: Silvan Weder, Francis Engelmann, Johannes L. Schönberger, Akihito Seki, Marc Pollefeys, Martin R. Oswald

    Abstract: We propose an online 3D semantic segmentation method that incrementally reconstructs a 3D semantic map from a stream of RGB-D frames. Unlike offline methods, ours is directly applicable to scenarios with real-time constraints, such as robotics or mixed reality. To overcome the inherent challenges of online methods, we make two main contributions. First, to effectively extract information from the…

    Submitted 3 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  5. arXiv:2210.10770

    cs.CV

    LaMAR: Benchmarking Localization and Mapping for Augmented Reality

    Authors: Paul-Edouard Sarlin, Mihai Dusmanu, Johannes L. Schönberger, Pablo Speciale, Lukas Gruber, Viktor Larsson, Ondrej Miksik, Marc Pollefeys

    Abstract: Localization and mapping is the foundational technology for augmented reality (AR) that enables sharing and persistence of digital content in the real world. While significant progress has been made, researchers are still mostly driven by unrealistic benchmarks not representative of real-world AR scenarios. These benchmarks are often based on small-scale datasets with low scene diversity, captured…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at ECCV 2022, website at https://lamar.ethz.ch/

  6. arXiv:2109.04409

    cs.CV

    Reconstructing and grounding narrated instructional videos in 3D

    Authors: Dimitri Zhukov, Ignacio Rocco, Ivan Laptev, Josef Sivic, Johannes L. Schönberger, Bugra Tekin, Marc Pollefeys

    Abstract: Narrated instructional videos often show and describe manipulations of similar objects, e.g., repairing a particular model of a car or laptop. In this work we aim to reconstruct such objects and to localize associated narrations in 3D. Contrary to the standard scenario of instance-level 3D reconstruction, where identical objects or scenes are present in all views, objects in different instructiona…

    Submitted 10 September, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

  7. arXiv:2012.01377

    cs.CV

    Cross-Descriptor Visual Localization and Mapping

    Authors: Mihai Dusmanu, Ondrej Miksik, Johannes L. Schönberger, Marc Pollefeys

    Abstract: Visual localization and mapping is the key technology underlying the majority of mixed reality and robotics systems. Most state-of-the-art approaches rely on local features to establish correspondences between images. In this paper, we present three novel scenarios for localization and mapping which require the continuous update of feature representations and the ability to match across different…

    Submitted 21 September, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: Accepted at ICCV 2021. 18 pages, 15 figures, 6 tables

  8. arXiv:2011.14791

    cs.CV

    NeuralFusion: Online Depth Fusion in Latent Space

    Authors: Silvan Weder, Johannes L. Schönberger, Marc Pollefeys, Martin R. Oswald

    Abstract: We present a novel online depth map fusion approach that learns depth map aggregation in a latent feature space. While previous fusion methods use an explicit scene representation like signed distance functions (SDFs), we propose a learned feature representation for the fusion. The key idea is a separation between the scene representation used for the fusion and the output scene representation, vi…

    Submitted 8 June, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

  9. arXiv:2008.11239

    cs.CV

    HoloLens 2 Research Mode as a Tool for Computer Vision Research

    Authors: Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

    Abstract: Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research. In this technical report, we present HoloLens 2 Research Mode, an API and a set of tools enabling access to the raw sensor streams. We provide an overview of the API and explain how it can be used to build mixed…

    Submitted 25 August, 2020; originally announced August 2020.

  10. arXiv:2006.06634

    cs.CV

    Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings

    Authors: Mihai Dusmanu, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys

    Abstract: Many computer vision systems require users to upload image features to the cloud for processing and storage. These features can be exploited to recover sensitive information about the scene or subjects, e.g., by reconstructing the appearance of the original image. To address this privacy concern, we propose a new privacy-preserving feature representation. The core idea of our work is to drop const…

    Submitted 30 March, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted at CVPR 2021. 16 pages, 10 figures, 4 tables

  11. arXiv:2003.08348

    cs.CV

    Multi-View Optimization of Local Feature Geometry

    Authors: Mihai Dusmanu, Johannes L. Schönberger, Marc Pollefeys

    Abstract: In this work, we address the problem of refining the geometry of local image features from multiple views without known scene or camera geometry. Current approaches to local feature detection are inherently limited in their keypoint localization accuracy because they only operate on a single view. This limitation has a negative impact on downstream tasks such as Structure-from-Motion, where inaccu…

    Submitted 22 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Accepted at ECCV 2020. 28 pages, 11 figures, 6 tables

  12. arXiv:2001.04388

    cs.CV

    RoutedFusion: Learning Real-time Depth Map Fusion

    Authors: Silvan Weder, Johannes L. Schönberger, Marc Pollefeys, Martin R. Oswald

    Abstract: The efficient fusion of depth maps is a key part of most state-of-the-art 3D reconstruction methods. Besides requiring high accuracy, these depth fusion methods need to be scalable and real-time capable. To this end, we present a novel real-time capable machine learning-based method for depth map fusion. Similar to the seminal depth map fusion approach by Curless and Levoy, we only update a local…

    Submitted 3 April, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 11 pages, 8 figures, accepted to CVPR 2020

  13. arXiv:1903.05572

    cs.CV

    Privacy Preserving Image-Based Localization

    Authors: Pablo Speciale, Johannes L. Schönberger, Sing Bing Kang, Sudipta N. Sinha, Marc Pollefeys

    Abstract: Image-based localization is a core component of many augmented/mixed reality (AR/MR) and autonomous robotic systems. Current localization systems rely on the persistent storage of 3D point clouds of the scene to enable camera pose estimation, but such data reveals potentially sensitive scene information. This gives rise to significant privacy risks, especially as for many applications 3D mapping i…

    Submitted 13 March, 2019; originally announced March 2019.

  14. arXiv:1712.05773

    cs.CV

    Semantic Visual Localization

    Authors: Johannes L. Schönberger, Marc Pollefeys, Andreas Geiger, Torsten Sattler

    Abstract: Robust visual localization under a wide range of viewing conditions is a fundamental problem in computer vision. Handling the difficult cases of this problem is not only very challenging but also of high practical relevance, e.g., in the context of life-long localization for augmented reality or autonomous robots. In this paper, we propose a novel approach based on a joint 3D geometric and semanti…

    Submitted 16 April, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

  15. scikit-image: Image processing in Python

    Authors: Stefan van der Walt, Johannes L. Schönberger, Juan Nunez-Iglesias, François Boulogne, Joshua D. Warner, Neil Yager, Emmanuelle Gouillart, Tony Yu, the scikit-image contributors

    Abstract: scikit-image is an image processing library that implements algorithms and utilities for use in research, education and industry applications. It is released under the liberal "Modified BSD" open source license, provides a well-documented API in the Python programming language, and is developed by an active, international team of collaborators. In this paper we highlight the advantages of open sou…

    Submitted 23 July, 2014; originally announced July 2014.

    Comments: Distributed under Creative Commons CC-BY 4.0. Published in PeerJ