Skip to main content

Showing 1–26 of 26 results for author: Ranftl, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.12134  [pdf, other

    cs.CV cs.RO

    Monocular Visual-Inertial Depth Estimation

    Authors: Diana Wofk, René Ranftl, Matthias Müller, Vladlen Koltun

    Abstract: We present a visual-inertial depth estimation pipeline that integrates monocular depth estimation and visual-inertial odometry to produce dense depth estimates with metric scale. Our approach performs global scale and shift alignment against sparse metric depth, followed by learning-based dense alignment. We evaluate on the TartanAir and VOID datasets, observing up to 30% reduction in inverse RMSE… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at ICRA'23

  2. arXiv:2204.08399  [pdf, other

    cs.CV

    Unsupervised Contrastive Domain Adaptation for Semantic Segmentation

    Authors: Feihu Zhang, Vladlen Koltun, Philip Torr, René Ranftl, Stephan R. Richter

    Abstract: Semantic segmentation models struggle to generalize in the presence of domain shift. In this paper, we introduce contrastive learning for feature alignment in cross-domain adaptation. We assemble both in-domain contrastive pairs and cross-domain contrastive pairs to learn discriminative features that align across domains. Based on the resulting well-aligned feature representations we introduce a l… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

  3. arXiv:2201.03546  [pdf, other

    cs.CV cs.CL cs.LG

    Language-driven Semantic Segmentation

    Authors: Boyi Li, Kilian Q. Weinberger, Serge Belongie, Vladlen Koltun, René Ranftl

    Abstract: We present LSeg, a novel model for language-driven semantic image segmentation. LSeg uses a text encoder to compute embeddings of descriptive input labels (e.g., "grass" or "building") together with a transformer-based image encoder that computes dense per-pixel embeddings of the input image. The image encoder is trained with a contrastive objective to align pixel embeddings to the text embedding… ▽ More

    Submitted 2 April, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: ICLR 2022

  4. arXiv:2112.11340  [pdf, other

    cs.CV

    Transferable End-to-end Room Layout Estimation via Implicit Encoding

    Authors: Hao Zhao, Rene Ranftl, Yurong Chen, Hongbin Zha

    Abstract: We study the problem of estimating room layouts from a single panorama image. Most former works have two stages: feature extraction and parametric model fitting. Here we propose an end-to-end method that directly predicts parametric layouts from an input panorama image. It exploits an implicit encoding procedure that embeds parametric layouts into a latent space. Then learning a mapping from image… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: Project: https://sites.google.com/view/transferrl/

  5. arXiv:2110.05113  [pdf, other

    cs.RO cs.LG eess.SY

    Learning High-Speed Flight in the Wild

    Authors: Antonio Loquercio, Elia Kaufmann, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

    Abstract: Quadrotors are agile. Unlike most other machines, they can traverse extremely complex environments at high speeds. To date, only expert human pilots have been able to fully exploit their capabilities. Autonomous operation with on-board sensing and computation has been limited to low speeds. State-of-the-art methods generally separate the navigation problem into subtasks: sensing, mapping, and plan… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 16 pages (+7 supplementary)

    Journal ref: Science Robotics 2021 Vol. 6, Issue 59, abg5810

  6. arXiv:2110.01154  [pdf, other

    cs.LG

    An Analysis of Super-Net Heuristics in Weight-Sharing NAS

    Authors: Kaicheng Yu, René Ranftl, Mathieu Salzmann

    Abstract: Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware. Existing methods in this space rely on a diverse set of heuristics to design and train the shared-weight backbone network, a.k.a. the super-net. Since heuristics substantially vary across different methods and have not been carefully studied, it is unclear to which extent they impact super-net tr… ▽ More

    Submitted 3 October, 2021; originally announced October 2021.

    Comments: Accepted to T-PAMI

  7. arXiv:2104.05309  [pdf, other

    cs.LG cs.CV

    Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search

    Authors: Kaicheng Yu, Rene Ranftl, Mathieu Salzmann

    Abstract: Weight sharing has become a de facto standard in neural architecture search because it enables the search to be done on commodity hardware. However, recent works have empirically shown a ranking disorder between the performance of stand-alone architectures and that of the corresponding shared-weight networks. This violates the main assumption of weight-sharing NAS algorithms, thus limiting their e… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  8. arXiv:2103.13413  [pdf, other

    cs.CV

    Vision Transformers for Dense Prediction

    Authors: René Ranftl, Alexey Bochkovskiy, Vladlen Koltun

    Abstract: We introduce dense vision transformers, an architecture that leverages vision transformers in place of convolutional networks as a backbone for dense prediction tasks. We assemble tokens from various stages of the vision transformer into image-like representations at various resolutions and progressively combine them into full-resolution predictions using a convolutional decoder. The transformer b… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: 15 pages

  9. arXiv:2006.05768  [pdf, other

    cs.RO

    Deep Drone Acrobatics

    Authors: Elia Kaufmann, Antonio Loquercio, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

    Abstract: Performing acrobatic maneuvers with quadrotors is extremely challenging. Acrobatic flight requires high thrust and extreme angular accelerations that push the platform to its physical limits. Professional drone pilots often measure their level of mastery by flying such maneuvers in competitions. In this paper, we propose to learn a sensorimotor policy that enables an autonomous quadrotor to fly ex… ▽ More

    Submitted 11 June, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: 8 pages + 2 pages references. Video: https://youtu.be/2N_wKXQ6MXA. Code: https://github.com/uzh-rpg/deep_drone_acrobatics

    Journal ref: Robotics, Science, and Systems (RSS), 2020

  10. arXiv:2005.08144  [pdf, other

    cs.CV cs.LG stat.ML

    High-dimensional Convolutional Networks for Geometric Pattern Recognition

    Authors: Christopher Choy, Junha Lee, Rene Ranftl, Jaesik Park, Vladlen Koltun

    Abstract: Many problems in science and engineering can be formulated in terms of geometric patterns in high-dimensional spaces. We present high-dimensional convolutional networks (ConvNets) for pattern recognition problems that arise in the context of geometric registration. We first study the effectiveness of convolutional networks in detecting linear subspaces in high-dimensional spaces with up to 32 dime… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: Accepted for CVPR 2020 oral presentation

  11. arXiv:2003.04276  [pdf, other

    cs.LG cs.CV stat.ML

    How to Train Your Super-Net: An Analysis of Training Heuristics in Weight-Sharing NAS

    Authors: Kaicheng Yu, Rene Ranftl, Mathieu Salzmann

    Abstract: Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware. Existing methods in this space rely on a diverse set of heuristics to design and train the shared-weight backbone network, a.k.a. the super-net. Since heuristics and hyperparameters substantially vary across different methods, a fair comparison between them can only be achieved by systematically… ▽ More

    Submitted 17 June, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: Updated with latest results on NASBench-101, now we achieve 0.48 sparse Kendall-Tau on this space

  12. Safe Robot Navigation via Multi-Modal Anomaly Detection

    Authors: Lorenz Wellhausen, René Ranftl, Marco Hutter

    Abstract: Navigation in natural outdoor environments requires a robust and reliable traversability classification method to handle the plethora of situations a robot can encounter. Binary classification algorithms perform well in their native domain but tend to provide overconfident predictions when presented with out-of-distribution samples, which can lead to catastrophic failure when navigating unknown en… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

  13. arXiv:1907.01341  [pdf, other

    cs.CV

    Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

    Authors: René Ranftl, Katrin Lasinger, David Hafner, Konrad Schindler, Vladlen Koltun

    Abstract: The success of monocular depth estimation relies on large and diverse training sets. Due to the challenges associated with acquiring dense ground-truth depth across different environments at scale, a number of datasets with distinct characteristics and biases have emerged. We develop tools that enable mixing multiple datasets during training, even if their annotations are incompatible. In particul… ▽ More

    Submitted 25 August, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: To appear in TPAMI (accepted August 2020)

  14. arXiv:1906.07165  [pdf, other

    cs.CV

    High Speed and High Dynamic Range Video with an Event Camera

    Authors: Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that report brightness changes in the form of a stream of asynchronous "events" instead of intensity frames. They offer significant advantages with respect to conventional cameras: high temporal resolution, high dynamic range, and no motion blur. While the stream of events encodes in principle the complete visual signal, the reconstruction of an intensity image from… ▽ More

    Submitted 15 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.08298

  15. Deep Drone Racing: From Simulation to Reality with Domain Randomization

    Authors: Antonio Loquercio, Elia Kaufmann, René Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Dynamically changing environments, unreliable state estimation, and operation under severe resource constraints are fundamental challenges that limit the deployment of small autonomous drones. We address these challenges in the context of autonomous, vision-based drone racing in dynamic environments. A racing drone must traverse a track with possibly moving gates at high speed. We enable this func… ▽ More

    Submitted 25 November, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: Accepted as a Regular Paper to the IEEE Transactions on Robotics Journal. arXiv admin note: substantial text overlap with arXiv:1806.08548

    Journal ref: IEEE Transactions on Robotics 2019

  16. arXiv:1905.06144  [pdf, other

    cs.RO

    Feedback MPC for Torque-Controlled Legged Robots

    Authors: Ruben Grandia, Farbod Farshidian, René Ranftl, Marco Hutter

    Abstract: The computational power of mobile robots is currently insufficient to achieve torque level whole-body Model Predictive Control (MPC) at the update rates required for complex dynamic systems such as legged robots. This problem is commonly circumvented by using a fast tracking controller to compensate for model errors between updates. In this work, we show that the feedback policy from a Differentia… ▽ More

    Submitted 9 August, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: Paper accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

  17. arXiv:1905.03678  [pdf, other

    cs.CV

    What Do Single-view 3D Reconstruction Networks Learn?

    Authors: Maxim Tatarchenko, Stephan R. Richter, René Ranftl, Zhuwen Li, Vladlen Koltun, Thomas Brox

    Abstract: Convolutional networks for single-view object reconstruction have shown impressive performance and have become a popular subject of research. All existing techniques are united by the idea of having an encoder-decoder network that performs non-trivial reasoning about the 3D structure of the output space. In this work, we set up two alternative approaches that perform image classification and retri… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

  18. arXiv:1904.08298  [pdf, other

    cs.CV

    Events-to-Video: Bringing Modern Computer Vision to Event Cameras

    Authors: Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that report brightness changes in the form of asynchronous "events" instead of intensity frames. They have significant advantages over conventional cameras: high temporal resolution, high dynamic range, and no motion blur. Since the output of event cameras is fundamentally different from conventional cameras, it is commonly accepted that they require the development… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 2019

  19. arXiv:1810.06224  [pdf, other

    cs.RO

    Beauty and the Beast: Optimal Methods Meet Learning for Drone Racing

    Authors: Elia Kaufmann, Mathias Gehrig, Philipp Foehn, René Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Autonomous micro aerial vehicles still struggle with fast and agile maneuvers, dynamic environments, imperfect sensing, and state estimation drift. Autonomous drone racing brings these challenges to the fore. Human pilots can fly a previously unseen track after a handful of practice runs. In contrast, state-of-the-art autonomous navigation algorithms require either a precise metric map of the envi… ▽ More

    Submitted 1 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 6 pages (+1 references)

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2019

  20. Frequency-Aware Model Predictive Control

    Authors: Ruben Grandia, Farbod Farshidian, Alexey Dosovitskiy, René Ranftl, Marco Hutter

    Abstract: Transferring solutions found by trajectory optimization to robotic hardware remains a challenging task. When the optimization fully exploits the provided model to perform dynamic tasks, the presence of unmodeled dynamics renders the motion infeasible on the real system. Model errors can be a result of model simplifications, but also naturally arise when deploying the robot in unstructured and nond… ▽ More

    Submitted 8 February, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

    Journal ref: IEEE Robotics and Automation Letters 2019

  21. arXiv:1806.08548  [pdf, other

    cs.RO

    Deep Drone Racing: Learning Agile Flight in Dynamic Environments

    Authors: Elia Kaufmann, Antonio Loquercio, Rene Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Autonomous agile flight brings up fundamental challenges in robotics, such as coping with unreliable state estimation, reacting optimally to dynamically changing environments, and coupling perception and action in real time under severe resource constraints. In this paper, we consider these challenges in the context of autonomous, vision-based drone racing in dynamic environments. Our approach com… ▽ More

    Submitted 9 October, 2018; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: Accepted for publication in the Conference on Robotic Learning (CoRL) 2018, Zurich. 10 pages (+3 supplementary)

    Journal ref: Conference on Robotic Learning (CoRL), 2018

  22. arXiv:1704.07325  [pdf, other

    cs.CV

    Accurate Optical Flow via Direct Cost Volume Processing

    Authors: Jia Xu, René Ranftl, Vladlen Koltun

    Abstract: We present an optical flow estimation approach that operates on the full four-dimensional cost volume. This direct approach shares the structural benefits of leading stereo matching pipelines, which are known to yield high accuracy. To this day, such approaches have been considered impractical due to the size of the cost volume. We show that the full four-dimensional cost volume can be constructed… ▽ More

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: Published at the Conference on Computer Vision and Pattern Recognition (CVPR 2017)

  23. A higher-order MRF based variational model for multiplicative noise reduction

    Authors: Yunjin Chen, Wensen Feng, René Ranftl, Hong Qiao, Thomas Pock

    Abstract: The Fields of Experts (FoE) image prior model, a filter-based higher-order Markov Random Fields (MRF) model, has been shown to be effective for many image restoration problems. Motivated by the successes of FoE-based approaches, in this letter, we propose a novel variational model for multiplicative noise reduction based on the FoE image prior model. The resulted model corresponds to a non-convex… ▽ More

    Submitted 7 July, 2014; v1 submitted 21 April, 2014; originally announced April 2014.

    Comments: 5 pages, 5 figures, to appear in IEEE Signal Processing Letters

  24. arXiv:1401.4112  [pdf, other

    cs.CV

    A bi-level view of inpainting - based image compression

    Authors: Yunjin Chen, René Ranftl, Thomas Pock

    Abstract: Inpainting based image compression approaches, especially linear and non-linear diffusion models, are an active research topic for lossy image compression. The major challenge in these compression models is to find a small set of descriptive supporting points, which allow for an accurate reconstruction of the original image. It turns out in practice that this is a challenging problem even for the… ▽ More

    Submitted 9 May, 2014; v1 submitted 16 January, 2014; originally announced January 2014.

    Comments: 8 pages, 4 figures, best paper award of CVWW 2014, Computer Vision Winter Workshop, Křtiny, Czech Republic, 3-5th February 2014

  25. Revisiting loss-specific training of filter-based MRFs for image restoration

    Authors: Yunjin Chen, Thomas Pock, René Ranftl, Horst Bischof

    Abstract: It is now well known that Markov random fields (MRFs) are particularly effective for modeling image priors in low-level vision. Recent years have seen the emergence of two main approaches for learning the parameters in MRFs: (1) probabilistic learning using sampling-based algorithms and (2) loss-specific training based on MAP estimate. After investigating existing training approaches, it turns out… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Comments: 10 pages, 2 figures, appear at 35th German Conference, GCPR 2013, Saarbrücken, Germany, September 3-6, 2013. Proceedings

  26. Insights into analysis operator learning: From patch-based sparse models to higher-order MRFs

    Authors: Yunjin Chen, René Ranftl, Thomas Pock

    Abstract: This paper addresses a new learning algorithm for the recently introduced co-sparse analysis model. First, we give new insights into the co-sparse analysis model by establishing connections to filter-based MRF models, such as the Field of Experts (FoE) model of Roth and Black. For training, we introduce a technique called bi-level optimization to learn the analysis operators. Compared to existing… ▽ More

    Submitted 13 January, 2014; originally announced January 2014.

    Comments: 13 pages, 10 figures, accepted to IEEE Image Processing