Skip to main content

Showing 1–50 of 151 results for author: Burgard, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.06862  [pdf, ps, other

    cs.RO cs.AI cs.CV cs.LG cs.SD eess.AS

    Multimodal Spatial Language Maps for Robot Navigation and Manipulation

    Authors: Chenguang Huang, Oier Mees, Andy Zeng, Wolfram Burgard

    Abstract: Grounding language to a navigating agent's observations can leverage pretrained multimodal foundation models to match perceptions to object or event descriptions. However, previous approaches remain disconnected from environment mapping, lack the spatial precision of geometric maps, or neglect additional modality information beyond vision. To address this, we propose multimodal spatial language ma… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: accepted to International Journal of Robotics Research (IJRR). 24 pages, 18 figures. The paper contains texts from VLMaps(arXiv:2210.05714) and AVLMaps(arXiv:2303.07522). The project page is https://mslmaps.github.io/

  2. arXiv:2505.08627  [pdf, other

    cs.RO

    Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness

    Authors: Reihaneh Mirjalili, Tobias Jülg, Florian Walter, Wolfram Burgard

    Abstract: Visuomotor policies trained on human expert demonstrations have recently shown strong performance across a wide range of robotic manipulation tasks. However, these policies remain highly sensitive to domain shifts stemming from background or robot embodiment changes, which limits their generalization capabilities. In this paper, we present ARRO, a novel calibration-free visual representation that… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  3. arXiv:2503.10370  [pdf, other

    cs.RO cs.CV cs.LG

    LUMOS: Language-Conditioned Imitation Learning with World Models

    Authors: Iman Nematollahi, Branton DeMoss, Akshay L Chandra, Nick Hawes, Wolfram Burgard, Ingmar Posner

    Abstract: We introduce LUMOS, a language-conditioned multi-task imitation learning framework for robotics. LUMOS learns skills by practicing them over many long-horizon rollouts in the latent space of a learned world model and transfers these skills zero-shot to a real robot. By learning on-policy in the latent space of the learned world model, our algorithm mitigates policy-induced distribution shift which… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Accepted at the 2025 IEEE International Conference on Robotics and Automation (ICRA)

  4. arXiv:2503.08445  [pdf, other

    cs.RO

    LLM-Pack: Intuitive Grocery Handling for Logistics Applications

    Authors: Yannik Blei, Michael Krawez, Tobias Jülg, Pierre Krack, Florian Walter, Wolfram Burgard

    Abstract: Robotics and automation are increasingly influential in logistics but remain largely confined to traditional warehouses. In grocery retail, advancements such as cashier-less supermarkets exist, yet customers still manually pick and pack groceries. While there has been a substantial focus in robotics on the bin picking problem, the task of packing objects and groceries has remained largely untouche… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 6 Pages, 6 Figures

  5. arXiv:2503.05833  [pdf, other

    cs.RO cs.LG

    Refined Policy Distillation: From VLA Generalists to RL Experts

    Authors: Tobias Jülg, Wolfram Burgard, Florian Walter

    Abstract: Recent generalist Vision-Language-Action Models (VLAs) can perform a variety of tasks on real robots with remarkable generalization capabilities. However, reported success rates are often not on par with those of expert policies. Moreover, VLAs usually do not work out of the box and often must be fine-tuned as they are sensitive to setup changes. In this work, we present Refined Policy Distillatio… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  6. arXiv:2503.03599  [pdf, other

    cs.CV cs.RO

    REGRACE: A Robust and Efficient Graph-based Re-localization Algorithm using Consistency Evaluation

    Authors: Débora N. P. Oliveira, Joshua Knights, Sebastián Barbas Laina, Simon Boche, Wolfram Burgard, Stefan Leutenegger

    Abstract: Loop closures are essential for correcting odometry drift and creating consistent maps, especially in the context of large-scale navigation. Current methods using dense point clouds for accurate place recognition do not scale well due to computationally expensive scan-to-scan comparisons. Alternative object-centric approaches are more efficient but often struggle with sensitivity to viewpoint vari… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Submitted to IROS2025

  7. arXiv:2503.02372  [pdf, other

    cs.CV cs.RO

    Label-Efficient LiDAR Panoptic Segmentation

    Authors: Ahmet Selim Çanakçı, Niclas Vödisch, Kürsat Petek, Wolfram Burgard, Abhinav Valada

    Abstract: A main bottleneck of learning-based robotic scene understanding methods is the heavy reliance on extensive annotated training data, which often limits their generalization ability. In LiDAR panoptic segmentation, this challenge becomes even more pronounced due to the need to simultaneously address both semantic and instance segmentation from complex, high-dimensional point cloud data. In this work… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  8. arXiv:2502.19374  [pdf, other

    cs.RO

    LiDAR Registration with Visual Foundation Models

    Authors: Niclas Vödisch, Giovanni Cioffi, Marco Cannici, Wolfram Burgard, Davide Scaramuzza

    Abstract: LiDAR registration is a fundamental task in robotic mapping and localization. A critical component of aligning two point clouds is identifying robust point correspondences using point descriptors. This step becomes particularly challenging in scenarios involving domain shifts, seasonal changes, and variations in point cloud structures. These factors substantially impact both handcrafted and learni… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  9. arXiv:2412.02449  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    BYE: Build Your Encoder with One Sequence of Exploration Data for Long-Term Dynamic Scene Understanding

    Authors: Chenguang Huang, Shengchao Yan, Wolfram Burgard

    Abstract: Dynamic scene understanding remains a persistent challenge in robotic applications. Early dynamic mapping methods focused on mitigating the negative influence of short-term dynamic objects on camera motion estimation by masking or tracking specific categories, which often fall short in adapting to long-term scene changes. Recent efforts address object association in long-term dynamic environments… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  10. arXiv:2411.09524  [pdf, other

    cs.RO

    FlowNav: Combining Flow Matching and Depth Priors for Efficient Navigation

    Authors: Samiran Gode, Abhijeet Nayak, Débora N. P. Oliveira, Michael Krawez, Cordelia Schmid, Wolfram Burgard

    Abstract: Effective robot navigation in unseen environments is a challenging task that requires precise control actions at high frequencies. Recent advances have framed it as an image-goal-conditioned control problem, where the robot generates navigation actions using frontal RGB images. Current state-of-the-art methods in this area use diffusion policies to generate these control actions. Despite their pro… ▽ More

    Submitted 3 March, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: Submitted to IROS'25. Previous version accepted at CoRL 2024 workshop on Learning Effective Abstractions for Planning (LEAP) and workshop on Differentiable Optimization Everywhere: Simulation, Estimation, Learning, and Control

  11. arXiv:2410.03904  [pdf, other

    cs.SD cs.AI eess.AS

    Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection

    Authors: Ksheeraja Raghavan, Samiran Gode, Ankit Shah, Surabhi Raghavan, Wolfram Burgard, Bhiksha Raj, Rita Singh

    Abstract: We introduce a novel, general-purpose audio generation framework specifically designed for anomaly detection and localization. Unlike existing datasets that predominantly focus on industrial and machine-related sounds, our framework focuses a broader range of environments, particularly useful in real-world scenarios where only audio data are available, such as in video-derived or telephonic audio.… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 9 pages, under review

  12. arXiv:2409.16111  [pdf, other

    cs.RO cs.CV

    CloudTrack: Scalable UAV Tracking with Cloud Semantics

    Authors: Yannik Blei, Michael Krawez, Nisarga Nilavadi, Tanja Katharina Kaiser, Wolfram Burgard

    Abstract: Nowadays, unmanned aerial vehicles (UAVs) are commonly used in search and rescue scenarios to gather information in the search area. The automatic identification of the person searched for in aerial footage could increase the autonomy of such systems, reduce the search time, and thus increase the missed person's chances of survival. In this paper, we present a novel approach to perform semanticall… ▽ More

    Submitted 8 May, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 7 pages, 3 figures

  13. arXiv:2409.14096  [pdf, other

    cs.RO

    VLM-Vac: Enhancing Smart Vacuums through VLM Knowledge Distillation and Language-Guided Experience Replay

    Authors: Reihaneh Mirjalili, Michael Krawez, Florian Walter, Wolfram Burgard

    Abstract: In this paper, we propose VLM-Vac, a novel framework designed to enhance the autonomy of smart robot vacuum cleaners. Our approach integrates the zero-shot object detection capabilities of a Vision-Language Model (VLM) with a Knowledge Distillation (KD) strategy. By leveraging the VLM, the robot can categorize objects into actionable classes -- either to avoid or to suck -- across diverse backgrou… ▽ More

    Submitted 21 September, 2024; originally announced September 2024.

  14. arXiv:2408.02297  [pdf, other

    cs.RO cs.CV

    Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation

    Authors: Sai Prasanna, Daniel Honerkamp, Kshitij Sirohi, Tim Welschehold, Wolfram Burgard, Abhinav Valada

    Abstract: Embodied AI has made significant progress acting in unexplored environments. However, tasks such as object search have largely focused on efficient policy learning. In this work, we identify several gaps in current search methods: They largely focus on dated perception models, neglect temporal aggregation, and transfer from ground truth directly to noisy perception at test time, without accounting… ▽ More

    Submitted 14 January, 2025; v1 submitted 5 August, 2024; originally announced August 2024.

    Journal ref: Proceedings of the International Symposium on Robotics Research (ISRR), 2024

  15. arXiv:2407.13431  [pdf, other

    cs.LG cs.AI

    Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations

    Authors: Yue Yao, Shengchao Yan, Daniel Goehring, Wolfram Burgard, Joerg Reichardt

    Abstract: Robustness against Out-of-Distribution (OoD) samples is a key performance indicator of a trajectory prediction model. However, the development and ranking of state-of-the-art (SotA) models are driven by their In-Distribution (ID) performance on individual competition datasets. We present an OoD testing protocol that homogenizes datasets and prediction tasks across two large-scale motion datasets.… ▽ More

    Submitted 25 January, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

  16. A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation

    Authors: Niclas Vödisch, Kürsat Petek, Markus Käppeler, Abhinav Valada, Wolfram Burgard

    Abstract: A key challenge for the widespread application of learning-based models for robotic perception is to significantly reduce the required amount of annotated training data while achieving accurate predictions. This is essential not only to decrease operating costs but also to speed up deployment time. In this work, we address this challenge for PAnoptic SegmenTation with fEw Labels (PASTEL) by exploi… ▽ More

    Submitted 3 December, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Journal ref: IEEE Robotics and Automation Letters, vol. 10, no. 1, pp. 216-223, January 2025

  17. arXiv:2405.18852  [pdf, other

    cs.CV cs.AI cs.RO

    LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping

    Authors: Nikhil Gosala, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Paulo Drews-Jr, Wolfram Burgard, Abhinav Valada

    Abstract: Semantic Bird's Eye View (BEV) maps offer a rich representation with strong occlusion reasoning for various decision making tasks in autonomous driving. However, most BEV mapping approaches employ a fully supervised learning paradigm that relies on large amounts of human-annotated BEV ground truth data. In this work, we address this limitation by proposing the first unsupervised representation lea… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 23 pages, 5 figures

  18. Automatic Target-Less Camera-LiDAR Calibration From Motion and Deep Point Correspondences

    Authors: Kürsat Petek, Niclas Vödisch, Johannes Meyer, Daniele Cattaneo, Abhinav Valada, Wolfram Burgard

    Abstract: Sensor setups of robotic platforms commonly include both camera and LiDAR as they provide complementary information. However, fusing these two modalities typically requires a highly accurate calibration between them. In this paper, we propose MDPCalib which is a novel method for camera-LiDAR calibration that requires neither human supervision nor any specific target objects. Instead, we utilize se… ▽ More

    Submitted 4 November, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 11, pp. 9978-9985, November 2024

  19. arXiv:2403.17846  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation

    Authors: Abdelrhman Werby, Chenguang Huang, Martin Büchner, Abhinav Valada, Wolfram Burgard

    Abstract: Recent open-vocabulary robot mapping methods enrich dense geometric maps with pre-trained visual-language features. While these maps allow for the prediction of point-wise saliency maps when queried for a certain language concept, large-scale environments and abstract queries beyond the object level still pose a considerable hurdle, ultimately limiting language-grounded robotic navigation. In this… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Code and video are available at http://hovsg.github.io/

  20. arXiv:2403.14305  [pdf, other

    cs.RO

    Bayesian Optimization for Sample-Efficient Policy Improvement in Robotic Manipulation

    Authors: Adrian Röfer, Iman Nematollahi, Tim Welschehold, Wolfram Burgard, Abhinav Valada

    Abstract: Sample efficient learning of manipulation skills poses a major challenge in robotics. While recent approaches demonstrate impressive advances in the type of task that can be addressed and the sensing modalities that can be incorporated, they still require large amounts of training data. Especially with regard to learning actions on robots in the real world, this poses a major problem due to the hi… ▽ More

    Submitted 7 October, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures, 2 tables, Accepted at the 2024 IEEE International Conference on Intelligent Robots and Systems (IROS)

  21. arXiv:2403.11914  [pdf, other

    cs.LG cs.RO

    Agent-Agnostic Centralized Training for Decentralized Multi-Agent Cooperative Driving

    Authors: Shengchao Yan, Lukas König, Wolfram Burgard

    Abstract: Active traffic management with autonomous vehicles offers the potential for reduced congestion and improved traffic flow. However, developing effective algorithms for real-world scenarios requires overcoming challenges related to infinite-horizon traffic flow and partial observability. To address these issues and further decentralize traffic management, we propose an asymmetric actor-critic model… ▽ More

    Submitted 3 September, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: accepted by IROS 2024

  22. BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation

    Authors: Jonas Schramm, Niclas Vödisch, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Wolfram Burgard, Abhinav Valada

    Abstract: Semantic scene segmentation from a bird's-eye-view (BEV) perspective plays a crucial role in facilitating planning and decision-making for mobile robots. Although recent vision-only methods have demonstrated notable advancements in performance, they often struggle under adverse illumination conditions such as rain or nighttime. While active sensors offer a solution to this challenge, the prohibiti… ▽ More

    Submitted 25 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted for the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024, pp. 1435-1442

  23. arXiv:2402.07691  [pdf, other

    cs.RO

    Evaluation of a Smart Mobile Robotic System for Industrial Plant Inspection and Supervision

    Authors: Georg K. J. Fischer, Max Bergau, D. Adriana Gómez-Rosal, Andreas Wachaja, Johannes Gräter, Matthias Odenweller, Uwe Piechottka, Fabian Hoeflinger, Nikhil Gosala, Niklas Wetzel, Daniel Büscher, Abhinav Valada, Wolfram Burgard

    Abstract: Automated and autonomous industrial inspection is a longstanding research field, driven by the necessity to enhance safety and efficiency within industrial settings. In addressing this need, we introduce an autonomously navigating robotic system designed for comprehensive plant inspection. This innovative system comprises a robotic platform equipped with a diverse array of sensors integrated to fa… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Submitted for publication in IEEE Sensors Journal

  24. arXiv:2402.05840  [pdf, other

    cs.RO

    uPLAM: Robust Panoptic Localization and Mapping Leveraging Perception Uncertainties

    Authors: Kshitij Sirohi, Daniel Büscher, Wolfram Burgard

    Abstract: The availability of a robust map-based localization system is essential for the operation of many autonomously navigating vehicles. Since uncertainty is an inevitable part of perception, it is beneficial for the robustness of the robot to consider it in typical downstream tasks of navigation stacks. In particular localization and mapping methods, which in modern systems often employ convolutional… ▽ More

    Submitted 20 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  25. arXiv:2312.08240  [pdf, other

    cs.RO cs.CV

    CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation

    Authors: Eugenio Chisari, Nick Heppert, Tim Welschehold, Wolfram Burgard, Abhinav Valada

    Abstract: Reliable object grasping is a crucial capability for autonomous robots. However, many existing grasping approaches focus on general clutter removal without explicitly modeling objects and thus only relying on the visible local geometry. We introduce CenterGrasp, a novel framework that combines object awareness and holistic grasping. CenterGrasp learns a general object prior by encoding shapes and… ▽ More

    Submitted 5 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at RA-L. Video, code and models available at http://centergrasp.cs.uni-freiburg.de

  26. arXiv:2310.15059  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models

    Authors: Iman Nematollahi, Kirill Yankov, Wolfram Burgard, Tim Welschehold

    Abstract: A long-standing challenge for a robotic manipulation system operating in real-world scenarios is adapting and generalizing its acquired motor skills to unseen environments. We tackle this challenge employing hybrid skill models that integrate imitation and reinforcement paradigms, to explore how the learning and adaptation of a skill, along with its core grounding in the scene through a learned ke… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at the International Symposium on Experimental Robotics (ISER) 2023. Videos at http://kis-gmm.cs.uni-freiburg.de/

  27. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (269 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 14 May, 2025; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  28. arXiv:2310.05600  [pdf, other

    cs.RO cs.CV

    Care3D: An Active 3D Object Detection Dataset of Real Robotic-Care Environments

    Authors: Michael G. Adam, Sebastian Eger, Martin Piccolrovazzi, Maged Iskandar, Joern Vogel, Alexander Dietrich, Seongjien Bien, Jon Skerlj, Abdeldjallil Naceri, Eckehard Steinbach, Alin Albu-Schaeffer, Sami Haddadin, Wolfram Burgard

    Abstract: As labor shortage increases in the health sector, the demand for assistive robotics grows. However, the needed test data to develop those robots is scarce, especially for the application of active 3D object detection, where no real data exists at all. This short paper counters this by introducing such an annotated dataset of real environments. The captured environments represent areas which are al… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  29. arXiv:2310.05239  [pdf, other

    cs.RO

    Lan-grasp: Using Large Language Models for Semantic Object Grasping

    Authors: Reihaneh Mirjalili, Michael Krawez, Simone Silenzi, Yannik Blei, Wolfram Burgard

    Abstract: In this paper, we propose Lan-grasp, a novel approach towards more appropriate semantic grasping. We use foundation models to provide the robot with a deeper understanding of the objects, the right place to grasp an object, or even the parts to avoid. This allows our robot to grasp and utilize objects in a more meaningful and safe manner. We leverage the combination of a Large Language Model, a Vi… ▽ More

    Submitted 11 December, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  30. Few-Shot Panoptic Segmentation With Foundation Models

    Authors: Markus Käppeler, Kürsat Petek, Niclas Vödisch, Wolfram Burgard, Abhinav Valada

    Abstract: Current state-of-the-art methods for panoptic segmentation require an immense amount of annotated training data that is both arduous and expensive to obtain posing a significant challenge for their widespread adoption. Concurrently, recent breakthroughs in visual representation learning have sparked a paradigm shift leading to the advent of large foundation models that can be trained with complete… ▽ More

    Submitted 1 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted for "IEEE International Conference on Robotics and Automation (ICRA) 2024"

    Journal ref: IEEE International Conference on Robotics and Automation, 2024, pp. 7718-7724

  31. Collaborative Dynamic 3D Scene Graphs for Automated Driving

    Authors: Elias Greve, Martin Büchner, Niclas Vödisch, Wolfram Burgard, Abhinav Valada

    Abstract: Maps have played an indispensable role in enabling safe and automated driving. Although there have been many advances on different fronts ranging from SLAM to semantics, building an actionable hierarchical semantic representation of urban dynamic scenes and processing information from multiple agents are still challenging problems. In this work, we present Collaborative URBan Scene Graphs (CURB-SG… ▽ More

    Submitted 4 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted for "IEEE International Conference on Robotics and Automation (ICRA) 2024"

    Journal ref: IEEE International Conference on Robotics and Automation, 2024, pp. 11118-11124

  32. arXiv:2308.05612  [pdf, other

    cs.RO cs.AI

    A Smart Robotic System for Industrial Plant Supervision

    Authors: D. Adriana Gómez-Rosal, Max Bergau, Georg K. J. Fischer, Andreas Wachaja, Johannes Gräter, Matthias Odenweller, Uwe Piechottka, Fabian Hoeflinger, Nikhil Gosala, Niklas Wetzel, Daniel Büscher, Abhinav Valada, Wolfram Burgard

    Abstract: In today's chemical plants, human field operators perform frequent integrity checks to guarantee high safety standards, and thus are possibly the first to encounter dangerous operating conditions. To alleviate their task, we present a system consisting of an autonomously navigating robot integrated with various sensors and intelligent data processing. It is able to detect methane leaks and estimat… ▽ More

    Submitted 1 September, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: Final submission for IEEE Sensors 2023

  33. arXiv:2307.00488  [pdf, other

    cs.RO

    POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static Environments

    Authors: Jingxing Qian, Veronica Chatrath, James Servos, Aaron Mavrinac, Wolfram Burgard, Steven L. Waslander, Angela P. Schoellig

    Abstract: Simultaneous localization and mapping (SLAM) in slowly varying scenes is important for long-term robot task completion. Failing to detect scene changes may lead to inaccurate maps and, ultimately, lost robots. Classical SLAM algorithms assume static scenes, and recent works take dynamics into account, but require scene changes to be observed in consecutive frames. Semi-static scenes, wherein objec… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Published in Robotics: Science and Systems (RSS) 2023

  34. arXiv:2306.16316  [pdf, other

    cs.RO

    Learning Continuous Control with Geometric Regularity from Robot Intrinsic Symmetry

    Authors: Shengchao Yan, Baohe Zhang, Yuan Zhang, Joschka Boedecker, Wolfram Burgard

    Abstract: Geometric regularity, which leverages data symmetry, has been successfully incorporated into deep learning architectures such as CNNs, RNNs, GNNs, and Transformers. While this concept has been widely applied in robotics to address the curse of dimensionality when learning from high-dimensional data, the inherent reflectional and rotational symmetry of robot structures has not been adequately explo… ▽ More

    Submitted 18 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: accepted by ICRA 2024

  35. arXiv:2306.15410  [pdf, other

    cs.CV

    AutoGraph: Predicting Lane Graphs from Traffic Observations

    Authors: Jannik Zürn, Ingmar Posner, Wolfram Burgard

    Abstract: Lane graph estimation is a long-standing problem in the context of autonomous driving. Previous works aimed at solving this problem by relying on large-scale, hand-annotated lane graphs, introducing a data bottleneck for training models to solve this task. To overcome this limitation, we propose to use the motion patterns of traffic participants as lane graph annotations. In our AutoGraph approach… ▽ More

    Submitted 10 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures

  36. arXiv:2306.11346  [pdf, ps, other

    cs.RO cs.CV

    End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization

    Authors: Guangming Wang, Yu Zheng, Yuxuan Wu, Yanfeng Guo, Zhe Liu, Yixiang Zhu, Wolfram Burgard, Hesheng Wang

    Abstract: Robot localization using a built map is essential for a variety of tasks including accurate navigation and mobile manipulation. A popular approach to robot localization is based on image-to-point cloud registration, which combines illumination-invariant LiDAR-based mapping with economical image-based localization. However, the recent works for image-to-point cloud registration either divide the re… ▽ More

    Submitted 5 July, 2025; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted to T-RO. Source codes are released at https://github.com/IRMVLab/I2PNet

  37. Fast yet predictable braking manoeuvers for real-time robot control

    Authors: Mazin Hamad, Jesus Gutierrez-Moreno, Hugo T. M. Kussaba, Nico Mansfeld, Saeed Abdolshah, Abdalla Swikir, Wolfram Burgard, Sami Haddadin

    Abstract: This paper proposes a framework for generating fast, smooth and predictable braking manoeuvers for a controlled robot. The proposed framework integrates two approaches to obtain feasible modal limits for designing braking trajectories. The first approach is real-time capable but conservative considering the usage of the available feasible actuator control region, resulting in longer braking times.… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: This work has been accepted to the 22nd IFAC World Congress

  38. arXiv:2305.04718  [pdf, other

    cs.RO cs.AI cs.CV

    The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

    Authors: Jan Ole von Hartz, Eugenio Chisari, Tim Welschehold, Wolfram Burgard, Joschka Boedecker, Abhinav Valada

    Abstract: In policy learning for robotic manipulation, sample efficiency is of paramount importance. Thus, learning and extracting more compact representations from camera observations is a promising avenue. However, current methods often assume full observability of the scene and struggle with scale invariance. In many tasks and settings, this assumption does not hold as objects in the scene are often occl… ▽ More

    Submitted 20 September, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 6931-6938, Nov. 2023

  39. arXiv:2304.07058  [pdf, other

    cs.RO

    FM-Loc: Using Foundation Models for Improved Vision-based Localization

    Authors: Reihaneh Mirjalili, Michael Krawez, Wolfram Burgard

    Abstract: Visual place recognition is essential for vision-based robot localization and SLAM. Despite the tremendous progress made in recent years, place recognition in changing environments remains challenging. A promising approach to cope with appearance variations is to leverage high-level semantic features like objects or place categories. In this paper, we propose FM-Loc which is a novel image-based lo… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  40. arXiv:2303.11756  [pdf, other

    cs.RO cs.LG

    Improving Deep Dynamics Models for Autonomous Vehicles with Multimodal Latent Mapping of Surfaces

    Authors: Johan Vertens, Nicolai Dorka, Tim Welschehold, Michael Thompson, Wolfram Burgard

    Abstract: The safe deployment of autonomous vehicles relies on their ability to effectively react to environmental changes. This can require maneuvering on varying surfaces which is still a difficult problem, especially for slippery terrains. To address this issue we propose a new approach that learns a surface-aware dynamics model by conditioning it on a latent variable vector storing surface information a… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  41. CoVIO: Online Continual Learning for Visual-Inertial Odometry

    Authors: Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, Abhinav Valada

    Abstract: Visual odometry is a fundamental task for many applications on mobile devices and robotic platforms. Since such applications are oftentimes not limited to predefined target domains and learning-based vision systems are known to generalize poorly to unseen environments, methods for continual adaptation during inference time are of significant interest. In this work, we introduce CoVIO for online co… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Journal ref: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

  42. CoDEPS: Online Continual Learning for Depth Estimation and Panoptic Segmentation

    Authors: Niclas Vödisch, Kürsat Petek, Wolfram Burgard, Abhinav Valada

    Abstract: Operating a robot in the open world requires a high level of robustness with respect to previously unseen environments. Optimally, the robot is able to adapt by itself to new conditions without human supervision, e.g., automatically adjusting its perception system to changing lighting conditions. In this work, we address the task of continual learning for deep learning-based monocular depth estima… ▽ More

    Submitted 31 May, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted for "Robotics: Science and Systems (RSS) 2023"

    Journal ref: Robotics: Science and Systems, 2023

  43. arXiv:2303.10144  [pdf, other

    cs.LG stat.ML

    Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting

    Authors: Nicolai Dorka, Tim Welschehold, Wolfram Burgard

    Abstract: Early stopping based on the validation set performance is a popular approach to find the right balance between under- and overfitting in the context of supervised learning. However, in reinforcement learning, even for supervised sub-problems such as world model learning, early stopping is not applicable as the dataset is continually evolving. As a solution, we propose a new general method that dyn… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  44. arXiv:2303.07522  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Audio Visual Language Maps for Robot Navigation

    Authors: Chenguang Huang, Oier Mees, Andy Zeng, Wolfram Burgard

    Abstract: While interacting in the world is a multi-sensory experience, many robots continue to predominantly rely on visual perception to map and navigate in their environments. In this work, we propose Audio-Visual-Language Maps (AVLMaps), a unified 3D spatial map representation for storing cross-modal information from audio, visual, and language cues. AVLMaps integrate the open-vocabulary capabilities of… ▽ More

    Submitted 27 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Project page: https://avlmaps.github.io/

  45. arXiv:2303.03037  [pdf, other

    cs.CV cs.AI

    EvCenterNet: Uncertainty Estimation for Object Detection using Evidential Learning

    Authors: Monish R. Nallapareddy, Kshitij Sirohi, Paulo L. J. Drews-Jr, Wolfram Burgard, Chih-Hong Cheng, Abhinav Valada

    Abstract: Uncertainty estimation is crucial in safety-critical settings such as automated driving as it provides valuable information for several downstream tasks including high-level decision making and path planning. In this work, we propose EvCenterNet, a novel uncertainty-aware 2D object detection framework using evidential learning to directly estimate both classification and regression uncertainties.… ▽ More

    Submitted 28 September, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  46. arXiv:2302.06175  [pdf, other

    cs.CV cs.RO

    Learning and Aggregating Lane Graphs for Urban Automated Driving

    Authors: Martin Büchner, Jannik Zürn, Ion-George Todoran, Abhinav Valada, Wolfram Burgard

    Abstract: Lane graph estimation is an essential and highly challenging task in automated driving and HD map learning. Existing methods using either onboard or aerial imagery struggle with complex lane topologies, out-of-distribution scenarios, or significant occlusions in the image space. Moreover, merging overlapping lane graphs to obtain consistent large-scale graphs remains difficult. To overcome these c… ▽ More

    Submitted 17 March, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 22 pages, 17 figures

  47. arXiv:2302.04233  [pdf, other

    cs.CV cs.AI cs.RO

    SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images

    Authors: Nikhil Gosala, Kürsat Petek, Paulo L. J. Drews-Jr, Wolfram Burgard, Abhinav Valada

    Abstract: Bird's-Eye-View (BEV) semantic maps have become an essential component of automated driving pipelines due to the rich representation they provide for decision-making tasks. However, existing approaches for generating these maps still follow a fully supervised training paradigm and hence rely on large amounts of annotated BEV data. In this work, we address this limitation by proposing the first sel… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 14 pages, 7 figures

  48. arXiv:2210.05714  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Visual Language Maps for Robot Navigation

    Authors: Chenguang Huang, Oier Mees, Andy Zeng, Wolfram Burgard

    Abstract: Grounding language to the visual observations of a navigating agent can be performed using off-the-shelf visual-language models pretrained on Internet-scale data (e.g., image captions). While this is useful for matching images to natural language descriptions of object goals, it remains disjoint from the process of mapping the environment, so that it lacks the spatial precision of classic geometri… ▽ More

    Submitted 8 March, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted at the 2023 IEEE International Conference on Robotics and Automation (ICRA). Project page: https://vlmaps.github.io

  49. arXiv:2210.04472  [pdf, other

    cs.CV cs.RO

    Uncertainty-aware LiDAR Panoptic Segmentation

    Authors: Kshitij Sirohi, Sajad Marvi, Daniel Büscher, Wolfram Burgard

    Abstract: Modern autonomous systems often rely on LiDAR scanners, in particular for autonomous driving scenarios. In this context, reliable scene understanding is indispensable. Current learning-based methods typically try to achieve maximum performance for this task, while neglecting a proper estimation of the associated uncertainties. In this work, we introduce a novel approach for solving the task of unc… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  50. arXiv:2210.01911  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Grounding Language with Visual Affordances over Unstructured Data

    Authors: Oier Mees, Jessica Borja-Diaz, Wolfram Burgard

    Abstract: Recent works have shown that Large Language Models (LLMs) can be applied to ground natural language to a wide variety of robot skills. However, in practice, learning multi-task, language-conditioned robotic skills typically requires large-scale data collection and frequent human intervention to reset the environment or help correcting the current policies. In this work, we propose a novel approach… ▽ More

    Submitted 8 March, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted at the 2023 IEEE International Conference on Robotics and Automation (ICRA). Project website: http://hulc2.cs.uni-freiburg.de