Skip to main content

Showing 1–19 of 19 results for author: Holler, D

.
  1. arXiv:2408.06876  [pdf, other

    cs.AI cs.RO

    Decision-Focused Learning to Predict Action Costs for Planning

    Authors: Jayanta Mandi, Marco Foschini, Daniel Holler, Sylvie Thiebaux, Jorg Hoffmann, Tias Guns

    Abstract: In many automated planning applications, action costs can be hard to specify. An example is the time needed to travel through a certain road segment, which depends on many factors, such as the current weather conditions. A natural way to address this issue is to learn to predict these parameters based on input features (e.g., weather forecasts) and use the predicted action costs in automated plann… ▽ More

    Submitted 26 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  2. arXiv:2311.10484  [pdf, other

    cs.RO

    Learning Agile Locomotion on Risky Terrains

    Authors: Chong Zhang, Nikita Rudin, David Hoeller, Marco Hutter

    Abstract: Quadruped robots have shown remarkable mobility on various terrains through reinforcement learning. Yet, in the presence of sparse footholds and risky terrains such as stepping stones and balance beams, which require precise foot placement to avoid falls, model-based approaches are often used. In this paper, we show that end-to-end reinforcement learning can also enable the robot to traverse risky… ▽ More

    Submitted 8 August, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 8 pages, 11 figures. IROS 2024

  3. arXiv:2306.14874  [pdf, other

    cs.RO

    ANYmal Parkour: Learning Agile Navigation for Quadrupedal Robots

    Authors: David Hoeller, Nikita Rudin, Dhionis Sako, Marco Hutter

    Abstract: Performing agile navigation with four-legged robots is a challenging task due to the highly dynamic motions, contacts with various parts of the robot, and the limited field of view of the perception sensors. In this paper, we propose a fully-learned approach to train such robots and conquer scenarios that are reminiscent of parkour challenges. The method involves training advanced locomotion skill… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  4. Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments

    Authors: Mayank Mittal, Calvin Yu, Qinxi Yu, Jingzhou Liu, Nikita Rudin, David Hoeller, Jia Lin Yuan, Ritvik Singh, Yunrong Guo, Hammad Mazhar, Ajay Mandlekar, Buck Babich, Gavriel State, Marco Hutter, Animesh Garg

    Abstract: We present Orbit, a unified and modular framework for robot learning powered by NVIDIA Isaac Sim. It offers a modular design to easily and efficiently create robotic environments with photo-realistic scenes and high-fidelity rigid and deformable body simulation. With Orbit, we provide a suite of benchmark tasks of varying difficulty -- from single-stage cabinet opening and cloth folding to multi-s… ▽ More

    Submitted 16 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Project website: https://isaac-orbit.github.io/

    Journal ref: IEEE Robotics and Automation Letters (Volume: 8, Issue: 6, June 2023)

  5. arXiv:2209.12827  [pdf, other

    cs.RO cs.LG

    Advanced Skills by Learning Locomotion and Local Navigation End-to-End

    Authors: Nikita Rudin, David Hoeller, Marko Bjelonic, Marco Hutter

    Abstract: The common approach for local navigation on challenging environments with legged robots requires path planning, path following and locomotion, which usually requires a locomotion control policy that accurately tracks a commanded velocity. However, by breaking down the navigation problem into these sub-tasks, we limit the robot's capabilities since the individual tasks do not consider the full solu… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: IROS 2022, Project website: https://sites.google.com/ leggedrobotics.com/end-to-end-loco-navigation

  6. arXiv:2206.08077  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Neural Scene Representation for Locomotion on Structured Terrain

    Authors: David Hoeller, Nikita Rudin, Christopher Choy, Animashree Anandkumar, Marco Hutter

    Abstract: We propose a learning-based method to reconstruct the local terrain for locomotion with a mobile robot traversing urban environments. Using a stream of depth measurements from the onboard cameras and the robot's trajectory, the algorithm estimates the topography in the robot's vicinity. The raw measurements from these cameras are noisy and only provide partial and occluded observations that in man… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  7. Locomotion Policy Guided Traversability Learning using Volumetric Representations of Complex Environments

    Authors: Jonas Frey, David Hoeller, Shehryar Khattak, Marco Hutter

    Abstract: Despite the progress in legged robotic locomotion, autonomous navigation in unknown environments remains an open problem. Ideally, the navigation system utilizes the full potential of the robots' locomotion capabilities while operating within safety limits under uncertainty. The robot must sense and analyze the traversability of the surrounding terrain, which depends on the hardware, locomotion co… ▽ More

    Submitted 21 August, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: accepted for 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

    Journal ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  8. arXiv:2109.11978  [pdf, other

    cs.RO cs.LG

    Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

    Authors: Nikita Rudin, David Hoeller, Philipp Reist, Marco Hutter

    Abstract: In this work, we present and study a training set-up that achieves fast policy generation for real-world robotic tasks by using massive parallelism on a single workstation GPU. We analyze and discuss the impact of different training algorithm components in the massively parallel regime on the final policy performance and training times. In addition, we present a novel game-inspired curriculum that… ▽ More

    Submitted 19 August, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: CoRL 2021 Project website: : https://leggedrobotics.github.io/legged_gym/ Video: https://youtu.be/8sO7VS3q8d0

  9. arXiv:2108.10470  [pdf, other

    cs.RO cs.LG

    Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning

    Authors: Viktor Makoviychuk, Lukasz Wawrzyniak, Yunrong Guo, Michelle Lu, Kier Storey, Miles Macklin, David Hoeller, Nikita Rudin, Arthur Allshire, Ankur Handa, Gavriel State

    Abstract: Isaac Gym offers a high performance learning platform to train policies for wide variety of robotics tasks directly on GPU. Both physics simulation and the neural network policy training reside on GPU and communicate by directly passing data from physics buffers to PyTorch tensors without ever going through any CPU bottlenecks. This leads to blazing fast training times for complex robotics tasks o… ▽ More

    Submitted 25 August, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: tech report on isaac-gym

  10. Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile Manipulation

    Authors: Mayank Mittal, David Hoeller, Farbod Farshidian, Marco Hutter, Animesh Garg

    Abstract: A kitchen assistant needs to operate human-scale objects, such as cabinets and ovens, in unmapped environments with dynamic obstacles. Autonomous interactions in such environments require integrating dexterous manipulation and fluid mobility. While mobile manipulators in different form factors provide an extended workspace, their real-world adoption has been limited. Executing a high-level task fo… ▽ More

    Submitted 18 March, 2022; v1 submitted 18 March, 2021; originally announced March 2021.

  11. arXiv:2103.04351  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Learning a State Representation and Navigation in Cluttered and Dynamic Environments

    Authors: David Hoeller, Lorenz Wellhausen, Farbod Farshidian, Marco Hutter

    Abstract: In this work, we present a learning-based pipeline to realise local navigation with a quadrupedal robot in cluttered environments with static and dynamic obstacles. Given high-level navigation commands, the robot is able to safely locomote to a target location based on frames from a depth camera without any explicit mapping of the environment. First, the sequence of images and the current trajecto… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: 8 pages, 8 figures, 2 tables

    Journal ref: IEEE Robotics and Automation Letters 2021

  12. arXiv:2011.06332  [pdf, other

    cs.RO

    Joint Space Control via Deep Reinforcement Learning

    Authors: Visak Kumar, David Hoeller, Balakumar Sundaralingam, Jonathan Tremblay, Stan Birchfield

    Abstract: The dominant way to control a robot manipulator uses hand-crafted differential equations leveraging some form of inverse kinematics / dynamics. We propose a simple, versatile joint-level controller that dispenses with differential equations entirely. A deep neural network, trained via model-free reinforcement learning, is used to map from task space to joint space. Experiments show the method capa… ▽ More

    Submitted 20 August, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: Presented at IROS 2021. Video is at https://youtu.be/ICfve-GTTp8

  13. arXiv:2010.03982  [pdf, other

    cs.CL

    Generating Instructions at Different Levels of Abstraction

    Authors: Arne Köhn, Julia Wichlacz, Álvaro Torralba, Daniel Höller, Jörg Hoffmann, Alexander Koller

    Abstract: When generating technical instructions, it is often convenient to describe complex objects in the world at different levels of abstraction. A novice user might need an object explained piece by piece, while for an expert, talking about the complex object (e.g. a wall or railing) directly may be more succinct and efficient. We show how to generate building instructions at different levels of abstra… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted COLING 2020 long paper

  14. arXiv:2009.10019  [pdf, other

    cs.RO cs.LG

    Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion

    Authors: Xingye Da, Zhaoming Xie, David Hoeller, Byron Boots, Animashree Anandkumar, Yuke Zhu, Buck Babich, Animesh Garg

    Abstract: We present a hierarchical framework that combines model-based control and reinforcement learning (RL) to synthesize robust controllers for a quadruped (the Unitree Laikago). The system consists of a high-level controller that learns to choose from a set of primitives in response to changes in the environment and a low-level controller that utilizes an established control method to robustly execute… ▽ More

    Submitted 23 November, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: supplementary video: https://youtu.be/JJOmFZKpYTo

  15. arXiv:2008.00766  [pdf, other

    cs.LG cs.AI

    Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version

    Authors: Timo P. Gros, Daniel Höller, Jörg Hoffmann, Verena Wolf

    Abstract: Learning-based approaches for solving large sequential decision making problems have become popular in recent years. The resulting agents perform differently and their characteristics depend on those of the underlying learning approach. Here, we consider a benchmark planning problem from the reinforcement learning domain, the Racetrack, to investigate the properties of agents derived from differen… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Extended Version of the Conference Paper published in the Proceedings of the 17th International Conference on Quantitative Evaluation of SysTems (QEST)

  16. arXiv:2003.03200  [pdf, other

    cs.RO math.OC

    Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot

    Authors: Napat Karnchanachari, Miguel I. Valls, David Hoeller, Marco Hutter

    Abstract: Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning a… ▽ More

    Submitted 20 April, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: 14 pages, 6 figures, submitted to L4DC 2020

    MSC Class: 49M37 ACM Class: I.2.6; I.2.8; I.2.9

  17. arXiv:1911.05499  [pdf, other

    cs.AI

    HDDL -- A Language to Describe Hierarchical Planning Problems

    Authors: D. Höller, G. Behnke, P. Bercher, S. Biundo, H. Fiorino, D. Pellier, R. Alford

    Abstract: The research in hierarchical planning has made considerable progress in the last few years. Many recent systems do not rely on hand-tailored advice anymore to find solutions, but are supposed to be domain-independent systems that come with sophisticated solving techniques. In principle, this development would make the comparison between systems easier (because the domains are not tailored to a sin… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: International Workshop on HTN Planning (ICAPS), 2019

  18. arXiv:1910.03358  [pdf, other

    cs.LG cs.RO stat.ML

    Deep Value Model Predictive Control

    Authors: Farbod Farshidian, David Hoeller, Marco Hutter

    Abstract: In this paper, we introduce an actor-critic algorithm called Deep Value Model Predictive Control (DMPC), which combines model-based trajectory optimization with value function estimation. The DMPC actor is a Model Predictive Control (MPC) optimizer with an objective function defined in terms of a value function estimated by the critic. We show that our MPC actor is an importance sampler, which min… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Accepted for publication in the Conference on Robotic Learning (CoRL) 2019, Osaka. 10 pages (+5 supplementary)

  19. arXiv:1909.04405  [pdf, other

    cs.AI

    Hierarchical Planning in the IPC

    Authors: D. Höller, G. Behnke, P. Bercher, S. Biundo, H. Fiorino, D. Pellier, R. Alford

    Abstract: Over the last year, the amount of research in hierarchical planning has increased, leading to significant improvements in the performance of planners. However, the research is diverging and planners are somewhat hard to compare against each other. This is mostly caused by the fact that there is no standard set of benchmark domains, nor even a common description language for hierarchical planning p… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Journal ref: Workshop on the International Planning Competition (ICAPS), 2019