Showing 1–50 of 122 results for author: Boots, B

  1. arXiv:2505.11420  [pdf, ps, other]

    cs.RO

    Self-supervised perception for tactile skin covered dexterous hands

    Authors: Akash Sharma, Carolina Higuera, Chaithanya Krishna Bodduluri, Zixi Liu, Taosha Fan, Tess Hellebrekers, Mike Lambeta, Byron Boots, Michael Kaess, Tingfan Wu, Francois Robert Hogan, Mustafa Mukadam

    Abstract: We present Sparsh-skin, a pre-trained encoder for magnetic skin sensors distributed across the fingertips, phalanges, and palm of a dexterous robot hand. Magnetic tactile skins offer a flexible form factor for hand-wide coverage with fast response times, in contrast to vision-based tactile sensors that are restricted to the fingertips and limited by bandwidth. Full hand tactile perception is cruci…

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 18 pages, 15 figures

  2. arXiv:2504.13149  [pdf, other]

    cs.RO

    Long Range Navigator (LRN): Extending robot planning horizons beyond metric maps

    Authors: Matt Schmittle, Rohan Baijal, Nathan Hatch, Rosario Scalise, Mateo Guaman Castro, Sidharth Talia, Khimya Khetarpal, Byron Boots, Siddhartha Srinivasa

    Abstract: A robot navigating an outdoor environment with no prior knowledge of the space must rely on its local sensing to perceive its surroundings and plan. This can come in the form of a local metric map or local policy with some fixed horizon. Beyond that, there is a fog of unknown space marked with some fixed cost. A limited planning horizon can often result in myopic decisions leading the robot off co…

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 10 pages, 9 figures

  3. arXiv:2503.18531  [pdf, other]

    cs.RO cs.LG cs.NE

    Parental Guidance: Efficient Lifelong Learning through Evolutionary Distillation

    Authors: Octi Zhang, Quanquan Peng, Rosario Scalise, Byron Boots

    Abstract: Developing robotic agents that can perform well in diverse environments while showing a variety of behaviors is a key challenge in AI and robotics. Traditional reinforcement learning (RL) methods often create agents that specialize in narrow tasks, limiting their adaptability and diversity. To overcome this, we propose a preliminary, evolution-inspired framework that includes a reproduction module…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 4 pages, 3 figures, CoRL 2024 Workshop MAPoDeL

    ACM Class: F.2.2, I.2.7

  4. arXiv:2502.07380  [pdf, other]

    cs.RO

    Demonstrating Wheeled Lab: Modern Sim2Real for Low-cost, Open-source Wheeled Robotics

    Authors: Tyler Han, Preet Shah, Sidharth Rajagopal, Yanda Bao, Sanghun Jung, Sidharth Talia, Gabriel Guo, Bryan Xu, Bhaumik Mehta, Emma Romig, Rosario Scalise, Byron Boots

    Abstract: Simulation has been pivotal in recent robotics milestones and is poised to play a prominent role in the field's future. However, recent robotic advances often rely on expensive and high-maintenance platforms, limiting access to broader robotics audiences. This work introduces Wheeled Lab, a framework for the low-cost, open-source wheeled platforms that are already widely established in education a…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: Under Review

  5. arXiv:2412.00086  [pdf, other]

    cs.RO cs.LG

    Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning

    Authors: Neel Jawale, Byron Boots, Balakumar Sundaralingam, Mohak Bhardwaj

    Abstract: We investigate the problem of teaching a robot manipulator to perform dynamic non-prehensile object transport, also known as the 'robot waiter' task, from a limited set of real-world demonstrations. We propose an approach that combines batch reinforcement learning (RL) with model-predictive control (MPC) by pretraining an ensemble of value functions from demonstration data, and utilizing them onli…

    Submitted 26 November, 2024; originally announced December 2024.

    Comments: 11 pages

  6. arXiv:2410.24090  [pdf, other]

    cs.RO

    Sparsh: Self-supervised touch representations for vision-based tactile sensing

    Authors: Carolina Higuera, Akash Sharma, Chaithanya Krishna Bodduluri, Taosha Fan, Patrick Lancaster, Mrinal Kalakrishnan, Michael Kaess, Byron Boots, Mike Lambeta, Tingfan Wu, Mustafa Mukadam

    Abstract: In this work, we introduce general purpose touch representations for the increasingly accessible class of vision-based tactile sensors. Such sensors have led to many recent advances in robot manipulation as they markedly complement vision, yet solutions today often rely on task and sensor specific handcrafted perception models. Collecting real data at scale with task centric ground truth labels, l…

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: Conference on Robot Learning (CoRL), 2024

  7. arXiv:2410.20254  [pdf, other]

    cs.LG cs.RO stat.ML

    Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL

    Authors: Andrew Wagenmaker, Kevin Huang, Liyiming Ke, Byron Boots, Kevin Jamieson, Abhishek Gupta

    Abstract: In order to mitigate the sample complexity of real-world reinforcement learning, common practice is to first train a policy in a simulator where samples are cheap, and then deploy this policy in the real world, with the hope that it generalizes effectively. Such direct sim2real transfer is not guaranteed to succeed, however, and in cases where it fails, it is unclear how to best utilize the…

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  8. arXiv:2409.10923  [pdf, other]

    cs.RO

    Agile Continuous Jumping in Discontinuous Terrains

    Authors: Yuxiang Yang, Guanya Shi, Changyi Lin, Xiangyun Meng, Rosario Scalise, Mateo Guaman Castro, Wenhao Yu, Tingnan Zhang, Ding Zhao, Jie Tan, Byron Boots

    Abstract: We focus on agile, continuous, and terrain-adaptive jumping of quadrupedal robots in discontinuous terrains such as stairs and stepping stones. Unlike single-step jumping, continuous jumping requires accurately executing highly dynamic motions over long horizons, which is challenging for existing approaches. To accomplish this task, we design a hierarchical learning and control framework, which co…

    Submitted 20 September, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: Website: https://yxyang.github.io/jumping_cod/

  9. arXiv:2405.16487  [pdf, other]

    cs.RO

    Dynamics Models in the Aggressive Off-Road Driving Regime

    Authors: Tyler Han, Sidharth Talia, Rohan Panicker, Preet Shah, Neel Jawale, Byron Boots

    Abstract: Current developments in autonomous off-road driving are steadily increasing performance through higher speeds and more challenging, unstructured environments. However, this operating regime subjects the vehicle to larger inertial effects, where consideration of higher-order states is necessary to avoid failures such as rollovers or excessive impact forces. Aggressive driving through Model Predicti…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted to ICRA 2024 Workshop on Resilient Off-road Autonomy

  10. arXiv:2403.18197  [pdf, other]

    cs.RO

    LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators

    Authors: Changyi Lin, Xingyu Liu, Yuxiang Yang, Yaru Niu, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots, Ding Zhao

    Abstract: Quadrupedal robots have emerged as versatile agents capable of locomoting and manipulating in complex environments. Traditional designs typically rely on the robot's inherent body parts or incorporate top-mounted arms for manipulation tasks. However, these configurations may limit the robot's operational dexterity, efficiency and adaptability, particularly in cluttered or constrained spaces. In th…

    Submitted 18 October, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Project page: https://linchangyi1.github.io/LocoMan

  11. arXiv:2403.11298  [pdf, other]

    cs.RO

    Multi-Sample Long Range Path Planning under Sensing Uncertainty for Off-Road Autonomous Driving

    Authors: Matt Schmittle, Rohan Baijal, Brian Hou, Siddhartha Srinivasa, Byron Boots

    Abstract: We focus on the problem of long-range dynamic replanning for off-road autonomous vehicles, where a robot plans paths through a previously unobserved environment while continuously receiving noisy local observations. An effective approach for planning under sensing uncertainty is determinization, where one converts a stochastic world into a deterministic one and plans under this simplification. Thi…

    Submitted 17 March, 2024; originally announced March 2024.

  12. arXiv:2312.16016  [pdf, other]

    cs.RO

    V-STRONG: Visual Self-Supervised Traversability Learning for Off-road Navigation

    Authors: Sanghun Jung, JoonHo Lee, Xiangyun Meng, Byron Boots, Alexander Lambert

    Abstract: Reliable estimation of terrain traversability is critical for the successful deployment of autonomous systems in wild, outdoor environments. Given the lack of large-scale annotated datasets for off-road navigation, strictly-supervised learning approaches remain limited in their generalization ability. To this end, we introduce a novel, image-based self-supervised learning method for traversability…

    Submitted 15 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: ICRA 2024; 8 pages

  13. arXiv:2311.12284  [pdf, other]

    cs.RO

    Model Predictive Control for Aggressive Driving Over Uneven Terrain

    Authors: Tyler Han, Alex Liu, Anqi Li, Alex Spitzer, Guanya Shi, Byron Boots

    Abstract: Terrain traversability in unstructured off-road autonomy has traditionally relied on semantic classification, resource-intensive dynamics models, or purely geometry-based methods to predict vehicle-terrain interactions. While inconsequential at low speeds, uneven terrain subjects our full-scale system to safety-critical challenges at operating speeds of 7–10 m/s. This study focuses particularly o…

    Submitted 7 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to R:SS 2024

  14. arXiv:2310.09053  [pdf, other]

    cs.RO cs.AI eess.SY

    DATT: Deep Adaptive Trajectory Tracking for Quadrotor Control

    Authors: Kevin Huang, Rwik Rana, Alexander Spitzer, Guanya Shi, Byron Boots

    Abstract: Precise arbitrary trajectory tracking for quadrotors is challenging due to unknown nonlinear dynamics, trajectory infeasibility, and actuation limits. To tackle these challenges, we present Deep Adaptive Trajectory Tracking (DATT), a learning-based approach that can precisely track arbitrary, potentially infeasible trajectories in the presence of large disturbances in the real world. DATT builds o…

    Submitted 13 December, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

  15. arXiv:2310.04590  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Deep Model Predictive Optimization

    Authors: Jacob Sacks, Rwik Rana, Kevin Huang, Alex Spitzer, Guanya Shi, Byron Boots

    Abstract: A major challenge in robotics is to design robust policies which enable complex and agile behaviors in the real world. On one end of the spectrum, we have model-free reinforcement learning (MFRL), which is incredibly flexible and general but often results in brittle policies. In contrast, model predictive control (MPC) continually re-plans at each time step to remain robust to perturbations and mo…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: Main paper is 6 pages with 4 figures and 1 table. Code available at: https://github.com/jisacks/dmpo

  16. arXiv:2309.16652  [pdf, other]

    cs.RO

    Perceiving Extrinsic Contacts from Touch Improves Learning Insertion Policies

    Authors: Carolina Higuera, Joseph Ortiz, Haozhi Qi, Luis Pineda, Byron Boots, Mustafa Mukadam

    Abstract: Robotic manipulation tasks such as object insertion typically involve interactions between object and environment, namely extrinsic contacts. Prior work on Neural Contact Fields (NCF) use intrinsic tactile sensing between gripper and object to estimate extrinsic contacts in simulation. However, its effectiveness and utility in real-world tasks remains unknown. In this work, we improve NCF to ena…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Under review

  17. arXiv:2309.13523  [pdf, other]

    cs.CV

    LiDAR-UDA: Self-ensembling Through Time for Unsupervised LiDAR Domain Adaptation

    Authors: Amirreza Shaban, JoonHo Lee, Sanghun Jung, Xiangyun Meng, Byron Boots

    Abstract: We introduce LiDAR-UDA, a novel two-stage self-training-based Unsupervised Domain Adaptation (UDA) method for LiDAR segmentation. Existing self-training methods use a model trained on labeled source data to generate pseudo labels for target data and refine the predictions via fine-tuning the network on the pseudo labels. These methods suffer from domain shifts caused by different LiDAR sensor conf…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted ICCV 2023 (Oral)

  18. arXiv:2306.09557  [pdf, other]

    cs.RO

    CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller

    Authors: Yuxiang Yang, Guanya Shi, Xiangyun Meng, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots

    Abstract: We present CAJun, a novel hierarchical learning and control framework that enables legged robots to jump continuously with adaptive jumping distances. CAJun consists of a high-level centroidal policy and a low-level leg controller. In particular, we use reinforcement learning (RL) to train the centroidal policy, which specifies the gait timing, base velocity, and swing foot position for the leg co…

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Please visit https://yxyang.github.io/cajun/ for additional results

  19. arXiv:2305.03735  [pdf, other]

    cs.AI cs.GT cs.MA cs.RO

    Stackelberg Games for Learning Emergent Behaviors During Competitive Autocurricula

    Authors: Boling Yang, Liyuan Zheng, Lillian J. Ratliff, Byron Boots, Joshua R. Smith

    Abstract: Autocurricular training is an important sub-area of multi-agent reinforcement learning (MARL) that allows multiple agents to learn emergent skills in an unsupervised co-evolving scheme. The robotics community has experimented autocurricular training with physically grounded problems, such as robust control and interactive manipulation tasks. However, the asymmetric nature of these tasks makes the…

    Submitted 4 May, 2023; originally announced May 2023.

  20. arXiv:2304.08663  [pdf, other]

    cs.RO cs.AI cs.LG

    Continuous Versatile Jumping Using Learned Action Residuals

    Authors: Yuxiang Yang, Xiangyun Meng, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots

    Abstract: Jumping is essential for legged robots to traverse through difficult terrains. In this work, we propose a hierarchical framework that combines optimal control and reinforcement learning to learn continuous jumping motions for quadrupedal robots. The core of our framework is a stance controller, which combines a manually designed acceleration controller with a learned residual policy. As the accele…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: To be presented at L4DC 2023

  21. arXiv:2304.01182  [pdf, other]

    cs.RO

    Learning to Read Braille: Bridging the Tactile Reality Gap with Diffusion Models

    Authors: Carolina Higuera, Byron Boots, Mustafa Mukadam

    Abstract: Simulating vision-based tactile sensors enables learning models for contact-rich tasks when collecting real world data at scale can be prohibitive. However, modeling the optical response of the gel deformation as well as incorporating the dynamics of the contact makes sim2real challenging. Prior works have explored data augmentation, fine-tuning, or learning generative models to reduce the sim2rea…

    Submitted 3 April, 2023; originally announced April 2023.

  22. arXiv:2303.17156  [pdf, other]

    cs.LG

    MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations

    Authors: Anqi Li, Byron Boots, Ching-An Cheng

    Abstract: We study a new paradigm for sequential decision making, called offline policy learning from observations (PLfO). Offline PLfO aims to learn policies using datasets with substandard qualities: 1) only a subset of trajectories is labeled with rewards, 2) labeled trajectories may not contain actions, 3) labeled trajectories may not be of high quality, and 4) the data may not have full coverage. Such…

    Submitted 6 August, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

  23. arXiv:2303.15771  [pdf, other]

    cs.RO

    TerrainNet: Visual Modeling of Complex Terrain for High-speed, Off-road Navigation

    Authors: Xiangyun Meng, Nathan Hatch, Alexander Lambert, Anqi Li, Nolan Wagener, Matthew Schmittle, JoonHo Lee, Wentao Yuan, Zoey Chen, Samuel Deng, Greg Okopal, Dieter Fox, Byron Boots, Amirreza Shaban

    Abstract: Effective use of camera-based vision systems is essential for robust performance in autonomous off-road driving, particularly in the high-speed regime. Despite success in structured, on-road settings, current end-to-end approaches for scene prediction have yet to be successfully adapted for complex outdoor terrain. To this end, we present TerrainNet, a vision-based terrain perception system for se…

    Submitted 29 May, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  24. arXiv:2302.11048  [pdf, other]

    cs.LG cs.AI

    Adversarial Model for Offline Reinforcement Learning

    Authors: Mohak Bhardwaj, Tengyang Xie, Byron Boots, Nan Jiang, Ching-An Cheng

    Abstract: We propose a novel model-based offline Reinforcement Learning (RL) framework, called Adversarial Model for Offline Reinforcement Learning (ARMOR), which can robustly learn policies to improve upon an arbitrary reference policy regardless of data coverage. ARMOR is designed to optimize policies for the worst-case performance relative to the reference policy through adversarially training a Markov d…

    Submitted 24 December, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted at the Neural Information Processing Systems (NeurIPS), 2023. Mohak Bhardwaj and Tengyang Xie contributed equally to this work. arXiv admin note: text overlap with arXiv:2211.04538

  25. arXiv:2212.02603  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Learning to Optimize in Model Predictive Control

    Authors: Jacob Sacks, Byron Boots

    Abstract: Sampling-based Model Predictive Control (MPC) is a flexible control framework that can reason about non-smooth dynamics and cost functions. Recently, significant work has focused on the use of machine learning to improve the performance of MPC, often through learning or fine-tuning the dynamics or cost function. In contrast, we focus on learning to optimize more effectively. In other words, to imp…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Proceedings of the IEEE Conference on Robotics and Automation (ICRA), 2022. Paper is 6 pages with 2 figures and 2 tables

    Journal ref: In 2022 International Conference on Robotics and Automation (ICRA), pp. 10549-10556. IEEE, 2022

  26. arXiv:2212.02587  [pdf, other]

    cs.RO cs.AI eess.SY

    Learning Sampling Distributions for Model Predictive Control

    Authors: Jacob Sacks, Byron Boots

    Abstract: Sampling-based methods have become a cornerstone of contemporary approaches to Model Predictive Control (MPC), as they make no restrictions on the differentiability of the dynamics or cost function and are straightforward to parallelize. However, their efficacy is highly dependent on the quality of the sampling distribution itself, which is often assumed to be simple, like a Gaussian. This restric…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted at the Conference on Robot Learning (CoRL), 2022. Main paper is 9 pages with 4 figures. Appendix is 12 pages with 11 figures and 1 table

  27. arXiv:2210.12284  [pdf, other]

    quant-ph

    Scalable Measurement Error Mitigation via Iterative Bayesian Unfolding

    Authors: Siddarth Srinivasan, Bibek Pokharel, Gregory Quiroz, Byron Boots

    Abstract: Measurement error mitigation (MEM) techniques are postprocessing strategies to counteract systematic read-out errors on quantum computers (QC). Currently used MEM strategies face a tradeoff: methods that scale well with the number of qubits return negative probabilities, while those that guarantee a valid probability distribution are not scalable. Here, we present a scheme that addresses both of t…

    Submitted 21 October, 2022; originally announced October 2022.

  28. arXiv:2210.12209  [pdf, other]

    cs.RO cs.AI

    Motion Policy Networks

    Authors: Adam Fishman, Adithyavairan Murali, Clemens Eppner, Bryan Peele, Byron Boots, Dieter Fox

    Abstract: Collision-free motion generation in unknown environments is a core building block for robot manipulation. Generating such motions is challenging due to multiple objectives; not only should the solutions be optimal, the motion generator itself must be fast enough for real-time performance and reliable enough for practical deployment. A wide variety of methods have been proposed ranging from local c…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: To be published in the Conference on Robot Learning (CoRL) 2022. 10 pages with 4 figures. Appendix has 10 pages and 1 figure

  29. arXiv:2210.09297  [pdf, other]

    cs.RO cs.CV

    Neural Contact Fields: Tracking Extrinsic Contact with Tactile Sensing

    Authors: Carolina Higuera, Siyuan Dong, Byron Boots, Mustafa Mukadam

    Abstract: We present Neural Contact Fields, a method that brings together neural fields and tactile sensing to address the problem of tracking extrinsic contact between object and environment. Knowing where the external contact occurs is a first step towards methods that can actively control it in facilitating downstream manipulation tasks. Prior work for localizing environmental contacts typically assume a…

    Submitted 13 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 2023 International Conference on Robotics and Automation (ICRA)

  30. arXiv:2206.13631  [pdf, other]

    cs.RO cs.AI

    Learning Semantics-Aware Locomotion Skills from Human Demonstration

    Authors: Yuxiang Yang, Xiangyun Meng, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots

    Abstract: The semantics of the environment, such as the terrain type and property, reveals important information for legged robots to adjust their behaviors. In this work, we present a framework that learns semantics-aware locomotion skills from perception for quadrupedal robots, such that the robot can traverse through complex offroad terrains with appropriate speeds and gaits using perception information.…

    Submitted 10 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

  31. arXiv:2206.00205  [pdf, other]

    cs.CV

    CAFA: Class-Aware Feature Alignment for Test-Time Adaptation

    Authors: Sanghun Jung, Jungsoo Lee, Nanhee Kim, Amirreza Shaban, Byron Boots, Jaegul Choo

    Abstract: Despite recent advancements in deep learning, deep neural networks continue to suffer from performance degradation when applied to new data that differs from training data. Test-time adaptation (TTA) aims to address this challenge by adapting a model to unlabeled data at test time. TTA can be applied to pretrained networks without modifying their training procedures, enabling them to utilize a wel…

    Submitted 3 September, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  32. Learning Implicit Priors for Motion Optimization

    Authors: Julen Urain, An T. Le, Alexander Lambert, Georgia Chalvatzaki, Byron Boots, Jan Peters

    Abstract: In this paper, we focus on the problem of integrating Energy-based Models (EBM) as guiding priors for motion optimization. EBMs are a set of neural networks that can represent expressive probability density distributions in terms of a Gibbs distribution parameterized by a suitable energy function. Due to their implicit nature, they can easily be integrated as optimization factors or as initial sam…

    Submitted 11 January, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: 17 pages, accepted at IEEE/RSJ IROS 2022, paper website: https://sites.google.com/view/implicit-priors/home

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 7672-7679

  33. arXiv:2202.07068  [pdf, other]

    cs.RO cs.AI cs.HC cs.MA

    Motivating Physical Activity via Competitive Human-Robot Interaction

    Authors: Boling Yang, Golnaz Habibi, Patrick E. Lancaster, Byron Boots, Joshua R. Smith

    Abstract: This project aims to motivate research in competitive human-robot interaction by creating a robot competitor that can challenge human users in certain scenarios such as physical exercise and games. With this goal in mind, we introduce the Fencing Game, a human-robot competition used to evaluate both the capabilities of the robot competitor and user experience. We develop the robot competitor throu…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Conference on Robot Learning. PMLR, 2022

  34. arXiv:2111.07986  [pdf, other]

    cs.RO cs.LG eess.SY

    Nonprehensile Riemannian Motion Predictive Control

    Authors: Hamid Izadinia, Byron Boots, Steven M. Seitz

    Abstract: Nonprehensile manipulation involves long horizon underactuated object interactions and physical contact with different objects that can inherently introduce a high degree of uncertainty. In this work, we introduce a novel Real-to-Sim reward analysis technique, called Riemannian Motion Predictive Control (RMPC), to reliably imagine and predict the outcome of taking possible actions for a real robot…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: To appear at International Symposium on Experimental Robotics (ISER)

  35. arXiv:2111.02972  [pdf, other]

    cs.RO

    Stein Variational Probabilistic Roadmaps

    Authors: Alexander Lambert, Brian Hou, Rosario Scalise, Siddhartha S. Srinivasa, Byron Boots

    Abstract: Efficient and reliable generation of global path plans are necessary for safe execution and deployment of autonomous systems. In order to generate planning graphs which adequately resolve the topology of a given environment, many sampling-based motion planners resort to coarse, heuristically-driven strategies which often fail to generalize to new and varied surroundings. Further, many of these app…

    Submitted 20 May, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: Pre-print

  36. arXiv:2110.04669  [pdf, other]

    cs.RO cs.LG

    Leveraging Experience in Lazy Search

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots, Siddhartha Srinivasa

    Abstract: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck. These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found. The order in which edges are selected is critical to minimizing the total number of edge evaluations: a…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Extended journal version accepted for publication at Autonomous Robots; 17 pages. arXiv admin note: substantial text overlap with arXiv:1907.07238

  37. arXiv:2109.10443  [pdf, other]

    cs.RO eess.SY

    Geometric Fabrics: Generalizing Classical Mechanics to Capture the Physics of Behavior

    Authors: Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana, Buck Babich, Bryan Peele, Qian Wan, Iretiayo Akinola, Balakumar Sundaralingam, Dieter Fox, Byron Boots, Nathan D. Ratliff

    Abstract: Classical mechanical systems are central to controller design in energy shaping methods of geometric control. However, their expressivity is limited by position-only metrics and the intimate link between metric and geometry. Recent work on Riemannian Motion Policies (RMPs) has shown that shedding these restrictions results in powerful design tools, but at the expense of theoretical stability guara…

    Submitted 18 January, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  38. arXiv:2107.05146  [pdf, other]

    cs.RO

    Entropy Regularized Motion Planning via Stein Variational Inference

    Authors: Alexander Lambert, Byron Boots

    Abstract: Many Imitation and Reinforcement Learning approaches rely on the availability of expert-generated demonstrations for learning policies or value functions from data. Obtaining a reliable distribution of trajectories from motion planners is non-trivial, since it must broadly cover the space of states likely to be encountered during execution while also satisfying task-based constraints. We propose a…

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: RSS 2021 Workshop on Integrating Planning and Learning

  39. arXiv:2106.09110  [pdf, other]

    cs.LG cs.RO eess.SY

    Safe Reinforcement Learning Using Advantage-Based Intervention

    Authors: Nolan Wagener, Byron Boots, Ching-An Cheng

    Abstract: Many sequential decision problems involve finding a policy that maximizes total reward while obeying safety constraints. Although much recent research has focused on the development of safe reinforcement learning (RL) algorithms that produce a safe policy after training, ensuring safety during training as well remains an open problem. A fundamental challenge is performing exploration while still s…

    Submitted 19 July, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Appearing in ICML 2021. 29 pages, 8 figures

  40. arXiv:2105.03019  [pdf, other]

    cs.RO cs.LG

    Imitation Learning via Simultaneous Optimization of Policies and Auxiliary Trajectories

    Authors: Mandy Xie, Anqi Li, Karl Van Wyk, Frank Dellaert, Byron Boots, Nathan Ratliff

    Abstract: Imitation learning (IL) is a frequently used approach for data-efficient policy learning. Many IL methods, such as Dataset Aggregation (DAgger), combat challenges like distributional shift by interacting with oracular experts. Unfortunately, assuming access to oracular experts is often unrealistic in practice; data used in IL frequently comes from offline processes such as lead-through or teleoper…

    Submitted 5 June, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

  41. arXiv:2104.13542  [pdf, other]

    cs.RO

    STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation

    Authors: Mohak Bhardwaj, Balakumar Sundaralingam, Arsalan Mousavian, Nathan Ratliff, Dieter Fox, Fabio Ramos, Byron Boots

    Abstract: Sampling-based model-predictive control (MPC) is a promising tool for feedback control of robots with complex, non-smooth dynamics, and cost functions. However, the computationally demanding nature of sampling-based MPC algorithms has been a key bottleneck in their application to high-dimensional robotic manipulation problems in the real world. Previous methods have addressed this issue by running…

    Submitted 14 September, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted for oral presentation at the Conference on Robot Learning (CoRL), 2021. Code available at: https://github.com/NVlabs/storm

    Journal ref: 5th Annual Conference on Robot Learning, 2021

  42. arXiv:2104.04644  [pdf, other]

    cs.RO cs.LG

    Fast and Efficient Locomotion via Learned Gait Transitions

    Authors: Yuxiang Yang, Tingnan Zhang, Erwin Coumans, Jie Tan, Byron Boots

    Abstract: We focus on the problem of developing energy efficient controllers for quadrupedal robots. Animals can actively switch gaits at different speeds to lower their energy consumption. In this paper, we devise a hierarchical learning framework, in which distinctive locomotion gaits and natural gait transitions emerge automatically with a simple reward of energy minimization. We use evolutionary strateg…

    Submitted 22 November, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: Published in CoRL 2021. Website: https://sites.google.com/view/fast-and-efficient Code: https://github.com/yxyang/fast_and_efficient

  43. arXiv:2104.02863  [pdf, other]

    cs.RO cs.LG

    The Value of Planning for Infinite-Horizon Model Predictive Control

    Authors: Nathan Hatch, Byron Boots

    Abstract: Model Predictive Control (MPC) is a classic tool for optimal control of complex, real-world systems. Although it has been successfully applied to a wide range of challenging tasks in robotics, it is fundamentally limited by the prediction horizon, which, if too short, will result in myopic decisions. Recently, several papers have suggested using a learned value function as the terminal cost for MP…

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 7 pages, 8 figures. To appear in the proceedings of the International Conference on Robotics and Automation (ICRA) 2021

  44. arXiv:2103.14162  [pdf, other]

    cs.CV

    Few-shot Weakly-Supervised Object Detection via Directional Statistics

    Authors: Amirreza Shaban, Amir Rahimi, Thalaiyasingam Ajanthan, Byron Boots, Richard Hartley

    Abstract: Detecting novel objects from few examples has become an emerging topic in computer vision recently. However, these methods need fully annotated training images to learn new object categories which limits their applicability in real world scenarios such as field robotics. In this work, we propose a probabilistic multiple instance learning approach for few-shot Common Object Localization (COL) and f…

    Submitted 25 March, 2021; originally announced March 2021.

  45. arXiv:2103.12890  [pdf, other]

    cs.RO cs.LG

    Dual Online Stein Variational Inference for Control and Dynamics

    Authors: Lucas Barcelos, Alexander Lambert, Rafael Oliveira, Paulo Borges, Byron Boots, Fabio Ramos

    Abstract: Model predictive control (MPC) schemes have a proven track record for delivering aggressive and robust performance in many challenging control tasks, coping with nonlinear system dynamics, constraints, and observational noise. Despite their success, these methods often rely on simple control distributions, which can limit their performance in highly uncertain and complex environments. MPC framewor…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Corresponding author: [email protected]

  46. arXiv:2103.05922  [pdf, other]

    cs.RO cs.LG eess.SY

    RMP2: A Structured Composable Policy Class for Robot Learning

    Authors: Anqi Li, Ching-An Cheng, M. Asif Rana, Man Xie, Karl Van Wyk, Nathan Ratliff, Byron Boots

    Abstract: We consider the problem of learning motion policies for acceleration-based robotics systems with a structured policy class specified by RMPflow. RMPflow is a multi-task control framework that has been successfully applied in many robotics problems. Using RMPflow as a structured policy class in learning has several benefits, such as sufficient expressiveness, the flexibility to inject different lev…

    Submitted 10 March, 2021; originally announced March 2021.

  47. Combining pretrained CNN feature extractors to enhance clustering of complex natural images

    Authors: Joris Guerin, Stephane Thiery, Eric Nyiri, Olivier Gibaru, Byron Boots

    Abstract: Recently, a common starting point for solving complex unsupervised image classification tasks is to use generic features, extracted with deep Convolutional Neural Networks (CNN) pretrained on a large and versatile dataset (ImageNet). However, in most research, the CNN architecture for feature extraction is chosen arbitrarily, without justification. This paper aims at providing insight on the use o…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 21 pages, 16 figures, 10 tables, preprint of our paper published in Neurocomputing

    Journal ref: Guerin, J., Thiery, S., Nyiri, E., Gibaru, O., & Boots, B. (2021). Combining pretrained CNN feature extractors to enhance clustering of complex natural images. Neurocomputing, 423, 551-571

  48. arXiv:2012.13457  [pdf, other]

    cs.RO cs.LG

    Towards Coordinated Robot Motions: End-to-End Learning of Motion Policies on Transform Trees

    Authors: M. Asif Rana, Anqi Li, Dieter Fox, Sonia Chernova, Byron Boots, Nathan Ratliff

    Abstract: Generating robot motion that fulfills multiple tasks simultaneously is challenging due to the geometric constraints imposed by the robot. In this paper, we propose to solve multi-task problems through learning structured policies from human demonstrations. Our structured policy is inspired by RMPflow, a framework for combining subtask policies on different spaces. The policy structure provides the…

    Submitted 10 March, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

  49. arXiv:2012.05909  [pdf, other]

    cs.LG cs.RO

    Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

    Abstract: Model-Predictive Control (MPC) is a powerful tool for controlling complex, real-world systems that uses a model to make predictions about future behavior. For each state encountered, MPC solves an online optimization problem to choose a control action that will minimize future cost. This is a surprisingly effective strategy, but real-time performance requirements warrant the use of simple models.…

    Submitted 13 April, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 15 pages

    Journal ref: International Conference on Learning Representations (ICLR), 2021

  50. arXiv:2011.07641  [pdf, other]

    cs.RO cs.AI

    Stein Variational Model Predictive Control

    Authors: Alexander Lambert, Adam Fishman, Dieter Fox, Byron Boots, Fabio Ramos

    Abstract: Decision making under uncertainty is critical to real-world, autonomous systems. Model Predictive Control (MPC) methods have demonstrated favorable performance in practice, but remain limited when dealing with complex probability distributions. In this paper, we propose a generalization of MPC that represents a multitude of solutions as posterior distributions. By casting MPC as a Bayesian inferen…

    Submitted 12 April, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted to Conference on Robot Learning (CoRL) 2020