Skip to main content

Showing 1–50 of 79 results for author: Sreenath, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.13834  [pdf, ps, other

    cs.RO cs.AI

    Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams

    Authors: Zhi Su, Yuman Gao, Emily Lukas, Yunfei Li, Jiaze Cai, Faris Tulbah, Fei Gao, Chao Yu, Zhongyu Li, Yi Wu, Koushil Sreenath

    Abstract: Achieving coordinated teamwork among legged robots requires both fine-grained locomotion control and long-horizon strategic decision-making. Robot soccer offers a compelling testbed for this challenge, combining dynamic, competitive, and multi-agent interactions. In this work, we present a hierarchical multi-agent reinforcement learning (MARL) framework that enables fully autonomous and decentrali… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 11 pages, 12 figures

  2. arXiv:2505.07294  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    HuB: Learning Extreme Humanoid Balance

    Authors: Tong Zhang, Boyuan Zheng, Ruiqian Nai, Yingdong Hu, Yen-Jen Wang, Geng Chen, Fanqi Lin, Jiongye Li, Chuye Hong, Koushil Sreenath, Yang Gao

    Abstract: The human body demonstrates exceptional motor capabilities-such as standing steadily on one foot or performing a high kick with the leg raised over 1.5 meters-both requiring precise balance control. While recent research on humanoid control has leveraged reinforcement learning to track human motions for skill acquisition, applying this paradigm to balance-intensive tasks remains challenging. In th… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Project website: https://hub-robot.github.io

  3. arXiv:2504.21738  [pdf, ps, other

    cs.RO

    LangWBC: Language-directed Humanoid Whole-Body Control via End-to-end Learning

    Authors: Yiyang Shao, Xiaoyu Huang, Bike Zhang, Qiayuan Liao, Yuman Gao, Yufeng Chi, Zhongyu Li, Sophia Shao, Koushil Sreenath

    Abstract: General-purpose humanoid robots are expected to interact intuitively with humans, enabling seamless integration into daily life. Natural language provides the most accessible medium for this purpose. However, translating language into humanoid whole-body motion remains a significant challenge, primarily due to the gap between linguistic understanding and physical actions. In this work, we present… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

  4. arXiv:2504.17249  [pdf, other

    cs.RO

    Demonstrating Berkeley Humanoid Lite: An Open-source, Accessible, and Customizable 3D-printed Humanoid Robot

    Authors: Yufeng Chi, Qiayuan Liao, Junfeng Long, Xiaoyu Huang, Sophia Shao, Borivoje Nikolic, Zhongyu Li, Koushil Sreenath

    Abstract: Despite significant interest and advancements in humanoid robotics, most existing commercially available hardware remains high-cost, closed-source, and non-transparent within the robotics community. This lack of accessibility and customization hinders the growth of the field and the broader development of humanoid technologies. To address these challenges and promote democratization in humanoid ro… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: Accepted in Robotics: Science and Systems (RSS) 2025

  5. arXiv:2503.18221  [pdf, other

    cs.RO

    Decentralized Navigation of a Cable-Towed Load using Quadrupedal Robot Team via MARL

    Authors: Wen-Tse Chen, Minh Nguyen, Zhongyu Li, Guo Ning Sue, Koushil Sreenath

    Abstract: This work addresses the challenge of enabling a team of quadrupedal robots to collaboratively tow a cable-connected load through cluttered and unstructured environments while avoiding obstacles. Leveraging cables allows the multi-robot system to navigate narrow spaces by maintaining slack when necessary. However, this introduces hybrid physical interactions due to alternating taut and slack states… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  6. arXiv:2503.11801  [pdf, ps, other

    cs.GR cs.LG cs.RO

    Diffuse-CLoC: Guided Diffusion for Physics-based Character Look-ahead Control

    Authors: Xiaoyu Huang, Takara Truong, Yunbo Zhang, Fangzhou Yu, Jean Pierre Sleiman, Jessica Hodgins, Koushil Sreenath, Farbod Farshidian

    Abstract: We present Diffuse-CLoC, a guided diffusion framework for physics-based look-ahead control that enables intuitive, steerable, and physically realistic motion generation. While existing kinematics motion generation with diffusion models offer intuitive steering capabilities with inference-time conditioning, they often fail to produce physically viable motions. In contrast, recent diffusion-based co… ▽ More

    Submitted 1 July, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  7. arXiv:2503.07771  [pdf, other

    cs.RO

    RoboCopilot: Human-in-the-loop Interactive Imitation Learning for Robot Manipulation

    Authors: Philipp Wu, Yide Shentu, Qiayuan Liao, Ding Jin, Menglong Guo, Koushil Sreenath, Xingyu Lin, Pieter Abbeel

    Abstract: Learning from human demonstration is an effective approach for learning complex manipulation skills. However, existing approaches heavily focus on learning from passive human demonstration data for its simplicity in data collection. Interactive human teaching has appealing theoretical and practical properties, but they are not well supported by existing human-robot interfaces. This paper proposes… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  8. arXiv:2503.00606  [pdf, other

    cs.RO

    Dynamic Collision Avoidance Using Velocity Obstacle-Based Control Barrier Functions

    Authors: Jihao Huang, Jun Zeng, Xuemin Chi, Koushil Sreenath, Zhitao Liu, Hongye Su

    Abstract: Designing safety-critical controllers for acceleration-controlled unicycle robots is challenging, as control inputs may not appear in the constraints of control Lyapunov functions(CLFs) and control barrier functions (CBFs), leading to invalid controllers. Existing methods often rely on state-feedback-based CLFs and high-order CBFs (HOCBFs), which are computationally expensive to construct and fail… ▽ More

    Submitted 8 March, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

    Comments: Accepted by IEEE TCST

  9. arXiv:2502.15043  [pdf, other

    cs.RO eess.SY

    DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories

    Authors: Jean-Baptiste Bouvier, Kanghyun Ryu, Kartik Nagpal, Qiayuan Liao, Koushil Sreenath, Negar Mehr

    Abstract: Diffusion models excel at creating images and videos thanks to their multimodal generative capabilities. These same capabilities have made diffusion models increasingly popular in robotics research, where they are used for generating robot motion. However, the stochastic nature of diffusion models is fundamentally at odds with the precise dynamical equations describing the feasible motion of robot… ▽ More

    Submitted 26 April, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Journal ref: Robotics: Science and Systems (RSS) 2025

  10. arXiv:2502.08844  [pdf, other

    cs.RO

    MuJoCo Playground

    Authors: Kevin Zakka, Baruch Tabanpour, Qiayuan Liao, Mustafa Haiderbhai, Samuel Holt, Jing Yuan Luo, Arthur Allshire, Erik Frey, Koushil Sreenath, Lueder A. Kahrs, Carmelo Sferrazza, Yuval Tassa, Pieter Abbeel

    Abstract: We introduce MuJoCo Playground, a fully open-source framework for robot learning built with MJX, with the express goal of streamlining simulation, training, and sim-to-real transfer onto robots. With a simple "pip install playground", researchers can train policies in minutes on a single GPU. Playground supports diverse robotic platforms, including quadrupeds, humanoids, dexterous hands, and robot… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  11. arXiv:2412.14803  [pdf, other

    cs.CV cs.RO

    Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

    Authors: Yucheng Hu, Yanjiang Guo, Pengchao Wang, Xiaoyu Chen, Yen-Jen Wang, Jianke Zhang, Koushil Sreenath, Chaochao Lu, Jianyu Chen

    Abstract: Visual representations play a crucial role in developing generalist robotic policies. Previous vision encoders, typically pre-trained with single-image reconstruction or two-image contrastive learning, tend to capture static information, often neglecting the dynamic aspects vital for embodied tasks. Recently, video diffusion models (VDMs) demonstrate the ability to predict future frames and showca… ▽ More

    Submitted 4 May, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: ICML 2025 Spotlight Paper. The first two authors contribute equally

  12. arXiv:2411.15130  [pdf, other

    cs.RO eess.SY

    Learning-based Trajectory Tracking for Bird-inspired Flapping-Wing Robots

    Authors: Jiaze Cai, Vishnu Sangli, Mintae Kim, Koushil Sreenath

    Abstract: Bird-sized flapping-wing robots offer significant potential for agile flight in complex environments, but achieving agile and robust trajectory tracking remains a challenge due to the complex aerodynamics and highly nonlinear dynamics inherent in flapping-wing flight. In this work, a learning-based control approach is introduced to unlock the versatility and adaptiveness of flapping-wing flight. W… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

  13. arXiv:2411.04494  [pdf, other

    cs.RO eess.SY

    Online Omnidirectional Jumping Trajectory Planning for Quadrupedal Robots on Uneven Terrains

    Authors: Linzhu Yue, Zhitao Song, Jinhu Dong, Zhongyu Li, Hongbo Zhang, Lingwei Zhang, Xuanqi Zeng, Koushil Sreenath, Yun-hui Liu

    Abstract: Natural terrain complexity often necessitates agile movements like jumping in animals to improve traversal efficiency. To enable similar capabilities in quadruped robots, complex real-time jumping maneuvers are required. Current research does not adequately address the problem of online omnidirectional jumping and neglects the robot's kinodynamic constraints during trajectory generation. This pape… ▽ More

    Submitted 9 November, 2024; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: Submitted to IJRR

  14. arXiv:2410.13418  [pdf, other

    cs.RO

    Interactive Navigation with Adaptive Non-prehensile Mobile Manipulation

    Authors: Cunxi Dai, Xiaohan Liu, Koushil Sreenath, Zhongyu Li, Ralph Hollis

    Abstract: This paper introduces a framework for interactive navigation through adaptive non-prehensile mobile manipulation. A key challenge in this process is handling objects with unknown dynamics, which are difficult to infer from visual observation. To address this, we propose an adaptive dynamics model for common movable indoor objects via learned SE(2) dynamics representations. This model is integrated… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 7 pages, 8 figures

  15. arXiv:2410.11825  [pdf, other

    cs.RO cs.AI

    Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies

    Authors: Zixuan Chen, Xialin He, Yen-Jen Wang, Qiayuan Liao, Yanjie Ze, Zhongyu Li, S. Shankar Sastry, Jiajun Wu, Koushil Sreenath, Saurabh Gupta, Xue Bin Peng

    Abstract: Reinforcement learning combined with sim-to-real transfer offers a general framework for developing locomotion controllers for legged robots. To facilitate successful deployment in the real world, smoothing techniques, such as low-pass filters and smoothness rewards, are often employed to develop policies with smooth behaviors. However, because these techniques are non-differentiable and usually r… ▽ More

    Submitted 28 October, 2024; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: 8 pages

  16. arXiv:2410.10621  [pdf, other

    cs.RO

    Traversability-Aware Legged Navigation by Learning from Real-World Visual Data

    Authors: Hongbo Zhang, Zhongyu Li, Xuanqi Zeng, Laura Smith, Kyle Stachowicz, Dhruv Shah, Linzhu Yue, Zhitao Song, Weipeng Xia, Sergey Levine, Koushil Sreenath, Yun-hui Liu

    Abstract: The enhanced mobility brought by legged locomotion empowers quadrupedal robots to navigate through complex and unstructured environments. However, optimizing agile locomotion while accounting for the varying energy costs of traversing different terrains remains an open challenge. Most previous work focuses on planning trajectories with traversability cost estimation based on human-labeled environm… ▽ More

    Submitted 11 November, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  17. arXiv:2409.18382  [pdf, other

    cs.RO cs.LG eess.SY

    CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models

    Authors: Kanghyun Ryu, Qiayuan Liao, Zhongyu Li, Payam Delgosha, Koushil Sreenath, Negar Mehr

    Abstract: Curriculum learning is a training mechanism in reinforcement learning (RL) that facilitates the achievement of complex policies by progressively increasing the task difficulty during training. However, designing effective curricula for a specific task often requires extensive domain knowledge and human intervention, which limits its applicability across various domains. Our core idea is that large… ▽ More

    Submitted 14 April, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted to ICRA 2025

  18. arXiv:2409.16301  [pdf, other

    cs.RO cs.LG eess.SY

    Gait Switching and Enhanced Stabilization of Walking Robots with Deep Learning-based Reachability: A Case Study on Two-link Walker

    Authors: Xingpeng Xia, Jason J. Choi, Ayush Agrawal, Koushil Sreenath, Claire J. Tomlin, Somil Bansal

    Abstract: Learning-based approaches have recently shown notable success in legged locomotion. However, these approaches often lack accountability, necessitating empirical tests to determine their effectiveness. In this work, we are interested in designing a learning-based locomotion controller whose stability can be examined and guaranteed. This can be achieved by verifying regions of attraction (RoAs) of l… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: The first two authors contributed equally. This work is supported in part by the NSF Grant CMMI-1944722, the NSF CAREER Program under award 2240163, the NASA ULI on Safe Aviation Autonomy, and the DARPA Assured Autonomy and Assured Neuro Symbolic Learning and Reasoning (ANSR) programs. The work of Jason J. Choi received the support of a fellowship from Kwanjeong Educational Foundation, Korea

  19. arXiv:2407.21781  [pdf, other

    cs.RO

    Berkeley Humanoid: A Research Platform for Learning-based Control

    Authors: Qiayuan Liao, Bike Zhang, Xuanyu Huang, Xiaoyu Huang, Zhongyu Li, Koushil Sreenath

    Abstract: We introduce Berkeley Humanoid, a reliable and low-cost mid-scale humanoid research platform for learning-based control. Our lightweight, in-house-built robot is designed specifically for learning algorithms with low simulation complexity, anthropomorphic motion, and high reliability against falls. The robot's narrow sim-to-real gap enables agile and robust locomotion across various terrains in ou… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 12 pages, 9 figures

  20. arXiv:2407.06584  [pdf, other

    cs.RO

    HiLMa-Res: A General Hierarchical Framework via Residual RL for Combining Quadrupedal Locomotion and Manipulation

    Authors: Xiaoyu Huang, Qiayuan Liao, Yiming Ni, Zhongyu Li, Laura Smith, Sergey Levine, Xue Bin Peng, Koushil Sreenath

    Abstract: This work presents HiLMa-Res, a hierarchical framework leveraging reinforcement learning to tackle manipulation tasks while performing continuous locomotion using quadrupedal robots. Unlike most previous efforts that focus on solving a specific task, HiLMa-Res is designed to be general for various loco-manipulation tasks that require quadrupedal robots to maintain sustained mobility. The novel des… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: IROS 2024

  21. arXiv:2404.19264  [pdf, other

    cs.RO

    DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

    Authors: Xiaoyu Huang, Yufeng Chi, Ruofeng Wang, Zhongyu Li, Xue Bin Peng, Sophia Shao, Borivoje Nikolic, Koushil Sreenath

    Abstract: This work introduces DiffuseLoco, a framework for training multi-skill diffusion-based policies for dynamic legged locomotion from offline datasets, enabling real-time control of diverse skills on robots in the real world. Offline learning at scale has led to breakthroughs in computer vision, natural language processing, and robotic manipulation domains. However, scaling up learning for legged rob… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  22. arXiv:2404.05291  [pdf, other

    cs.RO

    Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models

    Authors: Yutao Ouyang, Jinhan Li, Yunfei Li, Zhongyu Li, Chao Yu, Koushil Sreenath, Yi Wu

    Abstract: We present a large language model (LLM) based system to empower quadrupedal robots with problem-solving abilities for long-horizon tasks beyond short-term motions. Long-horizon tasks for quadrupeds are challenging since they require both a high-level understanding of the semantics of the problem for task planning and a broad range of locomotion and manipulation skills to interact with the environm… ▽ More

    Submitted 19 March, 2025; v1 submitted 8 April, 2024; originally announced April 2024.

  23. arXiv:2403.20328  [pdf, other

    cs.RO cs.LG

    Learning Visual Quadrupedal Loco-Manipulation from Demonstrations

    Authors: Zhengmao He, Kun Lei, Yanjie Ze, Koushil Sreenath, Zhongyu Li, Huazhe Xu

    Abstract: Quadruped robots are progressively being integrated into human environments. Despite the growing locomotion capabilities of quadrupedal robots, their interaction with objects in realistic scenes is still limited. While additional robotic arms on quadrupedal robots enable manipulating objects, they are sometimes redundant given that a quadruped robot is essentially a mobile unit equipped with four… ▽ More

    Submitted 2 August, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Published at IROS 2024. Project website: https://zhengmaohe.github.io/leg-manip

  24. arXiv:2403.20001  [pdf, other

    cs.RO

    Adaptive Energy Regularization for Autonomous Gait Transition and Energy-Efficient Quadruped Locomotion

    Authors: Boyuan Liang, Lingfeng Sun, Xinghao Zhu, Bike Zhang, Ziyin Xiong, Yixiao Wang, Chenran Li, Koushil Sreenath, Masayoshi Tomizuka

    Abstract: In reinforcement learning for legged robot locomotion, crafting effective reward strategies is crucial. Pre-defined gait patterns and complex reward systems are widely used to stabilize policy training. Drawing from the natural locomotion behaviors of humans and animals, which adapt their gaits to minimize energy consumption, we propose a simplified, energy-centric reward strategy to foster the de… ▽ More

    Submitted 5 March, 2025; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 pages, 7 figures

    Journal ref: ICRA 2025

  25. Leveraging Symmetry in RL-based Legged Locomotion Control

    Authors: Zhi Su, Xiaoyu Huang, Daniel OrdoƱez-Apraez, Yunfei Li, Zhongyu Li, Qiayuan Liao, Giulio Turrisi, Massimiliano Pontil, Claudio Semini, Yi Wu, Koushil Sreenath

    Abstract: Model-free reinforcement learning is a promising approach for autonomously solving challenging robotics control problems, but faces exploration difficulty without information of the robot's kinematics and dynamics morphology. The under-exploration of multiple modalities with symmetric states leads to behaviors that are often unnatural and sub-optimal. This issue becomes particularly pronounced in… ▽ More

    Submitted 11 March, 2025; v1 submitted 25 March, 2024; originally announced March 2024.

    Journal ref: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 6899-6906

  26. arXiv:2402.19469  [pdf, other

    cs.RO cs.CV cs.LG

    Humanoid Locomotion as Next Token Prediction

    Authors: Ilija Radosavovic, Bike Zhang, Baifeng Shi, Jathushan Rajasegaran, Sarthak Kamat, Trevor Darrell, Koushil Sreenath, Jitendra Malik

    Abstract: We cast real-world humanoid control as a next token prediction problem, akin to predicting the next word in language. Our model is a causal transformer trained via autoregressive prediction of sensorimotor trajectories. To account for the multi-modal nature of the data, we perform prediction in a modality-aligned way, and for each input token predict the next token from the same modality. This gen… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  27. arXiv:2402.05279  [pdf, other

    cs.LG

    Safety Filters for Black-Box Dynamical Systems by Learning Discriminating Hyperplanes

    Authors: Will Lavanakul, Jason J. Choi, Koushil Sreenath, Claire J. Tomlin

    Abstract: Learning-based approaches are emerging as an effective approach for safety filters for black-box dynamical systems. Existing methods have relied on certificate functions like Control Barrier Functions (CBFs) and Hamilton-Jacobi (HJ) reachability value functions. The primary motivation for our work is the recognition that ultimately, enforcing the safety constraint as a control input constraint at… ▽ More

    Submitted 21 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: * Indicate co-first authors. This is an extended version of the paper presented at L4DC 2024

  28. arXiv:2401.16889  [pdf, other

    cs.RO cs.AI eess.SY

    Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control

    Authors: Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: This paper presents a comprehensive study on using deep reinforcement learning (RL) to create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single locomotion skill, we develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing. Our RL-based controller incorporates a n… ▽ More

    Submitted 26 August, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted in International Journal of Robotics Research (IJRR) 2024. This is the author's version and will no longer be updated as the copyright may get transferred at anytime

  29. arXiv:2311.13824  [pdf, other

    cs.RO eess.SY

    Constraint-Guided Online Data Selection for Scalable Data-Driven Safety Filters in Uncertain Robotic Systems

    Authors: Jason J. Choi, Fernando CastaƱeda, Wonsuhk Jung, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: As the use of autonomous robots expands in tasks that are complex and challenging to model, the demand for robust data-driven control methods that can certify safety and stability in uncertain conditions is increasing. However, the practical implementation of these methods often faces scalability issues due to the growing amount of data points with system complexity, and a significant reliance on… ▽ More

    Submitted 27 September, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: The first three authors contributed equally to the work. This work has been submitted to the IEEE for possible publication

  30. arXiv:2309.09969  [pdf, other

    cs.RO cs.LG eess.SY

    Prompt a Robot to Walk with Large Language Models

    Authors: Yen-Jen Wang, Bike Zhang, Jianyu Chen, Koushil Sreenath

    Abstract: Large language models (LLMs) pre-trained on vast internet-scale data have showcased remarkable capabilities across diverse domains. Recently, there has been escalating interest in deploying LLMs for robotics, aiming to harness the power of foundation models in real-world settings. However, this approach faces significant challenges, particularly in grounding these models in the physical world and… ▽ More

    Submitted 15 October, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Conference on Decision and Control (CDC), 2024

  31. arXiv:2306.13259  [pdf, other

    cs.RO eess.SY math.OC

    Control Barrier Functions for Collision Avoidance Between Strongly Convex Regions

    Authors: Akshay Thirugnanam, Jun Zeng, Koushil Sreenath

    Abstract: In this paper, we focus on non-conservative collision avoidance between robots and obstacles with control affine dynamics and convex shapes. System safety is defined using the minimum distance between the safe regions associated with robots and obstacles. However, collision avoidance using the minimum distance as a control barrier function (CBF) can pose challenges because the minimum distance is… ▽ More

    Submitted 4 February, 2025; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 30 pages, 11 figures; submitted to SICON 2024. Refined definitions and proofs, and added extensions of results and an open-source repository

    MSC Class: 93C10 (Primary); 93D30 (Secondary)

  32. arXiv:2304.07954  [pdf, other

    cs.RO eess.SY math.OC

    Velocity Obstacle for Polytopic Collision Avoidance for Distributed Multi-robot Systems

    Authors: Jihao Huang, Jun Zeng, Xuemin Chi, Koushil Sreenath, Zhitao Liu, Hongye Su

    Abstract: Obstacle avoidance for multi-robot navigation with polytopic shapes is challenging. Existing works simplify the system dynamics or consider it as a convex or non-convex optimization problem with positive distance constraints between robots, which limits real-time performance and scalability. Additionally, generating collision-free behavior for polytopic-shaped robots is harder due to implicit and… ▽ More

    Submitted 10 June, 2024; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L) 2023, with open source repository released

  33. arXiv:2303.03381  [pdf, other

    cs.RO cs.LG

    Real-World Humanoid Locomotion with Reinforcement Learning

    Authors: Ilija Radosavovic, Tete Xiao, Bike Zhang, Trevor Darrell, Jitendra Malik, Koushil Sreenath

    Abstract: Humanoid robots that can autonomously operate in diverse environments have the potential to help address labour shortages in factories, assist elderly at homes, and colonize new planets. While classical controllers for humanoid robots have shown impressive results in a number of settings, they are challenging to generalize and adapt to new environments. Here, we present a fully learning-based appr… ▽ More

    Submitted 14 December, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Project page: https://learning-humanoid-locomotion.github.io

  34. arXiv:2302.14246  [pdf, other

    eess.SY cs.RO math.OC

    i2LQR: Iterative LQR for Iterative Tasks in Dynamic Environments

    Authors: Yifan Zeng, Suiyi He, Han Hoang Nguyen, Yihan Li, Zhongyu Li, Koushil Sreenath, Jun Zeng

    Abstract: This work introduces a novel control strategy called Iterative Linear Quadratic Regulator for Iterative Tasks (i2LQR), which aims to improve closed-loop performance with local trajectory optimization for iterative tasks in a dynamic environment. The proposed algorithm is reference-free and utilizes historical data from previous iterations to enhance the performance of the autonomous system. Unlike… ▽ More

    Submitted 6 September, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted by 2023 62nd IEEE Conference on Decision and Control (CDC)

  35. arXiv:2302.09450  [pdf, other

    cs.RO cs.AI eess.SY

    Robust and Versatile Bipedal Jumping Control through Reinforcement Learning

    Authors: Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: This work aims to push the limits of agility for bipedal robots by enabling a torque-controlled bipedal robot to perform robust and versatile dynamic jumps in the real world. We present a reinforcement learning framework for training a robot to accomplish a large variety of jumping tasks, such as jumping to different locations and directions. To improve performance on these challenging tasks, we d… ▽ More

    Submitted 31 May, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Accepted in Robotics: Science and Systems 2023 (RSS 2023). The accompanying video is at https://youtu.be/aAPSZ2QFB-E

  36. arXiv:2301.12012  [pdf, other

    cs.RO cs.LG eess.SY

    In-Distribution Barrier Functions: Self-Supervised Policy Filters that Avoid Out-of-Distribution States

    Authors: Fernando CastaƱeda, Haruki Nishimura, Rowan McAllister, Koushil Sreenath, Adrien Gaidon

    Abstract: Learning-based control approaches have shown great promise in performing complex tasks directly from high-dimensional perception data for real robotic systems. Nonetheless, the learned controllers can behave unexpectedly if the trajectories of the system divert from the training data distribution, which can compromise safety. In this work, we propose a control filter that wraps any reference polic… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  37. arXiv:2212.14199  [pdf, other

    cs.RO

    Walking in Narrow Spaces: Safety-critical Locomotion Control for Quadrupedal Robots with Duality-based Optimization

    Authors: Qiayuan Liao, Zhongyu Li, Akshay Thirugnanam, Jun Zeng, Koushil Sreenath

    Abstract: This paper presents a safety-critical locomotion control framework for quadrupedal robots. Our goal is to enable quadrupedal robots to safely navigate in cluttered environments. To tackle this, we introduce exponential Discrete Control Barrier Functions (exponential DCBFs) with duality-based obstacle avoidance constraints into a Nonlinear Model Predictive Control (NMPC) with Whole-Body Control (WB… ▽ More

    Submitted 9 August, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: Accepted to International Conference on Intelligent Robots and Systems (IROS) 2023

  38. arXiv:2210.04435  [pdf, other

    cs.RO cs.AI eess.SY

    Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning

    Authors: Xiaoyu Huang, Zhongyu Li, Yanzhen Xiang, Yiming Ni, Yufeng Chi, Yunhao Li, Lizhi Yang, Xue Bin Peng, Koushil Sreenath

    Abstract: We present a reinforcement learning (RL) framework that enables quadrupedal robots to perform soccer goalkeeping tasks in the real world. Soccer goalkeeping using quadrupeds is a challenging problem, that combines highly dynamic locomotion with precise and fast non-prehensile object (ball) manipulation. The robot needs to react to and intercept a potentially flying ball using dynamic locomotion ma… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally. Accompanying video is at https://youtu.be/iX6OgG67-ZQ

  39. arXiv:2210.04361  [pdf, other

    math.OC cs.RO eess.SY math.DS

    Iterative Convex Optimization for Model Predictive Control with Discrete-Time High-Order Control Barrier Functions

    Authors: Shuo Liu, Jun Zeng, Koushil Sreenath, Calin A. Belta

    Abstract: Safety is one of the fundamental challenges in control theory. Recently, multi-step optimal control problems for discrete-time dynamical systems were formulated to enforce stability, while subject to input constraints as well as safety-critical requirements using discrete-time control barrier functions within a model predictive control (MPC) framework. Existing work usually focus on the feasibilit… ▽ More

    Submitted 13 July, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: The open source code is added and the paper is accepted to American Control Conference (ACC) 2023 (8 pages)

  40. arXiv:2209.05309  [pdf, other

    cs.RO cs.LG

    GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots

    Authors: Gilbert Feng, Hongbo Zhang, Zhongyu Li, Xue Bin Peng, Bhuvan Basireddy, Linzhu Yue, Zhitao Song, Lizhi Yang, Yunhui Liu, Koushil Sreenath, Sergey Levine

    Abstract: Recent years have seen a surge in commercially-available and affordable quadrupedal robots, with many of these platforms being actively used in research and industry. As the availability of legged robots grows, so does the need for controllers that enable these robots to perform useful skills. However, most learning-based frameworks for controller development focus on training robot-specific contr… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: First two authors contributed equally

  41. arXiv:2208.10733  [pdf, other

    eess.SY cs.LG math.OC

    Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions

    Authors: Fernando CastaƱeda, Jason J. Choi, Wonsuhk Jung, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: Learning-based control has recently shown great efficacy in performing complex tasks for various applications. However, to deploy it in real systems, it is of vital importance to guarantee the system will stay safe. Control Barrier Functions (CBFs) offer mathematical tools for designing safety-preserving controllers for systems with known dynamics. In this article, we first introduce a model-uncer… ▽ More

    Submitted 3 September, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Journal article. Includes the results of the 2021 CDC paper titled "Pointwise feasibility of gaussian process-based safety-critical control under model uncertainty" and proposes a recursively feasible safe online learning algorithm as new contribution

  42. arXiv:2208.06721  [pdf, other

    cs.RO eess.SY

    Lyapunov Design for Robust and Efficient Robotic Reinforcement Learning

    Authors: Tyler Westenbroek, Fernando Castaneda, Ayush Agrawal, Shankar Sastry, Koushil Sreenath

    Abstract: Recent advances in the reinforcement learning (RL) literature have enabled roboticists to automatically train complex policies in simulated environments. However, due to the poor sample complexity of these methods, solving RL problems using real-world data remains a challenging problem. This paper introduces a novel cost-shaping method which aims to reduce the number of samples needed to learn a s… ▽ More

    Submitted 17 November, 2022; v1 submitted 13 August, 2022; originally announced August 2022.

  43. arXiv:2208.01160  [pdf, other

    cs.RO cs.AI eess.SY

    Hierarchical Reinforcement Learning for Precise Soccer Shooting Skills using a Quadrupedal Robot

    Authors: Yandong Ji, Zhongyu Li, Yinan Sun, Xue Bin Peng, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: We address the problem of enabling quadrupedal robots to perform precise shooting skills in the real world using reinforcement learning. Developing algorithms to enable a legged robot to shoot a soccer ball to a given target is a challenging problem that combines robot motion control and planning into one task. To solve this problem, we need to consider the dynamics limitation and motion stability… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted to 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  44. arXiv:2206.14424  [pdf, other

    cs.RO cs.AI eess.SY

    Collaborative Navigation and Manipulation of a Cable-towed Load by Multiple Quadrupedal Robots

    Authors: Chenyu Yang, Guo Ning Sue, Zhongyu Li, Lizhi Yang, Haotian Shen, Yufeng Chi, Akshara Rai, Jun Zeng, Koushil Sreenath

    Abstract: This paper tackles the problem of robots collaboratively towing a load with cables to a specified goal location while avoiding collisions in real time. The introduction of cables (as opposed to rigid links) enables the robotic team to travel through narrow spaces by changing its intrinsic dimensions through slack/taut switches of the cable. However, this is a challenging problem because of the hyb… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Extended version of the manuscript accepted to IEEE Robotics and Automation Letters (RA-L) 2022

  45. arXiv:2205.15299  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Adapting Rapid Motor Adaptation for Bipedal Robots

    Authors: Ashish Kumar, Zhongyu Li, Jun Zeng, Deepak Pathak, Koushil Sreenath, Jitendra Malik

    Abstract: Recent advances in legged locomotion have enabled quadrupeds to walk on challenging terrains. However, bipedal robots are inherently more unstable and hence it's harder to design walking controllers for them. In this work, we leverage recent advances in rapid adaptation for locomotion control, and extend them to work on bipedal robots. Similar to existing works, we start with a base policy which p… ▽ More

    Submitted 6 September, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: First two authors contributed equally. Website at https://ashish-kmr.github.io/a-rma/

  46. arXiv:2205.05787  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Bridging Model-based Safety and Model-free Reinforcement Learning through System Identification of Low Dimensional Linear Models

    Authors: Zhongyu Li, Jun Zeng, Akshay Thirugnanam, Koushil Sreenath

    Abstract: Bridging model-based safety and model-free reinforcement learning (RL) for dynamic robots is appealing since model-based methods are able to provide formal safety guarantees, while RL-based methods are able to exploit the robot agility by learning from the full-order system dynamics. However, current approaches to tackle this problem are mostly restricted to simple systems. In this paper, we propo… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Accepted in Proceedings of Robotics: Science and Systems 2022 (RSS 2022)

  47. arXiv:2204.03134  [pdf, other

    cs.RO

    Perception-aware receding horizon trajectory planning for multicopters with visual-inertial odometry

    Authors: Xiangyu Wu, Shuxiao Chen, Koushil Sreenath, Mark W. Mueller

    Abstract: Visual inertial odometry (VIO) is widely used for the state estimation of multicopters, but it may function poorly in environments with few visual features or in overly aggressive flights. In this work, we propose a perception-aware collision avoidance trajectory planner for multicopters, that may be used with any feature-based VIO algorithm. Our approach is able to fly the vehicle to a goal posit… ▽ More

    Submitted 1 August, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 12 pages

  48. arXiv:2203.08180  [pdf, other

    eess.SY cs.RO

    Tethered Power for a Series of Quadcopters: Analysis and Applications

    Authors: Karan P. Jain, Prasanth Kotaru, Massimiliano de Sa, Koushil Sreenath, Mark W. Mueller

    Abstract: Tethered quadcopters are used for extended flight operations where the power to the system is provided via a tether connected to an external power source. In this work, we consider a system of multiple quadcopters powered by a single tether. We study the design factors that influence the power requirements, such as the electrical resistance of the tether, input voltage, and quadcopters' positions.… ▽ More

    Submitted 26 September, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Submitted to ICRA 2023

  49. arXiv:2203.05194  [pdf, other

    cs.RO cs.AI eess.SY

    Learning Torque Control for Quadrupedal Locomotion

    Authors: Shuxiao Chen, Bike Zhang, Mark W. Mueller, Akshara Rai, Koushil Sreenath

    Abstract: Reinforcement learning (RL) has become a promising approach to developing controllers for quadrupedal robots. Conventionally, an RL design for locomotion follows a position-based paradigm, wherein an RL policy outputs target joint positions at a low frequency that are then tracked by a high-frequency proportional-derivative (PD) controller to produce joint torques. In contrast, for the model-based… ▽ More

    Submitted 12 March, 2023; v1 submitted 10 March, 2022; originally announced March 2022.

  50. arXiv:2203.02570  [pdf, other

    cs.RO cs.AI eess.SY

    Bayesian Optimization Meets Hybrid Zero Dynamics: Safe Parameter Learning for Bipedal Locomotion Control

    Authors: Lizhi Yang, Zhongyu Li, Jun Zeng, Koushil Sreenath

    Abstract: In this paper, we propose a multi-domain control parameter learning framework that combines Bayesian Optimization (BO) and Hybrid Zero Dynamics (HZD) for locomotion control of bipedal robots. We leverage BO to learn the control parameters used in the HZD-based controller. The learning process is firstly deployed in simulation to optimize different control parameters for a large repertoire of gaits… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted to 2022 International Conference on Robotics and Automation (ICRA 2022)