Showing 1–30 of 30 results for author: Van de Panne, M

Searching in archive cs.
  1. arXiv:2502.19564  [pdf, other]

    cs.RO cs.LG

    Diffusion-based Planning with Learned Viability Filters

    Authors: Nicholas Ioannidis, Daniele Reda, Setareh Cohan, Michiel van de Panne

    Abstract: Diffusion models can be used as a motion planner by sampling from a distribution of possible futures. However, the samples may not satisfy hard constraints that exist only implicitly in the training data, e.g., avoiding falls or not colliding with a wall. We propose learned viability filters that efficiently predict the future success of any given plan, i.e., diffusion sample, and thereby enforce…

    Submitted 26 February, 2025; originally announced February 2025.

  2. arXiv:2410.03441  [pdf, other]

    cs.CV

    CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

    Authors: Guy Tevet, Sigal Raab, Setareh Cohan, Daniele Reda, Zhengyi Luo, Xue Bin Peng, Amit H. Bermano, Michiel van de Panne

    Abstract: Motion diffusion models and Reinforcement Learning (RL) based control for physics-based simulations have complementary strengths for human motion generation. The former is capable of generating a wide variety of motions, adhering to intuitive control such as text, while the latter offers physically plausible motion and direct interaction with the environment. In this work, we present a method that…

    Submitted 4 October, 2024; originally announced October 2024.

  3. arXiv:2406.01152  [pdf, other]

    cs.RO

    Learning-based legged locomotion; state of the art and future perspectives

    Authors: Sehoon Ha, Joonho Lee, Michiel van de Panne, Zhaoming Xie, Wenhao Yu, Majid Khadiv

    Abstract: Legged locomotion holds the premise of universal mobility, a critical capability for many real-world robotic applications. Both model-based and learning-based approaches have advanced the field of legged locomotion in the past three decades. In recent years, however, a number of factors have dramatically accelerated progress in learning-based methods, including the rise of deep learning, rapid pro…

    Submitted 22 November, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.11126  [pdf, other]

    cs.CV cs.GR cs.LG

    Flexible Motion In-betweening with Diffusion Models

    Authors: Setareh Cohan, Guy Tevet, Daniele Reda, Xue Bin Peng, Michiel van de Panne

    Abstract: Motion in-betweening, a fundamental task in character animation, consists of generating motion sequences that plausibly interpolate user-provided keyframe constraints. It has long been recognized as a labor-intensive and challenging process. We investigate the potential of diffusion models in generating diverse human motions guided by keyframes. Unlike previous inbetweening methods, we propose a s…

    Submitted 23 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: SIGGRAPH 2024. For project page and code, see https://setarehc.github.io/CondMDI/

  5. arXiv:2307.01938  [pdf, other]

    cs.CV

    Physics-based Motion Retargeting from Sparse Inputs

    Authors: Daniele Reda, Jungdam Won, Yuting Ye, Michiel van de Panne, Alexander Winkler

    Abstract: Avatars are important to create interactive and immersive experiences in virtual worlds. One challenge in animating these characters to mimic a user's motion is that commercial AR/VR products consist only of a headset and controllers, providing very limited sensor data of the user's pose. Another challenge is that an avatar might have a different skeleton structure than a human and the mapping bet…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: More info at: https://www.cs.ubc.ca/~dreda/retargeting.html

  6. arXiv:2306.09532  [pdf, other]

    cs.RO cs.GR

    Hierarchical Planning and Control for Box Loco-Manipulation

    Authors: Zhaoming Xie, Jonathan Tseng, Sebastian Starke, Michiel van de Panne, C. Karen Liu

    Abstract: Humans perform everyday tasks using a combination of locomotion and manipulation skills. Building a system that can handle both skills is essential to creating virtual humans. We present a physically-simulated human capable of solving box rearrangement tasks, which requires a combination of both skills. We propose a hierarchical control architecture, where each level solves the task at a different…

    Submitted 8 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  7. arXiv:2210.13611  [pdf, other]

    cs.LG cs.AI

    Understanding the Evolution of Linear Regions in Deep Reinforcement Learning

    Authors: Setareh Cohan, Nam Hee Kim, David Rolnick, Michiel van de Panne

    Abstract: Policies produced by deep reinforcement learning are typically characterised by their learning curves, but they remain poorly understood in many other respects. ReLU-based policies result in a partitioning of the input space into piecewise linear regions. We seek to understand how observed region counts and their densities evolve during deep reinforcement learning using empirical results that sp…

    Submitted 7 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 camera ready

  8. arXiv:2210.01247  [pdf, other]

    cs.RO

    OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors

    Authors: Yuni Fuchioka, Zhaoming Xie, Michiel van de Panne

    Abstract: Reinforcement Learning (RL) has seen many recent successes for quadruped robot control. The imitation of reference motions provides a simple and powerful prior for guiding solutions towards desired solutions without the need for meticulous reward design. While much work uses motion capture data or hand-crafted trajectories as the reference motion, relatively little work has explored the use of ref…

    Submitted 23 March, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  9. Learning to Brachiate via Simplified Model Imitation

    Authors: Daniele Reda, Hung Yu Ling, Michiel van de Panne

    Abstract: Brachiation is the primary form of locomotion for gibbons and siamangs, in which these primates swing from tree limb to tree limb using only their arms. It is challenging to control because of the limited control authority, the required advance planning, and the precision of the required grasps. We present a novel approach to this problem using reinforcement learning, and as demonstrated on a fing…

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: 11 pages, 6 figures. Accepted at SIGGRAPH 2022. For videos, supplementary material and code, visit the following URL https://brachiation-rl.github.io/brachiation

  10. arXiv:2205.00307  [pdf, other]

    cs.GR cs.LG cs.RO

    Learning to Get Up

    Authors: Tianxin Tao, Matthew Wilson, Ruiyu Gou, Michiel van de Panne

    Abstract: Getting up from an arbitrary fallen state is a basic human skill. Existing methods for learning this skill often generate highly dynamic and erratic get-up motions, which do not resemble human get-up strategies, or are based on tracking recorded human get-up motions. In this paper, we present a staged approach using reinforcement learning, without recourse to motion capture data. The method first…

    Submitted 27 August, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

    Comments: SIGGRAPH 2022. Project page: https://tianxintao.github.io/get_up_control/

  11. arXiv:2204.04905  [pdf, other]

    cs.LG cs.CV cs.RO

    Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels

    Authors: Tianxin Tao, Daniele Reda, Michiel van de Panne

    Abstract: Vision Transformers (ViT) have recently demonstrated the significant potential of transformer architectures for computer vision. To what extent can image-based deep reinforcement learning also benefit from ViT architectures, as compared to standard convolutional neural network (CNN) architectures? To answer this question, we evaluate ViT training methods for image-based reinforcement learning (RL)…

    Submitted 15 May, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  12. A Survey on Reinforcement Learning Methods in Character Animation

    Authors: Ariel Kwiatkowski, Eduardo Alvarado, Vicky Kalogeiton, C. Karen Liu, Julien Pettré, Michiel van de Panne, Marie-Paule Cani

    Abstract: Reinforcement Learning is an area of Machine Learning focused on how agents can be trained to make sequential decisions, and achieve a particular goal within an arbitrary environment. While learning, they repeatedly take actions based on their observation of the environment, and receive appropriate rewards which define the objective. This experience is then used to progressively improve the policy…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 27 pages, 6 figures, Eurographics STAR, Computer Graphics Forum

  13. arXiv:2203.02574  [pdf, other]

    cs.CV cs.LG

    Style-ERD: Responsive and Coherent Online Motion Style Transfer

    Authors: Tianxin Tao, Xiaohang Zhan, Zhongquan Chen, Michiel van de Panne

    Abstract: Motion style transfer is a common method for enriching character animation. Motion style transfer algorithms are often designed for offline settings where motions are processed in segments. However, for online animation applications, such as realtime avatar animation from motion capture, motions need to be processed as a stream with minimal latency. In this work, we realize a flexible, high-qualit…

    Submitted 28 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: CVPR 2022, project page: https://tianxintao.github.io/Online-Motion-Style-Transfer

  14. arXiv:2202.02693  [pdf, other]

    cs.LG cs.AI

    Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning

    Authors: Michael Teng, Michiel van de Panne, Frank Wood

    Abstract: Distributional reinforcement learning (RL) aims to learn a value-network that predicts the full distribution of the returns for a given state, often modeled via a quantile-based critic. This approach has been successfully integrated into common RL methods for continuous control, giving rise to algorithms such as Distributional Soft Actor-Critic (DSAC). In this paper, we introduce multi-sample targ…

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: Submitted to ICML 2022

  15. arXiv:2110.15245  [pdf, ps, other]

    cs.RO cs.LG

    From Machine Learning to Robotics: Challenges and Opportunities for Embodied Intelligence

    Authors: Nicholas Roy, Ingmar Posner, Tim Barfoot, Philippe Beaudoin, Yoshua Bengio, Jeannette Bohg, Oliver Brock, Isabelle Depatie, Dieter Fox, Dan Koditschek, Tomas Lozano-Perez, Vikash Mansinghka, Christopher Pal, Blake Richards, Dorsa Sadigh, Stefan Schaal, Gaurav Sukhatme, Denis Therien, Marc Toussaint, Michiel Van de Panne

    Abstract: Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains. Consequently, the notion of applying learning methods to a particular problem set has become an established and valuable modus operandi to advance a particular field. In this article we argue that such an approach does not straightforwardly extend to robotics -- or to…

    Submitted 28 October, 2021; originally announced October 2021.

  16. arXiv:2105.00371  [pdf, other]

    cs.LG cs.GR cs.RO

    Discovering Diverse Athletic Jumping Strategies

    Authors: Zhiqi Yin, Zeshi Yang, Michiel van de Panne, KangKang Yin

    Abstract: We present a framework that enables the discovery of diverse and natural-looking motion strategies for athletic skills such as the high jump. The strategies are realized as control policies for physics-based characters. Given a task objective and an initial character configuration, the combination of physics simulation and deep reinforcement learning (DRL) provides a suitable starting point for au…

    Submitted 1 May, 2021; originally announced May 2021.

    Comments: 17 pages; SIGGRAPH 2021

    Journal ref: ACM Trans. Graph. 40, 4, Article 91 (August 2021), 17 pages (2021)

  17. arXiv:2104.09771  [pdf, other]

    cs.RO cs.LG

    GLiDE: Generalizable Quadrupedal Locomotion in Diverse Environments with a Centroidal Model

    Authors: Zhaoming Xie, Xingye Da, Buck Babich, Animesh Garg, Michiel van de Panne

    Abstract: Model-free reinforcement learning (RL) for legged locomotion commonly relies on a physics simulator that can accurately predict the behaviors of every degree of freedom of the robot. In contrast, approximate reduced-order models are commonly used for many model predictive control strategies. In this work we abandon the conventional use of high-fidelity dynamics models in RL and we instead seek to…

    Submitted 15 February, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: video: https://youtu.be/m4scMY8LcmQ

  18. Character Controllers Using Motion VAEs

    Authors: Hung Yu Ling, Fabio Zinno, George Cheng, Michiel van de Panne

    Abstract: A fundamental problem in computer animation is that of realizing purposeful and realistic human movement given a sufficiently-rich set of motion capture clips. We learn data-driven generative models of human movement using autoregressive conditional variational autoencoders, or Motion VAEs. The latent variables of the learned autoencoder define the action space for the movement and thereby govern…

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Project page: https://www.cs.ubc.ca/~hyuling/projects/mvae/ ; Code: https://github.com/electronicarts/character-motion-vaes

  19. arXiv:2011.02404  [pdf, other]

    cs.RO

    Dynamics Randomization Revisited: A Case Study for Quadrupedal Locomotion

    Authors: Zhaoming Xie, Xingye Da, Michiel van de Panne, Buck Babich, Animesh Garg

    Abstract: Understanding the gap between simulation and reality is critical for reinforcement learning with legged robots, which are largely trained in simulation. However, recent work has resulted in sometimes conflicting conclusions with regard to which factors are important for success, including the role of dynamics randomization. In this paper, we aim to provide clarity and understanding on the role of…

    Submitted 24 March, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

  20. Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning

    Authors: Daniele Reda, Tianxin Tao, Michiel van de Panne

    Abstract: Learning to locomote is one of the most common tasks in physics-based animation and deep reinforcement learning (RL). A learned policy is the product of the problem to be solved, as embodied by the RL environment, and the RL algorithm. While enormous attention has been devoted to RL algorithms, much less is known about the impact of design choices for the RL environment. In this paper, we show tha…

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Presented at The 13th Annual ACM SIGGRAPH Conference on Motion, Interaction and Games

  21. arXiv:2009.10337  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Learning Task-Agnostic Action Spaces for Movement Optimization

    Authors: Amin Babadi, Michiel van de Panne, C. Karen Liu, Perttu Hämäläinen

    Abstract: We propose a novel method for exploring the dynamics of physically based animated characters, and learning a task-agnostic action space that makes movement optimization easier. Like several previous papers, we parameterize actions as target states, and learn a short-horizon goal-conditioned low-level control policy that drives the agent's state towards the targets. Our novel contribution is that w…

    Submitted 23 July, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: Accepted as a regular paper by IEEE Transactions on Visualization and Computer Graphics (TVCG) in July 2021

  22. arXiv:2005.04323  [pdf, other]

    cs.GR cs.LG cs.RO

    ALLSTEPS: Curriculum-driven Learning of Stepping Stone Skills

    Authors: Zhaoming Xie, Hung Yu Ling, Nam Hee Kim, Michiel van de Panne

    Abstract: Humans are highly adept at walking in environments with foot placement constraints, including stepping-stone scenarios where the footstep locations are fully constrained. Finding good solutions to stepping-stone locomotion is a longstanding and fundamental challenge for animation and robotics. We present fully learned solutions to this difficult problem using reinforcement learning. We demonstrate…

    Submitted 29 August, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

  23. arXiv:1912.03015  [pdf, other]

    cs.LG cs.RO stat.ML

    Learning to Correspond Dynamical Systems

    Authors: Nam Hee Kim, Zhaoming Xie, Michiel van de Panne

    Abstract: Many dynamical systems exhibit similar structure, as often captured by hand-designed simplified models that can be used for analysis and control. We develop a method for learning to correspond pairs of dynamical systems via a learned latent dynamical system. Given trajectory data from two dynamical systems, we learn a shared latent state space and a shared latent dynamics model, along with an enco…

    Submitted 4 June, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  24. arXiv:1903.09537  [pdf, other]

    cs.RO

    Iterative Reinforcement Learning Based Design of Dynamic Locomotion Skills for Cassie

    Authors: Zhaoming Xie, Patrick Clary, Jeremy Dao, Pedro Morais, Jonathan Hurst, Michiel van de Panne

    Abstract: Deep reinforcement learning (DRL) is a promising approach for developing legged locomotion skills. However, the iterative design process that is inevitable in practice is poorly supported by the default methodology. It is difficult to predict the outcomes of changes made to the reward functions, policy architectures, and the set of tasks being trained on. In this paper, we propose a practical meth…

    Submitted 22 March, 2019; originally announced March 2019.

  25. arXiv:1804.06424  [pdf, other]

    cs.AI cs.RO

    Terrain RL Simulator

    Authors: Glen Berseth, Xue Bin Peng, Michiel van de Panne

    Abstract: We provide $89$ challenging simulation environments that range in difficulty. The difficulty of solving a task is linked not only to the number of dimensions in the action space but also to the size and shape of the distribution of configurations the agent experiences. Therefore, we are releasing a number of simulation environments that include randomly generated terrain. The library also provides…

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: 10 pages

  26. arXiv:1804.02717  [pdf, other]

    cs.GR cs.AI cs.LG

    DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills

    Authors: Xue Bin Peng, Pieter Abbeel, Sergey Levine, Michiel van de Panne

    Abstract: A longstanding goal in character animation is to combine data-driven specification of behavior with a system that can execute a similar behavior in a physical simulation, thus enabling realistic responses to perturbations and environmental variation. We show that well-known reinforcement learning (RL) methods can be adapted to learn robust control policies capable of imitating a broad range of exa…

    Submitted 26 July, 2018; v1 submitted 8 April, 2018; originally announced April 2018.

  27. arXiv:1803.05580  [pdf, other]

    cs.RO

    Feedback Control For Cassie With Deep Reinforcement Learning

    Authors: Zhaoming Xie, Glen Berseth, Patrick Clary, Jonathan Hurst, Michiel van de Panne

    Abstract: Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully aware of many details, including torque limits, joint limits, and other non-linearities that are necessarily excluded from the co…

    Submitted 27 July, 2018; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: 6 pages, 4 figures, accepted for IROS2018

  28. arXiv:1802.04765  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Progressive Reinforcement Learning with Distillation for Multi-Skilled Motion Control

    Authors: Glen Berseth, Cheng Xie, Paul Cernek, Michiel Van de Panne

    Abstract: Deep reinforcement learning has demonstrated increasing capabilities for continuous control problems, including agents that can move with skill and agility through their environment. An open problem in this setting is that of developing good strategies for integrating or merging policies for multiple skills, where each individual skill is a specialist in a specific skill and its associated state d…

    Submitted 13 February, 2018; originally announced February 2018.

    Comments: 15 pages, Conference paper

  29. arXiv:1801.03954  [pdf, other]

    cs.AI

    Model-Based Action Exploration for Learning Dynamic Motion Skills

    Authors: Glen Berseth, Michiel van de Panne

    Abstract: Deep reinforcement learning has achieved great strides in solving challenging motion control tasks. Recently, there has been significant work on methods for exploiting the data gathered during training, but there has been less work on how to best generate the data to learn from. For continuous action domains, the most common method for generating exploratory actions involves sampling from a Gaussi…

    Submitted 11 April, 2018; v1 submitted 11 January, 2018; originally announced January 2018.

    Comments: 7 pages, 7 figures, conference paper

  30. arXiv:1611.01055  [pdf, other]

    cs.LG cs.GR cs.RO

    Learning Locomotion Skills Using DeepRL: Does the Choice of Action Space Matter?

    Authors: Xue Bin Peng, Michiel van de Panne

    Abstract: The use of deep reinforcement learning allows for high-dimensional state descriptors, but little is known about how the choice of action representation impacts the learning difficulty and the resulting performance. We compare the impact of four different action parameterizations (torques, muscle-activations, target joint angles, and target joint-angle velocities) in terms of learning time, policy…

    Submitted 3 November, 2016; originally announced November 2016.