Skip to main content

Showing 1–17 of 17 results for author: Le, A T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12095  [pdf, ps, other

    cs.RO

    DoublyAware: Dual Planning and Policy Awareness for Temporal Difference Learning in Humanoid Locomotion

    Authors: Khang Nguyen, An T. Le, Jan Peters, Minh Nhat Vu

    Abstract: Achieving robust robot learning for humanoid locomotion is a fundamental challenge in model-based reinforcement learning (MBRL), where environmental stochasticity and randomness can hinder efficient exploration and learning stability. The environmental, so-called aleatoric, uncertainty can be amplified in high-dimensional action spaces with complex contact dynamics, and further entangled with epis… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2505.13549  [pdf, ps, other

    cs.RO

    TD-GRPC: Temporal Difference Learning with Group Relative Policy Constraint for Humanoid Locomotion

    Authors: Khang Nguyen, Khai Nguyen, An T. Le, Jan Peters, Manfred Huber, Ngo Anh Vien, Minh Nhat Vu

    Abstract: Robot learning in high-dimensional control settings, such as humanoid locomotion, presents persistent challenges for reinforcement learning (RL) algorithms due to unstable dynamics, complex contact interactions, and sensitivity to distributional shifts during training. Model-based methods, \textit{e.g.}, Temporal-Difference Model Predictive Control (TD-MPC), have demonstrated promising results by… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  3. arXiv:2505.01059  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    Model Tensor Planning

    Authors: An T. Le, Khai Nguyen, Minh Nhat Vu, João Carvalho, Jan Peters

    Abstract: Sampling-based model predictive control (MPC) offers strong performance in nonlinear and contact-rich robotic tasks, yet often suffers from poor exploration due to locally greedy sampling schemes. We propose \emph{Model Tensor Planning} (MTP), a novel sampling-based MPC framework that introduces high-entropy control trajectory generation through structured tensor sampling. By sampling over randomi… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: 22 pages, 9 figures

  4. arXiv:2504.04936  [pdf, other

    cs.RO cs.LG

    Constrained Gaussian Process Motion Planning via Stein Variational Newton Inference

    Authors: Jiayun Li, Kay Pompetzki, An Thai Le, Haolei Tong, Jan Peters, Georgia Chalvatzaki

    Abstract: Gaussian Process Motion Planning (GPMP) is a widely used framework for generating smooth trajectories within a limited compute time--an essential requirement in many robotic applications. However, traditional GPMP approaches often struggle with enforcing hard nonlinear constraints and rely on Maximum a Posteriori (MAP) solutions that disregard the full Bayesian posterior. This limits planning dive… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  5. arXiv:2503.14240  [pdf, other

    cs.LG

    Persistent Homology-induced Graph Ensembles for Time Series Regressions

    Authors: Viet The Nguyen, Duy Anh Pham, An Thai Le, Jans Peter, Gunther Gust

    Abstract: The effectiveness of Spatio-temporal Graph Neural Networks (STGNNs) in time-series applications is often limited by their dependence on fixed, hand-crafted input graph structures. Motivated by insights from the Topological Data Analysis (TDA) paradigm, of which real-world data exhibits multi-scale patterns, we construct several graphs using Persistent Homology Filtration -- a mathematical framewor… ▽ More

    Submitted 19 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  6. arXiv:2503.06135  [pdf, other

    cs.RO

    FlowMP: Learning Motion Fields for Robot Planning with Conditional Flow Matching

    Authors: Khang Nguyen, An T. Le, Tien Pham, Manfred Huber, Jan Peters, Minh Nhat Vu

    Abstract: Prior flow matching methods in robotics have primarily learned velocity fields to morph one distribution of trajectories into another. In this work, we extend flow matching to capture second-order trajectory dynamics, incorporating acceleration effects either explicitly in the model or implicitly through the learning objective. Unlike diffusion models, which rely on a noisy forward process and ite… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

  7. arXiv:2412.08398  [pdf, other

    cs.RO cs.LG

    Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3

    Authors: Joao Carvalho, An T. Le, Philipp Jahr, Qiao Sun, Julen Urain, Dorothea Koert, Jan Peters

    Abstract: Grasping objects successfully from a single-view camera is crucial in many robot manipulation tasks. An approach to solve this problem is to leverage simulation to create large datasets of pairs of objects and grasp poses, and then learn a conditional generative model that can be prompted quickly during deployment. However, the grasp pose data is highly multimodal since there are several ways to g… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  8. arXiv:2411.19393  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    Global Tensor Motion Planning

    Authors: An T. Le, Kay Hansel, João Carvalho, Joe Watson, Julen Urain, Armin Biess, Georgia Chalvatzaki, Jan Peters

    Abstract: Batch planning is increasingly necessary to quickly produce diverse and quality motion plans for downstream learning applications, such as distillation and imitation learning. This paper presents Global Tensor Motion Planning (GTMP) -- a sampling-based motion planning algorithm comprising only tensor operations. We introduce a novel discretization structure represented as a random multipartite gra… ▽ More

    Submitted 29 May, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: 8 pages, 3 figures. Accepted at IEEE Robotics and Automation Letters 2025

  9. arXiv:2408.09840  [pdf, other

    cs.LG math.NA physics.comp-ph

    Machine Learning with Physics Knowledge for Prediction: A Survey

    Authors: Joe Watson, Chen Song, Oliver Weeger, Theo Gruner, An T. Le, Kay Pompetzki, Ahmed Hendawy, Oleg Arenz, Will Trojak, Miles Cranmer, Carlo D'Eramo, Fabian Bülow, Tanmay Goyal, Jan Peters, Martin W. Hoffman

    Abstract: This survey examines the broad suite of methods and models for combining machine learning with physics knowledge for prediction and forecast, with a focus on partial differential equations. These methods have attracted significant interest due to their potential impact on advancing scientific research and industrial practices by improving predictive models with small- or large-scale datasets and e… ▽ More

    Submitted 15 May, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: 61 pages, 8 figures, 2 tables. Accepted at the Transactions of Machine Learning Research (TMLR)

  10. arXiv:2407.04489  [pdf, other

    cs.CV

    Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

    Authors: Duy M. H. Nguyen, An T. Le, Trung Q. Nguyen, Nghiem T. Diep, Tai Nguyen, Duy Duong-Tran, Jan Peters, Li Shen, Mathias Niepert, Daniel Sonntag

    Abstract: Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we c… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Version 1

  11. arXiv:2402.01975  [pdf, other

    cs.LG

    Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks

    Authors: Duy M. H. Nguyen, Nina Lukashina, Tai Nguyen, An T. Le, TrungTin Nguyen, Nhat Ho, Jan Peters, Daniel Sonntag, Viktor Zaverkin, Mathias Niepert

    Abstract: A molecule's 2D representation consists of its atoms, their attributes, and the molecule's covalent bonds. A 3D (geometric) representation of a molecule is called a conformer and consists of its atom types and Cartesian coordinates. Every conformer has a potential energy, and the lower this energy, the more likely it occurs in nature. Most existing machine learning methods for molecular property p… ▽ More

    Submitted 19 August, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024 (updated version)

  12. arXiv:2309.15970  [pdf, other

    cs.RO math.OC

    Accelerating Motion Planning via Optimal Transport

    Authors: An T. Le, Georgia Chalvatzaki, Armin Biess, Jan Peters

    Abstract: Motion planning is still an open problem for many disciplines, e.g., robotics, autonomous driving, due to their need for high computational resources that hinder real-time, efficient decision-making. A class of methods striving to provide smooth solutions is gradient-based trajectory optimization. However, those methods usually suffer from bad local minima, while for many settings, they may be ina… ▽ More

    Submitted 28 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Published as a conference paper at NeurIPS 2023. Project website: https://sites.google.com/view/sinkhorn-step/

  13. arXiv:2308.01557  [pdf, other

    cs.RO cs.AI cs.LG

    Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models

    Authors: Joao Carvalho, An T. Le, Mark Baierl, Dorothea Koert, Jan Peters

    Abstract: Learning priors on trajectory distributions can help accelerate robot motion planning optimization. Given previously successful plans, learning trajectory generative models as priors for a new planning problem is highly desirable. Prior works propose several ways on utilizing this prior to bootstrapping the motion planning problem. Either sampling the prior for initializations or using the prior d… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

  14. arXiv:2212.01938  [pdf, other

    cs.RO cs.LG eess.SY

    Hierarchical Policy Blending As Optimal Transport

    Authors: An T. Le, Kay Hansel, Jan Peters, Georgia Chalvatzaki

    Abstract: We present hierarchical policy blending as optimal transport (HiPBOT). HiPBOT hierarchically adjusts the weights of low-level reactive expert policies of different agents by adding a look-ahead planning layer on the parameter space. The high-level planner renders policy blending as unbalanced optimal transport consolidating the scaling of the underlying Riemannian motion policies. As a result, HiP… ▽ More

    Submitted 12 April, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: 16 pages, 5 figures, accepted to the 5th Annual Learning for Dynamics & Control Conference (L4DC)

  15. Learning Implicit Priors for Motion Optimization

    Authors: Julen Urain, An T. Le, Alexander Lambert, Georgia Chalvatzaki, Byron Boots, Jan Peters

    Abstract: In this paper, we focus on the problem of integrating Energy-based Models (EBM) as guiding priors for motion optimization. EBMs are a set of neural networks that can represent expressive probability density distributions in terms of a Gibbs distribution parameterized by a suitable energy function. Due to their implicit nature, they can easily be integrated as optimization factors or as initial sam… ▽ More

    Submitted 11 January, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: 17 pages, accepted at IEEE/RSJ IROS 2022, paper website: https://sites.google.com/view/implicit-priors/home

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 7672-7679

  16. arXiv:2109.04222  [pdf, other

    cs.RO

    Learning Forceful Manipulation Skills from Multi-modal Human Demonstrations

    Authors: An T. Le, Meng Guo, Niels van Duijkeren, Leonel Rozo, Robert Krug, Andras G. Kupcsik, Mathias Buerger

    Abstract: Learning from Demonstration (LfD) provides an intuitive and fast approach to program robotic manipulators. Task parameterized representations allow easy adaptation to new scenes and online observations. However, this approach has been limited to pose-only demonstrations and thus only skills with spatial and temporal features. In this work, we extend the LfD framework to address forceful manipulati… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

  17. arXiv:2104.08137  [pdf, other

    cs.RO

    Hierarchical Human-Motion Prediction and Logic-Geometric Programming for Minimal Interference Human-Robot Tasks

    Authors: An T. Le, Philipp Kratzer, Simon Hagenmayer, Marc Toussaint, Jim Mainprice

    Abstract: In this paper, we tackle the problem of human-robot coordination in sequences of manipulation tasks. Our approach integrates hierarchical human motion prediction with Task and Motion Planning (TAMP). We first devise a hierarchical motion prediction approach by combining Inverse Reinforcement Learning and short-term motion prediction using a Recurrent Neural Network. In a second step, we propose a… ▽ More

    Submitted 5 July, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: 8 pages, accepted to IEEE-ROMAN 2021