Skip to main content

Showing 1–11 of 11 results for author: Chuck, C

.
  1. arXiv:2505.03172  [pdf, ps, other

    cs.LG cs.AI

    Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning

    Authors: Caleb Chuck, Fan Feng, Carl Qi, Chang Shi, Siddhant Agarwal, Amy Zhang, Scott Niekum

    Abstract: Hindsight relabeling is a powerful tool for overcoming sparsity in goal-conditioned reinforcement learning (GCRL), especially in certain domains such as navigation and locomotion. However, hindsight relabeling can struggle in object-centric domains. For example, suppose that the goal space consists of a robotic arm pushing a particular target block to a goal location. In this case, hindsight relab… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Published at ICLR 2025

    Journal ref: The Thirteenth International Conference on Learning Representations. 2025

  2. arXiv:2412.05718  [pdf, other

    cs.AI cs.GR cs.LG cs.RO

    RLZero: Direct Policy Inference from Language Without In-Domain Supervision

    Authors: Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum

    Abstract: The reward hypothesis states that all goals and purposes can be understood as the maximization of a received scalar reward signal. However, in practice, defining such a reward signal is notoriously difficult, as humans are often unable to predict the optimal behavior corresponding to a reward function. Natural language offers an intuitive alternative for instructing reinforcement learning (RL) age… ▽ More

    Submitted 1 June, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

    Comments: 26 pages

  3. arXiv:2410.18416  [pdf, other

    cs.LG cs.RO

    SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions

    Authors: Zizhao Wang, Jiaheng Hu, Caleb Chuck, Stephen Chen, Roberto Martín-Martín, Amy Zhang, Scott Niekum, Peter Stone

    Abstract: Unsupervised skill discovery carries the promise that an intelligent agent can learn reusable skills through autonomous, reward-free environment interaction. Existing unsupervised skill discovery methods learn skills by encouraging distinguishable behaviors that cover diverse states. However, in complex environments with many state factors (e.g., household environments with many objects), learning… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  4. arXiv:2406.08805  [pdf, other

    cs.LG cs.AI cs.RO

    A Dual Approach to Imitation Learning from Observations with Offline Datasets

    Authors: Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum

    Abstract: Demonstrations are an effective alternative to task specification for learning agents in settings where designing a reward function is difficult. However, demonstrating expert behavior in the action space of the agent becomes unwieldy when robots have complex, unintuitive morphologies. We consider the practical setting where an agent has a dataset of prior interactions with the environment and is… ▽ More

    Submitted 19 September, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 8th Conference on Robot Learning (CoRL 2024), Munich, Germany. 23 pages

  5. arXiv:2405.03113  [pdf, other

    cs.RO cs.AI

    Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

    Authors: Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum

    Abstract: Reinforcement Learning is a promising tool for learning complex policies even in fast-moving and object-interactive domains where human teleoperation or hard-coded policies might fail. To effectively reflect this challenging category of tasks, we introduce a dynamic, interactive RL testbed based on robot air hockey. By augmenting air hockey with a large family of tasks ranging from easy tasks like… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  6. arXiv:2404.10883  [pdf, other

    cs.AI cs.LG stat.ME

    Automated Discovery of Functional Actual Causes in Complex Environments

    Authors: Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum

    Abstract: Reinforcement learning (RL) algorithms often struggle to learn policies that generalize to novel situations due to issues such as causal confusion, overfitting to irrelevant factors, and failure to isolate control of state factors. These issues stem from a common source: a failure to accurately identify and exploit state-specific causal relationships in the environment. While some prior works in R… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  7. arXiv:2403.16369  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Action-based Representations Using Invariance

    Authors: Max Rudolph, Caleb Chuck, Kevin Black, Misha Lvovsky, Scott Niekum, Amy Zhang

    Abstract: Robust reinforcement learning agents using high-dimensional observations must be able to identify relevant state features amidst many exogeneous distractors. A representation that captures controllability identifies these state elements by determining what affects agent control. While methods such as inverse dynamics and mutual information capture controllability for a limited number of timesteps,… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Published at the Reinforcement Learning Conference 2024

  8. arXiv:2306.09509  [pdf, other

    cs.AI cs.RO

    Granger Causal Interaction Skill Chains

    Authors: Caleb Chuck, Kevin Black, Aditya Arjun, Yuke Zhu, Scott Niekum

    Abstract: Reinforcement Learning (RL) has demonstrated promising results in learning policies for complex tasks, but it often suffers from low sample efficiency and limited transferability. Hierarchical RL (HRL) methods aim to address the difficulty of learning long-horizon tasks by decomposing policies into skills, abstracting states, and reusing skills in new tasks. However, many HRL methods require some… ▽ More

    Submitted 19 October, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted TMLR 2024

  9. arXiv:2008.10518  [pdf, other

    cs.RO cs.AI

    ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory

    Authors: Ajinkya Jain, Rudolf Lioutikov, Caleb Chuck, Scott Niekum

    Abstract: Robots in human environments will need to interact with a wide variety of articulated objects such as cabinets, drawers, and dishwashers while assisting humans in performing day-to-day tasks. Existing methods either require objects to be textured or need to know the articulation model category a priori for estimating the model parameters for an articulated object. We propose ScrewNet, a novel appr… ▽ More

    Submitted 19 July, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Presented at ICRA'21. Project webpage: https://pearl-utexas.github.io/ScrewNet/

  10. arXiv:1906.01408  [pdf, other

    cs.LG cs.AI stat.ML

    Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning

    Authors: Caleb Chuck, Supawit Chockchowwat, Scott Niekum

    Abstract: Deep reinforcement learning (DRL) is capable of learning high-performing policies on a variety of complex high-dimensional tasks, ranging from video games to robotic manipulation. However, standard DRL methods often suffer from poor sample efficiency, partially because they aim to be entirely problem-agnostic. In this work, we introduce a novel approach to exploration and hierarchical skill learni… ▽ More

    Submitted 3 March, 2020; v1 submitted 27 May, 2019; originally announced June 2019.

    Comments: Submitted to IROS 2020

  11. arXiv:1610.00850  [pdf, other

    cs.RO cs.LG

    Comparing Human-Centric and Robot-Centric Sampling for Robot Deep Learning from Demonstrations

    Authors: Michael Laskey, Caleb Chuck, Jonathan Lee, Jeffrey Mahler, Sanjay Krishnan, Kevin Jamieson, Anca Dragan, Ken Goldberg

    Abstract: Motivated by recent advances in Deep Learning for robot control, this paper considers two learning algorithms in terms of how they acquire demonstrations. "Human-Centric" (HC) sampling is the standard supervised learning algorithm, where a human supervisor demonstrates the task by teleoperating the robot to provide trajectories consisting of state-control pairs. "Robot-Centric" (RC) sampling is an… ▽ More

    Submitted 28 March, 2017; v1 submitted 4 October, 2016; originally announced October 2016.

    Comments: Submitted to International Conference on Robotics and Automation (ICRA) 2017