Skip to main content

Showing 1–19 of 19 results for author: Schmeckpeper, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17564  [pdf, ps, other

    cs.LG cs.AI cs.RO

    Accelerating Residual Reinforcement Learning with Uncertainty Estimation

    Authors: Lakshita Dodeja, Karl Schmeckpeper, Shivam Vats, Thomas Weng, Mingxi Jia, George Konidaris, Stefanie Tellex

    Abstract: Residual Reinforcement Learning (RL) is a popular approach for adapting pretrained policies by learning a lightweight residual policy that provides corrective actions. While Residual RL is more sample-efficient than finetuning the entire base policy, existing methods struggle with sparse rewards and are designed for deterministic base policies. We propose two improvements to Residual RL that furth… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  2. arXiv:2504.03597  [pdf, ps, other

    cs.RO cs.AI

    Real-is-Sim: Bridging the Sim-to-Real Gap with a Dynamic Digital Twin

    Authors: Jad Abou-Chakra, Lingfeng Sun, Krishan Rana, Brandon May, Karl Schmeckpeper, Niko Suenderhauf, Maria Vittoria Minniti, Laura Herlant

    Abstract: We introduce real-is-sim, a new approach to integrating simulation into behavior cloning pipelines. In contrast to real-only methods, which lack the ability to safely test policies before deployment, and sim-to-real methods, which require complex adaptation to cross the sim-to-real gap, our framework allows policies to seamlessly switch between running on real hardware and running in parallelized… ▽ More

    Submitted 1 July, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  3. arXiv:2410.19989  [pdf, other

    cs.RO cs.LG

    On-Robot Reinforcement Learning with Goal-Contrastive Rewards

    Authors: Ondrej Biza, Thomas Weng, Lingfeng Sun, Karl Schmeckpeper, Tarik Kelestemur, Yecheng Jason Ma, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: Reinforcement Learning (RL) has the potential to enable robots to learn from their own actions in the real world. Unfortunately, RL can be prohibitively expensive, in terms of on-robot runtime, due to inefficient exploration when learning from a sparse reward signal. Designing dense reward functions is labour-intensive and requires domain expertise. In our work, we propose GCR (Goal-Contrastive Re… ▽ More

    Submitted 14 May, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

  4. arXiv:2407.20179  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Theia: Distilling Diverse Vision Foundation Models for Robot Learning

    Authors: Jinghuan Shang, Karl Schmeckpeper, Brandon B. May, Maria Vittoria Minniti, Tarik Kelestemur, David Watkins, Laura Herlant

    Abstract: Vision-based robot policy learning, which maps visual inputs to actions, necessitates a holistic understanding of diverse visual tasks beyond single-task needs like classification or segmentation. Inspired by this, we introduce Theia, a vision foundation model for robot learning that distills multiple off-the-shelf vision foundation models trained on varied vision tasks. Theia's rich visual repres… ▽ More

    Submitted 10 October, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: CoRL 2024

  5. arXiv:2406.11740  [pdf, other

    cs.RO cs.AI cs.LG

    Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies

    Authors: Haojie Huang, Karl Schmeckpeper, Dian Wang, Ondrej Biza, Yaoyao Qian, Haotian Liu, Mingxi Jia, Robert Platt, Robin Walters

    Abstract: Humans can imagine goal states during planning and perform actions to match those goals. In this work, we propose Imagination Policy, a novel multi-task key-frame policy network for solving high-precision pick and place tasks. Instead of learning actions directly, Imagination Policy generates point clouds to imagine desired states which are then translated to actions using rigid action estimation.… ▽ More

    Submitted 30 November, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2311.07578  [pdf, other

    cs.CV cs.AI cs.LG

    A Metacognitive Approach to Out-of-Distribution Detection for Segmentation

    Authors: Meghna Gummadi, Cassandra Kent, Karl Schmeckpeper, Eric Eaton

    Abstract: Despite outstanding semantic scene segmentation in closed-worlds, deep neural networks segment novel instances poorly, which is required for autonomous agents acting in an open world. To improve out-of-distribution (OOD) detection for segmentation, we introduce a metacognitive approach in the form of a lightweight module that leverages entropy measures, segmentation predictions, and spatial contex… ▽ More

    Submitted 4 October, 2023; originally announced November 2023.

  7. arXiv:2303.15440  [pdf, other

    cs.CV

    EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision

    Authors: Jiahui Lei, Congyue Deng, Karl Schmeckpeper, Leonidas Guibas, Kostas Daniilidis

    Abstract: We introduce Equivariant Neural Field Expectation Maximization (EFEM), a simple, effective, and robust geometric algorithm that can segment objects in 3D scenes without annotations or training on scenes. We achieve such unsupervised segmentation by exploiting single object shape priors. We make two novel steps in that direction. First, we introduce equivariant shape representations to this problem… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023, project page https://www.cis.upenn.edu/~leijh/projects/efem

  8. Semantic keypoint-based pose estimation from single RGB frames

    Authors: Karl Schmeckpeper, Philip R. Osteen, Yufu Wang, Georgios Pavlakos, Kenneth Chaney, Wyatt Jordan, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

    Abstract: This paper presents an approach to estimating the continuous 6-DoF pose of an object from a single RGB image. The approach combines semantic keypoints predicted by a convolutional network (convnet) with a deformable shape model. Unlike prior investigators, we are agnostic to whether the object is textured or textureless, as the convnet learns the optimal representation from the available training-… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: https://sites.google.com/view/rcta-object-keypoints-dataset/home. arXiv admin note: substantial text overlap with arXiv:1703.04670

    Journal ref: Field Robotics, 2, 147-171, 2022

  9. arXiv:2203.05137  [pdf, other

    cs.CV cs.RO

    Cross-modal Map Learning for Vision and Language Navigation

    Authors: Georgios Georgakis, Karl Schmeckpeper, Karan Wanchoo, Soham Dan, Eleni Miltsakaki, Dan Roth, Kostas Daniilidis

    Abstract: We consider the problem of Vision-and-Language Navigation (VLN). The majority of current methods for VLN are trained end-to-end using either unstructured memory such as LSTM, or using cross-modal attention over the egocentric observations of the agent. In contrast to other works, our key insight is that the association between language and vision is stronger when it occurs in explicit spatial repr… ▽ More

    Submitted 21 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

  10. arXiv:2202.11907  [pdf, other

    cs.RO cs.CV

    Uncertainty-driven Planner for Exploration and Navigation

    Authors: Georgios Georgakis, Bernadette Bucher, Anton Arapin, Karl Schmeckpeper, Nikolai Matni, Kostas Daniilidis

    Abstract: We consider the problems of exploration and point-goal navigation in previously unseen environments, where the spatial complexity of indoor scenes and partial observability constitute these tasks challenging. We argue that learning occupancy priors over indoor maps provides significant advantages towards addressing these problems. To this end, we present a novel planning framework that first learn… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  11. arXiv:2109.13396  [pdf, other

    cs.RO cs.AI

    Bridge Data: Boosting Generalization of Robotic Skills with Cross-Domain Datasets

    Authors: Frederik Ebert, Yanlai Yang, Karl Schmeckpeper, Bernadette Bucher, Georgios Georgakis, Kostas Daniilidis, Chelsea Finn, Sergey Levine

    Abstract: Robot learning holds the promise of learning policies that generalize broadly. However, such generalization requires sufficiently diverse datasets of the task of interest, which can be prohibitively expensive to collect. In other fields, such as computer vision, it is common to utilize shared, reusable datasets, such as ImageNet, to overcome this challenge, but this has proven difficult in robotic… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  12. arXiv:2106.15648  [pdf, other

    cs.CV cs.RO

    Learning to Map for Active Semantic Goal Navigation

    Authors: Georgios Georgakis, Bernadette Bucher, Karl Schmeckpeper, Siddharth Singh, Kostas Daniilidis

    Abstract: We consider the problem of object goal navigation in unseen environments. Solving this problem requires learning of contextual semantic priors, a challenging endeavour given the spatial and semantic variability of indoor environments. Current methods learn to implicitly encode these priors through goal-oriented navigation policy functions operating on spatial representations that are limited to th… ▽ More

    Submitted 8 March, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

  13. arXiv:2105.02799  [pdf, other

    cs.CV cs.RO

    Object-centric Video Prediction without Annotation

    Authors: Karl Schmeckpeper, Georgios Georgakis, Kostas Daniilidis

    Abstract: In order to interact with the world, agents must be able to predict the results of the world's dynamics. A natural approach to learn about these dynamics is through video prediction, as cameras are ubiquitous and powerful sensors. Direct pixel-to-pixel video prediction is difficult, does not take advantage of known priors, and does not provide an easy interface to utilize the learned dynamics. Obj… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  14. arXiv:2103.14184  [pdf, other

    cs.CV

    Deformable Linear Object Prediction Using Locally Linear Latent Dynamics

    Authors: Wenbo Zhang, Karl Schmeckpeper, Pratik Chaudhari, Kostas Daniilidis

    Abstract: We propose a framework for deformable linear object prediction. Prediction of deformable objects (e.g., rope) is challenging due to their non-linear dynamics and infinite-dimensional configuration spaces. By mapping the dynamics from a non-linear space to a linear space, we can use the good properties of linear dynamics for easier learning and more efficient prediction. We learn a locally linear,… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

  15. arXiv:2011.06507  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Reinforcement Learning with Videos: Combining Offline Observations with Interaction

    Authors: Karl Schmeckpeper, Oleh Rybkin, Kostas Daniilidis, Sergey Levine, Chelsea Finn

    Abstract: Reinforcement learning is a powerful framework for robots to acquire skills from experience, but often requires a substantial amount of online data collection. As a result, it is difficult to collect sufficiently diverse experiences that are needed for robots to generalize broadly. Videos of humans, on the other hand, are a readily available source of broad and interesting experiences. In this pap… ▽ More

    Submitted 4 November, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Journal ref: Conference on Robot Learning (2020)

  16. arXiv:2003.06082  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    An Adversarial Objective for Scalable Exploration

    Authors: Bernadette Bucher, Karl Schmeckpeper, Nikolai Matni, Kostas Daniilidis

    Abstract: Model-based curiosity combines active learning approaches to optimal sampling with the information gain based incentives for exploration presented in the curiosity literature. Existing model-based curiosity methods look to approximate prediction uncertainty with approaches which struggle to scale to many prediction-planning pipelines used in robotics tasks. We address these scalability issues with… ▽ More

    Submitted 11 November, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Additional visualizations of our results are available on our website at https://sites.google.com/view/action-for-better-prediction . Bernadette Bucher and Karl Schmeckpeper contributed equally

  17. arXiv:2002.08946  [pdf, other

    cs.RO

    Reactive Navigation in Partially Familiar Planar Environments Using Semantic Perceptual Feedback

    Authors: Vasileios Vasilopoulos, Georgios Pavlakos, Karl Schmeckpeper, Kostas Daniilidis, Daniel E. Koditschek

    Abstract: This paper solves the planar navigation problem by recourse to an online reactive scheme that exploits recent advances in SLAM and visual object recognition to recast prior geometric knowledge in terms of an offline catalogue of familiar objects. The resulting vector field planner guarantees convergence to an arbitrarily specified goal, avoiding collisions along the way with fixed but arbitrarily… ▽ More

    Submitted 18 August, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Preprint of paper in the International Journal of Robotics Research (76 pages, 23 figures) - Includes results used in arXiv:2002.12349

  18. arXiv:1912.12773  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Predictive Models From Observation and Interaction

    Authors: Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas Daniilidis, Sergey Levine, Chelsea Finn

    Abstract: Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes. However, learning a model that captures the dynamics of complex skills represents a major challenge: if the agent needs a good model to perform these skills, it migh… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

  19. arXiv:1910.11215  [pdf, other

    cs.RO cs.CV cs.LG

    RoboNet: Large-Scale Multi-Robot Learning

    Authors: Sudeep Dasari, Frederik Ebert, Stephen Tian, Suraj Nair, Bernadette Bucher, Karl Schmeckpeper, Siddharth Singh, Sergey Levine, Chelsea Finn

    Abstract: Robot learning has emerged as a promising tool for taming the complexity and diversity of the real world. Methods based on high-capacity models, such as deep networks, hold the promise of providing effective generalization to a wide range of open-world environments. However, these same methods typically require large amounts of diverse training data to generalize effectively. In contrast, most rob… ▽ More

    Submitted 2 January, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: accepted at the Conference on Robot Learning (CoRL) 2019