Showing 1–10 of 10 results for author: Kelestemur, T

  1. arXiv:2502.20382  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization

    Authors: Lujie Yang, H. J. Terry Suh, Tong Zhao, Bernhard Paus Graesdal, Tarik Kelestemur, Jiuguang Wang, Tao Pang, Russ Tedrake

    Abstract: We present a low-cost data generation pipeline that integrates physics-based simulation, human demonstrations, and model-based planning to efficiently generate large-scale, high-quality datasets for contact-rich robotic manipulation tasks. Starting with a small number of embodiment-flexible human demonstrations collected in a virtual reality simulation environment, the pipeline refines these demon…

    Submitted 27 February, 2025; originally announced February 2025.

  2. arXiv:2501.13338  [pdf, other]

    cs.RO cs.CV cs.LG

    CuriousBot: Interactive Mobile Exploration via Actionable 3D Relational Object Graph

    Authors: Yixuan Wang, Leonor Fermoselle, Tarik Kelestemur, Jiuguang Wang, Yunzhu Li

    Abstract: Mobile exploration is a longstanding challenge in robotics, yet current methods primarily focus on active perception instead of active interaction, limiting the robot's ability to interact with and fully explore its environment. Existing robotic exploration approaches via active interaction are often restricted to tabletop scenes, neglecting the unique challenges posed by mobile exploration, such…

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: Project Page: https://curiousbot.theaiinstitute.com/

  3. arXiv:2410.19989  [pdf, other]

    cs.RO cs.LG

    On-Robot Reinforcement Learning with Goal-Contrastive Rewards

    Authors: Ondrej Biza, Thomas Weng, Lingfeng Sun, Karl Schmeckpeper, Tarik Kelestemur, Yecheng Jason Ma, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: Reinforcement Learning (RL) has the potential to enable robots to learn from their own actions in the real world. Unfortunately, RL can be prohibitively expensive, in terms of on-robot runtime, due to inefficient exploration when learning from a sparse reward signal. Designing dense reward functions is labour-intensive and requires domain expertise. In our work, we propose GCR (Goal-Contrastive Re…

    Submitted 14 May, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

  4. arXiv:2410.17488  [pdf, other]

    cs.RO cs.CV cs.LG

    GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy

    Authors: Yixuan Wang, Guang Yin, Binghao Huang, Tarik Kelestemur, Jiuguang Wang, Yunzhu Li

    Abstract: Diffusion-based policies have shown remarkable capability in executing complex robotic manipulation tasks but lack explicit characterization of geometry and semantics, which often limits their ability to generalize to unseen objects and layouts. To enhance the generalization capabilities of Diffusion Policy, we introduce a novel framework that incorporates explicit spatial and semantic information…

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: Accepted to Conference on Robot Learning (CoRL 2024). Project Page: https://robopil.github.io/GenDP/

  5. arXiv:2407.20179  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Theia: Distilling Diverse Vision Foundation Models for Robot Learning

    Authors: Jinghuan Shang, Karl Schmeckpeper, Brandon B. May, Maria Vittoria Minniti, Tarik Kelestemur, David Watkins, Laura Herlant

    Abstract: Vision-based robot policy learning, which maps visual inputs to actions, necessitates a holistic understanding of diverse visual tasks beyond single-task needs like classification or segmentation. Inspired by this, we introduce Theia, a vision foundation model for robot learning that distills multiple off-the-shelf vision foundation models trained on varied vision tasks. Theia's rich visual repres…

    Submitted 10 October, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: CoRL 2024

  6. arXiv:2407.01812  [pdf, other]

    cs.RO cs.LG

    Equivariant Diffusion Policy

    Authors: Dian Wang, Stephen Hart, David Surovik, Tarik Kelestemur, Haojie Huang, Haibo Zhao, Mark Yeatman, Jiuguang Wang, Robin Walters, Robert Platt

    Abstract: Recent work has shown diffusion models are an effective approach to learning the multimodal distributions arising from demonstration data in behavior cloning. However, a drawback of this approach is the need to learn a denoising function, which is significantly more complex than learning an explicit policy. In this work, we propose Equivariant Diffusion Policy, a novel diffusion policy learning me…

    Submitted 15 October, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: Conference on Robot Learning 2024, Oral Presentation

  7. arXiv:2309.16118  [pdf, other]

    cs.RO cs.CV cs.LG

    D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement

    Authors: Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li

    Abstract: Scene representation is a crucial design choice in robotic manipulation systems. An ideal representation is expected to be 3D, dynamic, and semantic to meet the demands of diverse manipulation tasks. However, previous works often lack all three properties simultaneously. In this work, we introduce D$^3$Fields -- dynamic 3D descriptor fields. These fields are implicit 3D representations that take i…

    Submitted 16 October, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: https://robopil.github.io/d3fields/

  8. arXiv:2207.00942  [pdf, other]

    cs.RO

    Pregrasp Object Material Classification by a Novel Gripper Design with Integrated Spectroscopy

    Authors: Nathaniel Hanson, Tarik Kelestemur, Deniz Erdogmus, Taskin Padir

    Abstract: Robots benefit from being able to classify objects they interact with or manipulate based on their material properties. This capability ensures fine manipulation of complex objects through proper grasp pose and force selection. Prior work has focused on haptic or visual processing to determine material type at grasp time. In this work, we introduce a novel parallel robot gripper design and a metho…

    Submitted 2 July, 2022; originally announced July 2022.

  9. arXiv:2203.10685  [pdf, other]

    cs.RO

    Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation

    Authors: Tarik Kelestemur, Robert Platt, Taskin Padir

    Abstract: Object pose estimation methods allow finding locations of objects in unstructured environments. This is a highly desired skill for autonomous robot manipulation as robots need to estimate the precise poses of the objects in order to manipulate them. In this paper, we investigate the problems of tactile pose estimation and manipulation for category-level objects. Our proposed method uses a Bayes fi…

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted at the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022)

  10. arXiv:2011.05559  [pdf, other]

    cs.RO

    Learning Bayes Filter Models for Tactile Localization

    Authors: Tarik Kelestemur, Colin Keil, John P. Whitney, Robert Platt, Taskin Padir

    Abstract: Localizing and tracking the pose of robotic grippers are necessary skills for manipulation tasks. However, the manipulators with imprecise kinematic models (e.g. low-cost arms) or manipulators with unknown world coordinates (e.g. poor camera-arm calibration) cannot locate the gripper with respect to the world. In these circumstances, we can leverage tactile feedback between the gripper and the env…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted at IROS 2020