Skip to main content

Showing 1–5 of 5 results for author: El-Refai, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.09601  [pdf, ps, other

    cs.RO

    Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware

    Authors: Justin Yu, Letian Fu, Huang Huang, Karim El-Refai, Rares Andrei Ambrus, Richard Cheng, Muhammad Zubair Irshad, Ken Goldberg

    Abstract: Scaling robot learning requires vast and diverse datasets. Yet the prevailing data collection paradigm-human teleoperation-remains costly and constrained by manual effort and physical robot access. We introduce Real2Render2Real (R2R2R), a novel approach for generating robot training data without relying on object dynamics simulation or teleoperation of robot hardware. The input is a smartphone-cap… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  2. arXiv:2503.05189  [pdf, other

    cs.RO

    Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects

    Authors: Justin Yu, Kush Hari, Karim El-Refai, Arnav Dalal, Justin Kerr, Chung Min Kim, Richard Cheng, Muhammad Zubair Irshad, Ken Goldberg

    Abstract: Tracking and manipulating irregularly-shaped, previously unseen objects in dynamic environments is important for robotic applications in manufacturing, assembly, and logistics. Recently introduced Gaussian Splats efficiently model object geometry, but lack persistent state estimation for task-oriented manipulation. We present Persistent Object Gaussian Splat (POGS), a system that embeds semantics,… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: Accepted to ICRA 2025

  3. arXiv:2409.18108  [pdf, other

    cs.RO

    Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot

    Authors: Justin Yu, Kush Hari, Kishore Srinivas, Karim El-Refai, Adam Rashid, Chung Min Kim, Justin Kerr, Richard Cheng, Muhammad Zubair Irshad, Ashwin Balakrishna, Thomas Kollar, Ken Goldberg

    Abstract: Building semantic 3D maps is valuable for searching for objects of interest in offices, warehouses, stores, and homes. We present a mapping system that incrementally builds a Language-Embedded Gaussian Splat (LEGS): a detailed 3D scene representation that encodes both appearance and semantics in a unified representation. LEGS is trained online as a robot traverses its environment to enable localiz… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  4. arXiv:2408.12593  [pdf, other

    cs.RO cs.CV

    Automating Deformable Gasket Assembly

    Authors: Simeon Adebola, Tara Sadjadpour, Karim El-Refai, Will Panitch, Zehan Ma, Roy Lin, Tianshuang Qiu, Shreya Ganti, Charlotte Le, Jaimyn Drake, Ken Goldberg

    Abstract: In Gasket Assembly, a deformable gasket must be aligned and pressed into a narrow channel. This task is common for sealing surfaces in the manufacturing of automobiles, appliances, electronics, and other products. Gasket Assembly is a long-horizon, high-precision task and the gasket must align with the channel and be fully pressed in to achieve a secure fit. To compare approaches, we present 4 met… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Content without Appendix accepted for IEEE CASE 2024

  5. arXiv:2402.03483  [pdf, other

    cs.CL cs.AI

    SWAG: Storytelling With Action Guidance

    Authors: Zeeshan Patel, Karim El-Refai, Jonathan Pei, Tianle Li

    Abstract: Automated long-form story generation typically employs long-context large language models (LLMs) for one-shot creation, which can produce cohesive but not necessarily engaging content. We introduce Storytelling With Action Guidance (SWAG), a novel approach to storytelling with LLMs. Our approach frames story writing as a search problem through a two-model feedback loop: one LLM generates story con… ▽ More

    Submitted 7 October, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: EMNLP Findings 2024