Skip to main content

Showing 1–18 of 18 results for author: Kappler, D

.
  1. arXiv:2412.16720  [pdf, other

    cs.AI

    OpenAI o1 System Card

    Authors: OpenAI, :, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, Aiden Low, Alec Helyar, Aleksander Madry, Alex Beutel, Alex Carney, Alex Iftimie, Alex Karpenko, Alex Tachard Passos, Alexander Neitz, Alexander Prokofiev, Alexander Wei, Allison Tam, Ally Bennett, Ananya Kumar, Andre Saraiva, Andrea Vallone, Andrew Duberstein, Andrew Kondrich , et al. (238 additional authors not shown)

    Abstract: The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. These advanced reasoning capabilities provide new avenues for improving the safety and robustness of our models. In particular, our models can reason about our safety policies in context when responding to potentially unsafe prompts, through deliberative alignment. This leads to state-of-the-ar… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  2. arXiv:2410.21276  [pdf, other

    cs.CL cs.AI cs.CV cs.CY cs.LG cs.SD eess.AS

    GPT-4o System Card

    Authors: OpenAI, :, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis , et al. (395 additional authors not shown)

    Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  3. arXiv:2305.03270  [pdf, other

    cs.RO

    Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

    Authors: Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham , et al. (15 additional authors not shown)

    Abstract: We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Published at Robotics: Science and Systems 2023

  4. arXiv:2209.09874  [pdf, other

    cs.RO cs.AI cs.CV

    Open-vocabulary Queryable Scene Representations for Real World Planning

    Authors: Boyuan Chen, Fei Xia, Brian Ichter, Kanishka Rao, Keerthana Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler

    Abstract: Large language models (LLMs) have unlocked new capabilities of task planning from human instructions. However, prior attempts to apply LLMs to real-world robotic tasks are limited by the lack of grounding in the surrounding scene. In this paper, we develop NLMap, an open-vocabulary and queryable scene representation to address this problem. NLMap serves as a framework to gather and integrate conte… ▽ More

    Submitted 15 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: v2, added references to concurrent work and acknowledgments

  5. arXiv:2202.02005  [pdf, other

    cs.RO cs.LG

    BC-Z: Zero-Shot Task Generalization with Robotic Imitation Learning

    Authors: Eric Jang, Alex Irpan, Mohi Khansari, Daniel Kappler, Frederik Ebert, Corey Lynch, Sergey Levine, Chelsea Finn

    Abstract: In this paper, we study the problem of enabling a vision-based robotic manipulation system to generalize to novel tasks, a long-standing challenge in robot learning. We approach the challenge from an imitation learning perspective, aiming to study how scaling and broadening the data collected can facilitate such generalization. To that end, we develop an interactive and flexible imitation learning… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: CoRL 2021, 23 pages

    Journal ref: Conference on Robot Learning (pp. 991-1002). 2022 Jan 11

  6. arXiv:2005.06594  [pdf, other

    cs.RO cs.LG

    Action Image Representation: Learning Scalable Deep Grasping Policies with Zero Real World Data

    Authors: Mohi Khansari, Daniel Kappler, Jianlan Luo, Jeff Bingham, Mrinal Kalakrishnan

    Abstract: This paper introduces Action Image, a new grasp proposal representation that allows learning an end-to-end deep-grasping policy. Our model achieves $84\%$ grasp success on $172$ real world objects while being trained only in simulation on $48$ objects with just naive domain randomization. Similar to computer vision problems, such as object detection, Action Image builds on the idea that object fea… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 7 pages, 10 figures, and 3 tables. To be published in International Conference on Robotics and Automation, 2020

  7. arXiv:2003.02636  [pdf, other

    cs.RO cs.LG stat.ML

    Scalable Multi-Task Imitation Learning with Autonomous Improvement

    Authors: Avi Singh, Eric Jang, Alexander Irpan, Daniel Kappler, Murtaza Dalal, Sergey Levine, Mohi Khansari, Chelsea Finn

    Abstract: While robot learning has demonstrated promising results for enabling robots to automatically acquire new skills, a critical challenge in deploying learning-based systems is scale: acquiring enough data for the robot to effectively generalize broadly. Imitation learning, in particular, has remained a stable and powerful approach for robot learning, but critically relies on expert operators for data… ▽ More

    Submitted 25 February, 2020; originally announced March 2020.

    Comments: Accepted to ICRA 2020. Supplementary material at https://sites.google.com/view/scalable-mili

  8. arXiv:1906.03352  [pdf, other

    cs.LG cs.AI stat.ML

    Watch, Try, Learn: Meta-Learning from Demonstrations and Reward

    Authors: Allan Zhou, Eric Jang, Daniel Kappler, Alex Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn

    Abstract: Imitation learning allows agents to learn complex behaviors from demonstrations. However, learning a complex vision-based task may require an impractical number of demonstrations. Meta-imitation learning is a promising approach towards enabling agents to learn a new task from one or a few demonstrations by leveraging experience from learning similar tasks. In the presence of task ambiguity or unob… ▽ More

    Submitted 30 January, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

  9. arXiv:1809.07004  [pdf, other

    cs.RO cs.AI cs.LG

    Leveraging Contact Forces for Learning to Grasp

    Authors: Hamza Merzic, Miroslav Bogdanovic, Daniel Kappler, Ludovic Righetti, Jeannette Bohg

    Abstract: Grasping objects under uncertainty remains an open problem in robotics research. This uncertainty is often due to noisy or partial observations of the object pose or shape. To enable a robot to react appropriately to unforeseen effects, it is crucial that it continuously takes sensor feedback into account. While visual feedback is important for inferring a grasp pose and reaching for an object, co… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: 7 pages, 5 figures, Submitted to ICRA'19

  10. arXiv:1809.03276  [pdf, other

    cs.RO

    Grasp success prediction with quality metrics

    Authors: Carlos Rubert, Daniel Kappler, Jeannette Bohg, Antonio Morales

    Abstract: Current robotic manipulation requires reliable methods to predict whether a certain grasp on an object will be successful or not prior to its execution. Different methods and metrics have been developed for this purpose but there is still work to do to provide a robust solution. In this article we combine different metrics to evaluate real grasp executions. We use different machine learning algo… ▽ More

    Submitted 10 September, 2018; originally announced September 2018.

  11. arXiv:1801.02854  [pdf, other

    cs.RO

    Riemannian Motion Policies

    Authors: Nathan D. Ratliff, Jan Issac, Daniel Kappler, Stan Birchfield, Dieter Fox

    Abstract: We introduce the Riemannian Motion Policy (RMP), a new mathematical object for modular motion generation. An RMP is a second-order dynamical system (acceleration field or motion policy) coupled with a corresponding Riemannian metric. The motion policy maps positions and velocities to accelerations, while the metric captures the directions in the space important to the policy. We show that RMPs pro… ▽ More

    Submitted 25 July, 2018; v1 submitted 9 January, 2018; originally announced January 2018.

  12. arXiv:1710.02513  [pdf, other

    cs.RO

    A New Data Source for Inverse Dynamics Learning

    Authors: Daniel Kappler, Franziska Meier, Nathan Ratliff, Stefan Schaal

    Abstract: Modern robotics is gravitating toward increasingly collaborative human robot interaction. Tools such as acceleration policies can naturally support the realization of reactive, adaptive, and compliant robots. These tools require us to model the system dynamics accurately -- a difficult task. The fundamental problem remains that simulation and reality diverge--we do not know how to accurately chang… ▽ More

    Submitted 6 October, 2017; originally announced October 2017.

    Comments: IROS 2017

  13. arXiv:1709.06709  [pdf, other

    cs.LG

    Online Learning of a Memory for Learning Rates

    Authors: Franziska Meier, Daniel Kappler, Stefan Schaal

    Abstract: The promise of learning to learn for robotics rests on the hope that by extracting some information about the learning process itself we can speed up subsequent similar learning tasks. Here, we introduce a computationally efficient online meta-learning algorithm that builds and optimizes a memory model of the optimal learning rate landscape from previously observed gradient behaviors. While perfor… ▽ More

    Submitted 23 March, 2018; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: accepted to ICRA 2018, code available: https://github.com/fmeier/online-meta-learning ; video pitch available: https://youtu.be/9PzQ25FPPOM

  14. arXiv:1703.03512  [pdf, other

    cs.RO

    Real-time Perception meets Reactive Motion Generation

    Authors: Daniel Kappler, Franziska Meier, Jan Issac, Jim Mainprice, Cristina Garcia Cifuentes, Manuel Wüthrich, Vincent Berenz, Stefan Schaal, Nathan Ratliff, Jeannette Bohg

    Abstract: We address the challenging problem of robotic grasping and manipulation in the presence of uncertainty. This uncertainty is due to noisy sensing, inaccurate models and hard-to-predict environment dynamics. We quantify the importance of continuous, real-time perception and its tight integration with reactive motion generation methods in dynamic manipulation scenarios. We compare three different sys… ▽ More

    Submitted 6 October, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

  15. arXiv:1608.00309  [pdf, other

    cs.RO

    DOOMED: Direct Online Optimization of Modeling Errors in Dynamics

    Authors: Nathan Ratliff, Franziska Meier, Daniel Kappler, Stefan Schaal

    Abstract: It has long been hoped that model-based control will improve tracking performance while maintaining or increasing compliance. This hope hinges on having or being able to estimate an accurate inverse dynamics model. As a result, substantial effort has gone into modeling and estimating dynamics (error) models. Most recent research has focused on learning the true inverse dynamics using data points m… ▽ More

    Submitted 9 August, 2016; v1 submitted 31 July, 2016; originally announced August 2016.

    Comments: Added an acknowledgements section

  16. arXiv:1511.06739  [pdf, other

    cs.CV

    Superpixel Convolutional Networks using Bilateral Inceptions

    Authors: Raghudeep Gadde, Varun Jampani, Martin Kiefel, Daniel Kappler, Peter V. Gehler

    Abstract: In this paper we propose a CNN architecture for semantic image segmentation. We introduce a new 'bilateral inception' module that can be inserted in existing CNN architectures and performs bilateral filtering, at multiple feature-scales, between superpixels in an image. The feature spaces for bilateral filtering and other parameters of the module are learned end-to-end using standard backpropagati… ▽ More

    Submitted 8 August, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: European Conference on Computer Vision (ECCV), 2016

    ACM Class: I.2.10; I.2.6

  17. The Coordinate Particle Filter - A novel Particle Filter for High Dimensional Systems

    Authors: Manuel Wüthrich, Jeannette Bohg, Daniel Kappler, Claudia Pfreundt, Stefan Schaal

    Abstract: Parametric filters, such as the Extended Kalman Filter and the Unscented Kalman Filter, typically scale well with the dimensionality of the problem, but they are known to fail if the posterior state distribution cannot be closely approximated by a density of the assumed parametric form. For nonparametric filters, such as the Particle Filter, the converse holds. Such methods are able to approximate… ▽ More

    Submitted 1 May, 2015; originally announced May 2015.

  18. arXiv:1504.07941  [pdf, other

    cs.RO

    A New Perspective and Extension of the Gaussian Filter

    Authors: Manuel Wüthrich, Sebastian Trimpe, Daniel Kappler, Stefan Schaal

    Abstract: The Gaussian Filter (GF) is one of the most widely used filtering algorithms; instances are the Extended Kalman Filter, the Unscented Kalman Filter and the Divided Difference Filter. GFs represent the belief of the current state by a Gaussian with the mean being an affine function of the measurement. We show that this representation can be too restrictive to accurately capture the dependences in s… ▽ More

    Submitted 5 June, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

    Comments: Will appear in Robotics: Science and Systems (R:SS) 2015