
Showing 1–18 of 18 results for author: Paulius, D

  1. Skill Generalization with Verbs

    Authors: Rachel Ma, Lyndon Lam, Benjamin A. Spiegel, Aditya Ganeshan, Roma Patel, Ben Abbatematteo, David Paulius, Stefanie Tellex, George Konidaris

    Abstract: It is imperative that robots can understand natural language commands issued by humans. Such commands typically contain verbs that signify what action should be performed on a given object and that are applicable to many objects. We propose a method for generalizing manipulation skills to novel objects using verbs. Our method learns a probabilistic classifier that determines whether a given object…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 7 pages + 2 pages (references), 6 figures. Accepted at IROS 2023. Code, dataset info and demo videos can be found at: https://rachelma80000.github.io/SkillGenVerbs/

  2. arXiv:2409.12262  [pdf, other]

    cs.RO

    Bootstrapping Object-level Planning with Large Language Models

    Authors: David Paulius, Alejandro Agostini, Benedict Quartey, George Konidaris

    Abstract: We introduce a new method that extracts knowledge from a large language model (LLM) to produce object-level plans, which describe high-level changes to object state, and uses them to bootstrap task and motion planning (TAMP). Existing work uses LLMs to directly output task plans or generate goals in representations like PDDL. However, these methods fall short because they rely on the LLM to do the…

    Submitted 21 March, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: Accepted to ICRA 2025; 11 pages (6 pages + 1 page references + 4 pages appendix); for demo videos, please see https://davidpaulius.github.io/olp_llm/

  3. arXiv:2211.09935  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    CAPE: Corrective Actions from Precondition Errors using Large Language Models

    Authors: Shreyas Sundara Raman, Vanya Cohen, Ifrah Idrees, Eric Rosen, Ray Mooney, Stefanie Tellex, David Paulius

    Abstract: Extracting commonsense knowledge from a large language model (LLM) offers a path to designing intelligent robots. Existing approaches that leverage LLMs for planning are unable to recover when an action fails and often resort to retrying failed actions, without resolving the error's underlying cause. We propose a novel approach (CAPE) that attempts to propose corrective actions to resolve precondi…

    Submitted 9 March, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 17 pages, 6 figures, accepted at ICRA 2024

    MSC Class: 68T20; 68T50 ACM Class: I.2.7; I.2.8; I.2.2; I.2.4

  4. arXiv:2207.05800  [pdf, other]

    cs.RO cs.AI

    Long-Horizon Planning and Execution with Functional Object-Oriented Networks

    Authors: David Paulius, Alejandro Agostini, Dongheui Lee

    Abstract: Following work on joint object-action representations, functional object-oriented networks (FOON) were introduced as a knowledge graph representation for robots. A FOON contains symbolic concepts useful to a robot's understanding of tasks and its environment for object-level planning. Prior to this work, little has been done to show how plans acquired from FOON can be executed by a robot, as the c…

    Submitted 2 June, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: To be published in RA-L, 8 pages, Joint First Authors (Alejandro and David). For project website, see https://davidpaulius.github.io/foon-lhpe

  5. arXiv:2207.03693  [pdf, other]

    cs.RO

    Approximate Task Tree Retrieval in a Knowledge Network for Robotic Cooking

    Authors: Md. Sadman Sakib, David Paulius, Yu Sun

    Abstract: Flexible task planning continues to pose a difficult challenge for robots, where a robot is unable to creatively adapt its task plans to new or unseen problems, which is mainly due to the limited knowledge it has about its actions and world. Motivated by a human's ability to adapt, we explore how task plans from a knowledge graph, known as the Functional Object-Oriented Network (FOON), can be g…

    Submitted 8 July, 2022; originally announced July 2022.

  6. arXiv:2204.02274  [pdf, other]

    cs.RO cs.CV

    Grounding of the Functional Object-Oriented Network in Industrial Tasks

    Authors: Rafik Ayari, Matteo Pantano, David Paulius

    Abstract: In this preliminary work, we propose to design an activity recognition system that is suitable for Industrie 4.0 (I4.0) applications, especially focusing on Learning from Demonstration (LfD) in collaborative robot tasks. More precisely, we focus on the issue of data exchange between an activity recognition system and a collaborative robotic system. We propose an activity recognition system with li…

    Submitted 5 April, 2022; originally announced April 2022.

  7. arXiv:2112.02433  [pdf, other]

    cs.RO cs.AI

    Functional Task Tree Generation from a Knowledge Graph to Solve Unseen Problems

    Authors: Md. Sadman Sakib, David Paulius, Yu Sun

    Abstract: A major component for developing intelligent and autonomous robots is a suitable knowledge representation, from which a robot can acquire knowledge about its actions or world. However, unlike humans, robots cannot creatively adapt to novel scenarios, as their knowledge and environment are rigidly defined. To address the problem of producing novel and flexible task plans called task trees, we explo…

    Submitted 4 December, 2021; originally announced December 2021.

  8. arXiv:2106.00728  [pdf, other]

    cs.RO cs.CV

    Evaluating Recipes Generated from Functional Object-Oriented Network

    Authors: Md Sadman Sakib, Hailey Baez, David Paulius, Yu Sun

    Abstract: The functional object-oriented network (FOON) has been introduced as a knowledge representation, which takes the form of a graph, for symbolic task planning. To get a sequential plan for a manipulation task, a robot can obtain a task tree through a knowledge retrieval process from the FOON. To evaluate the quality of an acquired task tree, we compare it with a conventional form of task knowledge,…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: This manuscript has been accepted at Ubiquitous Robots 2021

  9. arXiv:2106.00158  [pdf, other]

    cs.RO cs.AI

    A Road-map to Robot Task Execution with the Functional Object-Oriented Network

    Authors: David Paulius, Alejandro Agostini, Yu Sun, Dongheui Lee

    Abstract: Following work on joint object-action representations, the functional object-oriented network (FOON) was introduced as a knowledge graph representation for robots. Taking the form of a bipartite graph, a FOON contains symbolic or high-level information that would be pertinent to a robot's understanding of its environment and tasks in a way that mirrors human understanding of actions. In this work,…

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: Ubiquitous Robots 2021 Submission -- 4 pages

  10. Developing Motion Code Embedding for Action Recognition in Videos

    Authors: Maxat Alibayev, David Paulius, Yu Sun

    Abstract: In this work, we propose a motion embedding strategy known as motion codes, which is a vectorized representation of motions based on a manipulation's salient mechanical attributes. These motion codes provide a robust motion representation, and they are obtained using a hierarchy of features called the motion taxonomy. We developed and trained a deep neural network model that combines visual and se…

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: Accepted by 25th International Conference on Pattern Recognition (ICPR2020)

  11. Estimating Motion Codes from Demonstration Videos

    Authors: Maxat Alibayev, David Paulius, Yu Sun

    Abstract: A motion taxonomy can encode manipulations as a binary-encoded representation, which we refer to as motion codes. These motion codes innately represent a manipulation action in an embedded space that describes the motion's mechanical features, including contact and trajectory type. The key advantage of using motion codes for embedding is that motions can be more appropriately defined with robotic-…

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: IROS 2020 Submission -- 6 pages; initial upload (Last updated July 31st 2020)

  12. A Motion Taxonomy for Manipulation Embedding

    Authors: David Paulius, Nicholas Eales, Yu Sun

    Abstract: To represent motions from a mechanical point of view, this paper explores motion embedding using the motion taxonomy. With this taxonomy, manipulations can be described and represented as binary strings called motion codes. Motion codes capture mechanical properties, such as contact type and trajectory, that should be used to define suitable distance metrics between motions or loss functions for d…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: RSS 2020 Submission -- Corrected Several Errors in Paper (last updated July 13th, 2020)

    Journal ref: Proceedings of Robotics: Science and Systems 2020

  13. Manipulation Motion Taxonomy and Coding for Robots

    Authors: David Paulius, Yongqiang Huang, Jason Meloncon, Yu Sun

    Abstract: This paper introduces a taxonomy of manipulations as seen especially in cooking for 1) grouping manipulations from the robotics point of view, 2) consolidating aliases and removing ambiguity for motion types, and 3) providing a path to transferring learned manipulations to new unlearned manipulations. Using instructional videos as a reference, we selected a list of common manipulation motions seen i…

    Submitted 31 July, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: IROS 2019 Submission -- 6 pages

  14. arXiv:1905.00502  [pdf, other]

    cs.RO cs.AI

    Task Planning with a Weighted Functional Object-Oriented Network

    Authors: David Paulius, Kelvin Sheng Pei Dong, Yu Sun

    Abstract: In reality, there is still much to be done for robots to be able to perform manipulation actions with full autonomy. Complicated manipulation tasks, such as cooking, may still require a person to perform some actions that are very risky for a robot to perform. On the other hand, some other actions may be very risky for a human with physical disabilities to perform. Therefore, it is necessary to ba…

    Submitted 25 March, 2021; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: ICRA 2021 Submission -- 7 Pages, Accepted to Conference

  15. Functional Object-Oriented Network for Manipulation Learning

    Authors: David Paulius, Yongqiang Huang, Roger Milton, William D. Buchanan, Jeanine Sam, Yu Sun

    Abstract: This paper presents a novel structured knowledge representation called the functional object-oriented network (FOON) to model the connectivity of the functional-related objects and their motions in manipulation tasks. The graphical model FOON is learned by observing object state change and human manipulations with the objects. Using a well-trained FOON, robots can decipher a task goal, seek the co…

    Submitted 28 November, 2020; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: IROS 2016 Submission -- Corrected several errors from the published version (last updated November 28th, 2020)

    Journal ref: 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pgs. 2655-2662

  16. A Survey of Knowledge Representation in Service Robotics

    Authors: David Paulius, Yu Sun

    Abstract: Within the realm of service robotics, researchers have placed a great amount of effort into learning, understanding, and representing motions as manipulations for task execution by robots. The task of robot learning and problem-solving is very broad, as it integrates a variety of tasks such as object detection, activity recognition, task/motion planning, localization, knowledge representation and…

    Submitted 21 June, 2023; v1 submitted 5 July, 2018; originally announced July 2018.

    Comments: Featured in Special Issue on Semantic Policy and Action Representations for Autonomous Robots, 22 Pages, Elsevier Format

    Journal ref: Robotics and Autonomous Systems 118 (2019) 13-30

  17. Functional Object-Oriented Network: Construction & Expansion

    Authors: David Paulius, Ahmad Babaeian Jelodar, Yu Sun

    Abstract: We build upon the functional object-oriented network (FOON), a structured knowledge representation which is constructed from observations of human activities and manipulations. A FOON can be used for representing object-motion affordances. Knowledge retrieval through graph search allows us to obtain novel manipulation sequences using knowledge spanning across many video sources, hence the novelty…

    Submitted 31 July, 2020; v1 submitted 5 July, 2018; originally announced July 2018.

    Comments: 7 pages, 3 figures, presented at ICRA 2018

    Journal ref: ICRA 2018 Submission -- 7 pages

  18. arXiv:1807.00983  [pdf, other]

    cs.CV

    Long Activity Video Understanding using Functional Object-Oriented Network

    Authors: Ahmad Babaeian Jelodar, David Paulius, Yu Sun

    Abstract: Video understanding is one of the most challenging topics in computer vision. In this paper, a four-stage video understanding pipeline is presented to simultaneously recognize all atomic actions and the single on-going activity in a video. This pipeline uses objects and motions from the video and a graph-based knowledge representation network as prior reference. Two deep networks are trained to id…

    Submitted 3 July, 2018; originally announced July 2018.

    Comments: 12 pages, 12 figures