Skip to main content

Showing 1–3 of 3 results for author: Kakodkar, N

.
  1. arXiv:2404.02294  [pdf, other

    cs.RO cs.LG

    Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs

    Authors: Faraz Lotfi, Farnoosh Faraji, Nikhil Kakodkar, Travis Manderson, David Meger, Gregory Dudek

    Abstract: This paper explores leveraging large language models for map-free off-road navigation using generative AI, reducing the need for traditional data collection and annotation. We propose a method where a robot receives verbal instructions, converted to text through Whisper, and a large language model (LLM) model extracts landmarks, preferred terrains, and crucial adverbs translated into speed setting… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Presented at ISER 2023

  2. arXiv:2307.11865  [pdf, other

    cs.RO cs.CL

    CARTIER: Cartographic lAnguage Reasoning Targeted at Instruction Execution for Robots

    Authors: Dmitriy Rivkin, Nikhil Kakodkar, Francois Hogan, Bobak H. Baghi, Gregory Dudek

    Abstract: This work explores the capacity of large language models (LLMs) to address problems at the intersection of spatial planning and natural language interfaces for navigation. We focus on following complex instructions that are more akin to natural conversation than traditional explicit procedural directives typically seen in robotics. Unlike most prior work where navigation directives are provided as… ▽ More

    Submitted 1 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

  3. arXiv:2302.07931  [pdf, other

    cs.RO

    ANSEL Photobot: A Robot Event Photographer with Semantic Intelligence

    Authors: Dmitriy Rivkin, Gregory Dudek, Nikhil Kakodkar, David Meger, Oliver Limoyo, Xue Liu, Francois Hogan

    Abstract: Our work examines the way in which large language models can be used for robotic planning and sampling, specifically the context of automated photographic documentation. Specifically, we illustrate how to produce a photo-taking robot with an exceptional level of semantic awareness by leveraging recent advances in general purpose language (LM) and vision-language (VLM) models. Given a high-level de… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: ICRA 2023