Skip to main content

Showing 1–4 of 4 results for author: Palan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13715  [pdf, other

    cs.AI cs.IR

    Converging Dimensions: Information Extraction and Summarization through Multisource, Multimodal, and Multilingual Fusion

    Authors: Pranav Janjani, Mayank Palan, Sarvesh Shirude, Ninad Shegokar, Sunny Kumar, Faruk Kazi

    Abstract: Recent advances in large language models (LLMs) have led to new summarization strategies, offering an extensive toolkit for extracting important information. However, these approaches are frequently limited by their reliance on isolated sources of data. The amount of information that can be gathered is limited and covers a smaller range of themes, which introduces the possibility of falsified cont… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures

  2. arXiv:2006.14091  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences

    Authors: Erdem Bıyık, Dylan P. Losey, Malayandi Palan, Nicholas C. Landolfi, Gleb Shevchuk, Dorsa Sadigh

    Abstract: Reward functions are a common way to specify the objective of a robot. As designing reward functions can be extremely challenging, a more promising approach is to directly learn reward functions from human teachers. Importantly, data from human teachers can be collected either passively or actively in a variety of forms: passive data sources include demonstrations, (e.g., kinesthetic guidance), wh… ▽ More

    Submitted 4 August, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 20 pages, 17 figures. Accepted for publication by The International Journal of Robotics Research (IJRR)

  3. arXiv:1910.04365  [pdf, other

    cs.RO cs.AI cs.LG

    Asking Easy Questions: A User-Friendly Approach to Active Reward Learning

    Authors: Erdem Bıyık, Malayandi Palan, Nicholas C. Landolfi, Dylan P. Losey, Dorsa Sadigh

    Abstract: Robots can learn the right reward function by querying a human expert. Existing approaches attempt to choose questions where the robot is most uncertain about the human's response; however, they do not consider how easy it will be for the human to answer! In this paper we explore an information gain formulation for optimally selecting questions that naturally account for the human's ability to ans… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: Proceedings of the 3rd Conference on Robot Learning (CoRL), October 2019

  4. arXiv:1906.08928  [pdf, other

    cs.RO cs.AI

    Learning Reward Functions by Integrating Human Demonstrations and Preferences

    Authors: Malayandi Palan, Nicholas C. Landolfi, Gleb Shevchuk, Dorsa Sadigh

    Abstract: Our goal is to accurately and efficiently learn reward functions for autonomous robots. Current approaches to this problem include inverse reinforcement learning (IRL), which uses expert demonstrations, and preference-based learning, which iteratively queries the user for her preferences between trajectories. In robotics however, IRL often struggles because it is difficult to get high-quality demo… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Presented at RSS 2019