Skip to main content

Showing 1–1 of 1 results for author: Gundotra, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:1906.09624  [pdf, other

    cs.LG cs.AI stat.ML

    On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference

    Authors: Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan

    Abstract: Our goal is for agents to optimize the right reward function, despite how difficult it is for us to specify what that is. Inverse Reinforcement Learning (IRL) enables us to infer reward functions from demonstrations, but it usually assumes that the expert is noisily optimal. Real people, on the other hand, often have systematic biases: risk-aversion, myopia, etc. One option is to try to characteri… ▽ More

    Submitted 23 June, 2019; originally announced June 2019.

    Comments: Published at ICML 2019