Skip to main content

Showing 1–4 of 4 results for author: Watkins-Valls, D

.
  1. arXiv:2204.07123  [pdf, other

    cs.AI

    Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

    Authors: Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries, Alexandra Souly, Chan Jun Shern, Daniel del Castillo, Tom Lieberum

    Abstract: We held the first-ever MineRL Benchmark for Agents that Solve Almost-Lifelike Tasks (MineRL BASALT) Competition at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021). The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks. Rather than mandating the use of LfHF techniques,… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to the PMLR NeurIPS 2021 Demo & Competition Track volume

  2. arXiv:2112.03482  [pdf, other

    cs.LG cs.AI cs.HC

    Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft

    Authors: Vinicius G. Goecks, Nicholas Waytowich, David Watkins-Valls, Bharat Prakash

    Abstract: Real-world tasks of interest are generally poorly defined by human-readable descriptions and have no pre-defined reward signals unless it is defined by a human designer. Conversely, data-driven algorithms are often designed to solve a specific, narrowly defined, task with performance metrics that drives the agent's learning. In this work, we present the solution that won first place and was awarde… ▽ More

    Submitted 11 May, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Submitted to the AAAI 2022 Spring Symposium on Machine Learning and Knowledge Engineering for Hybrid Intelligence (AAAI-MAKE 2022)

    ACM Class: I.2.1; I.2.6; I.2.10; I.2.0

  3. arXiv:1910.00682  [pdf, other

    cs.RO

    Accelerated Robot Learning via Human Brain Signals

    Authors: Iretiayo Akinola, Zizhao Wang, Junyao Shi, Xiaomin He, Pawan Lapborisuth, Jingxi Xu, David Watkins-Valls, Paul Sajda, Peter Allen

    Abstract: In reinforcement learning (RL), sparse rewards are a natural way to specify the task to be learned. However, most RL algorithms struggle to learn in this setting since the learning signal is mostly zeros. In contrast, humans are good at assessing and predicting the future consequences of actions and can serve as good reward/policy shapers to accelerate the robot learning process. Previous works ha… ▽ More

    Submitted 11 August, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 2020 IEEE International Conference on Robotics and Automation - ICRA 2020

  4. arXiv:1909.09295  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Learning Your Way Without Map or Compass: Panoramic Target Driven Visual Navigation

    Authors: David Watkins-Valls, Jingxi Xu, Nicholas Waytowich, Peter Allen

    Abstract: We present a robot navigation system that uses an imitation learning framework to successfully navigate in complex environments. Our framework takes a pre-built 3D scan of a real environment and trains an agent from pre-generated expert trajectories to navigate to any position given a panoramic view of the goal and the current visual input without relying on map, compass, odometry, or relative pos… ▽ More

    Submitted 25 September, 2020; v1 submitted 19 September, 2019; originally announced September 2019.