Showing 1–3 of 3 results for author: Zhang, J O

Searching in archive cs.
  1. arXiv:1912.13503

    cs.LG cs.CV cs.NE cs.RO

    Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks

    Authors: Jeffrey O Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik

    Abstract: When training a neural network for a desired task, one may prefer to adapt a pre-trained network rather than starting from randomly initialized weights. Adaptation can be useful in cases when training data is scarce, when a single learner needs to perform multiple tasks, or when one wishes to encode priors in the network. The most commonly employed approaches for network adaptation are fine-tuning…

    Submitted 30 July, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

    Comments: In ECCV 2020 (Spotlight). For more, see project website and code at http://sidetuning.berkeley.edu

  2. arXiv:1912.11121

    cs.CV cs.LG cs.NE cs.RO

    Learning to Navigate Using Mid-Level Visual Priors

    Authors: Alexander Sax, Jeffrey O. Zhang, Bradley Emi, Amir Zamir, Silvio Savarese, Leonidas Guibas, Jitendra Malik

    Abstract: How much does having visual priors about the world (e.g. the fact that the world is 3D) assist in learning to perform downstream motor tasks (e.g. navigating a complex environment)? What are the consequences of not utilizing such visual priors in learning? We study these questions by integrating a generic perceptual skill set (a distance estimator, an edge detector, etc.) within a reinforcement le…

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: In Conference on Robot Learning, 2019. See project website and demos at http://perceptual.actor/

  3. arXiv:1811.03555

    cs.AI

    Modular Architecture for StarCraft II with Deep Reinforcement Learning

    Authors: Dennis Lee, Haoran Tang, Jeffrey O Zhang, Huazhe Xu, Trevor Darrell, Pieter Abbeel

    Abstract: We present a novel modular architecture for StarCraft II AI. The architecture splits responsibilities between multiple modules that each control one aspect of the game, such as build-order selection or tactics. A centralized scheduler reviews macros suggested by all modules and decides their order of execution. An updater keeps track of environment changes and instantiates macros into series of ex…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted to The 14th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE'18)