Showing 1–10 of 10 results for author: Levine, N

Searching in archive stat.
  1. arXiv:2106.12772  [pdf, other]

    cs.LG stat.ML

    Task-agnostic Continual Learning with Hybrid Probabilistic Models

    Authors: Polina Kirichenko, Mehrdad Farajtabar, Dushyant Rao, Balaji Lakshminarayanan, Nir Levine, Ang Li, Huiyi Hu, Andrew Gordon Wilson, Razvan Pascanu

    Abstract: Learning new tasks continuously without forgetting on a constantly changing data distribution is essential for real-world problems but extremely challenging for modern deep learning. In this work we propose HCL, a Hybrid generative-discriminative approach to Continual Learning for classification. We model the distribution of each task and each class with a normalizing flow. The flow is used to lea…

    Submitted 24 June, 2021; originally announced June 2021.

  2. arXiv:2010.06324  [pdf, other]

    cs.LG cs.AI stat.ML

    Balancing Constraints and Rewards with Meta-Gradient D4PG

    Authors: Dan A. Calian, Daniel J. Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy Mann

    Abstract: Deploying Reinforcement Learning (RL) agents to solve real-world applications often requires satisfying complex system constraints. Often the constraint thresholds are incorrectly set due to the complex nature of a system or the inability to verify the thresholds offline (e.g., no simulator or reasonable offline evaluation procedure exists). This results in solutions where a task cannot be solved w…

    Submitted 27 November, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

  3. arXiv:2006.10974  [pdf, ps, other]

    cs.LG stat.ML

    Optimization and Generalization of Regularization-Based Continual Learning: a Loss Approximation Viewpoint

    Authors: Dong Yin, Mehrdad Farajtabar, Ang Li, Nir Levine, Alex Mott

    Abstract: Neural networks have achieved remarkable success in many cognitive tasks. However, when they are trained sequentially on multiple tasks without access to old data, their performance on early tasks tends to drop significantly. This problem is often referred to as catastrophic forgetting, a key challenge in continual learning of neural networks. The regularization-based approach is one of the primary…

    Submitted 8 February, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Preliminary version with a different title presented at ICML Workshop on Continual Learning, 2020 (spotlight)

  4. arXiv:1909.01506  [pdf, other]

    cs.LG stat.ML

    Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control

    Authors: Nir Levine, Yinlam Chow, Rui Shu, Ang Li, Mohammad Ghavamzadeh, Hung Bui

    Abstract: Many real-world sequential decision-making problems can be formulated as optimal control with high-dimensional observations and unknown dynamics. A promising approach is to embed the high-dimensional observations into a lower-dimensional latent representation space, estimate the latent dynamics model, then utilize this model for control in the latent space. An important open question is how to lea…

    Submitted 10 February, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

  5. arXiv:1906.07516  [pdf, other]

    cs.LG cs.AI stat.ML

    Robust Reinforcement Learning for Continuous Control with Model Misspecification

    Authors: Daniel J. Mankowitz, Nir Levine, Rae Jeong, Yuanyuan Shi, Jackie Kay, Abbas Abdolmaleki, Jost Tobias Springenberg, Timothy Mann, Todd Hester, Martin Riedmiller

    Abstract: We provide a framework for incorporating robustness -- to perturbations in the transition dynamics which we refer to as model misspecification -- into continuous control Reinforcement Learning (RL) algorithms. We specifically focus on incorporating robustness into a state-of-the-art continuous control RL algorithm called Maximum a-posteriori Policy Optimization (MPO). We achieve this by learning a…

    Submitted 11 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

  6. arXiv:1902.03393  [pdf, other]

    cs.LG cs.AI stat.ML

    Improved Knowledge Distillation via Teacher Assistant

    Authors: Seyed-Iman Mirzadeh, Mehrdad Farajtabar, Ang Li, Nir Levine, Akihiro Matsukawa, Hassan Ghasemzadeh

    Abstract: Despite the fact that deep neural networks are powerful models and achieve appealing results on many tasks, they are too large to be deployed on edge devices like smartphones or embedded sensor nodes. There have been efforts to compress these networks, and a popular method is knowledge distillation, where a large (teacher) pre-trained network is used to train a smaller (student) network. However,…

    Submitted 16 December, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: AAAI 2020

  7. arXiv:1705.07461  [pdf, other]

    cs.AI cs.LG stat.ML

    Shallow Updates for Deep Reinforcement Learning

    Authors: Nir Levine, Tom Zahavy, Daniel J. Mankowitz, Aviv Tamar, Shie Mannor

    Abstract: Deep reinforcement learning (DRL) methods such as the Deep Q-Network (DQN) have achieved state-of-the-art results in a variety of challenging, high-dimensional domains. This success is mainly attributed to the power of deep neural networks to learn rich domain representations for approximating the value function or policy. Batch reinforcement learning methods with linear representations, on the ot…

    Submitted 2 November, 2017; v1 submitted 21 May, 2017; originally announced May 2017.

  8. arXiv:1702.07274  [pdf, other]

    stat.ML cs.LG

    Rotting Bandits

    Authors: Nir Levine, Koby Crammer, Shie Mannor

    Abstract: The Multi-Armed Bandits (MAB) framework highlights the tension between acquiring new knowledge (Exploration) and leveraging available knowledge (Exploitation). In the classical MAB problem, a decision maker must choose an arm at each time step, upon which she receives a reward. The decision maker's objective is to maximize her cumulative expected reward over the time horizon. The MAB problem has b…

    Submitted 2 November, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

  9. arXiv:1504.04114  [pdf, other]

    stat.ML cs.LG cs.SI

    Actively Learning to Attract Followers on Twitter

    Authors: Nir Levine, Timothy A. Mann, Shie Mannor

    Abstract: Twitter, a popular social network, presents great opportunities for on-line machine learning research. However, previous research has focused almost entirely on learning from passively collected data. We study the problem of learning to acquire followers through normative user behavior, as opposed to the mass following policies applied by many bots. We formalize the problem as a contextual bandit…

    Submitted 16 April, 2015; originally announced April 2015.

  10. arXiv:1404.0752  [pdf, ps, other]

    stat.ML

    An Efficient Search Strategy for Aggregation and Discretization of Attributes of Bayesian Networks Using Minimum Description Length

    Authors: Jem Corcoran, Daniel Tran, Nicholas Levine

    Abstract: Bayesian networks are convenient graphical expressions for high dimensional probability distributions representing complex relationships between a large number of random variables. They have been employed extensively in areas such as bioinformatics, artificial intelligence, diagnosis, and risk management. The recovery of the structure of a network from data is of prime importance for the purposes…

    Submitted 2 April, 2014; originally announced April 2014.