Showing 1–8 of 8 results for author: Dadashi, R

  1. arXiv:2205.09589

    cs.LG stat.ML

    Learning Energy Networks with Generalized Fenchel-Young Losses

    Authors: Mathieu Blondel, Felipe Llinares-López, Robert Dadashi, Léonard Hussenot, Matthieu Geist

    Abstract: Energy-based models, a.k.a. energy networks, perform inference by optimizing an energy function, typically parametrized by a neural network. This allows one to capture potentially complex relationships between inputs and outputs. To learn the parameters of the energy function, the solution to that optimization problem is typically fed into a loss function. The key challenge for training energy net…

    Submitted 12 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  2. arXiv:2006.12917

    cs.LG stat.ML

    Show me the Way: Intrinsic Motivation from Demonstrations

    Authors: Léonard Hussenot, Robert Dadashi, Matthieu Geist, Olivier Pietquin

    Abstract: The study of exploration in the domain of decision making has a long history but remains actively debated. From the vast literature that addressed this topic for decades under various points of view (e.g., developmental psychology, experimental design, artificial intelligence), intrinsic motivation emerged as a concept that can practically be transferred to artificial agents. Especially, in the re…

    Submitted 13 January, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: AAMAS 2021

  3. arXiv:2006.04678

    cs.LG stat.ML

    Primal Wasserstein Imitation Learning

    Authors: Robert Dadashi, Léonard Hussenot, Matthieu Geist, Olivier Pietquin

    Abstract: Imitation Learning (IL) methods seek to match the behavior of an agent with that of an expert. In the present work, we propose a new IL method based on a conceptually simple algorithm: Primal Wasserstein Imitation Learning (PWIL), which ties to the primal form of the Wasserstein distance between the expert and the agent state-action distributions. We present a reward function which is derived offl…

    Submitted 17 March, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Published in International Conference on Learning Representations (ICLR 2021)

  4. arXiv:2006.02243

    cs.LG stat.ML

    The Value-Improvement Path: Towards Better Representations for Reinforcement Learning

    Authors: Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver

    Abstract: In value-based reinforcement learning (RL), unlike in supervised learning, the agent faces not a single, stationary, approximation problem, but a sequence of value prediction problems. Each time the policy improves, the nature of the problem changes, shifting both the distribution of states and their values. In this paper we take a novel perspective, arguing that the value prediction problems face…

    Submitted 4 January, 2021; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: AAAI-21

  5. arXiv:1902.08102

    stat.ML cs.LG

    Statistics and Samples in Distributional Reinforcement Learning

    Authors: Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney

    Abstract: We present a unifying framework for designing and analysing distributional reinforcement learning (DRL) algorithms in terms of recursively estimating statistics of the return distribution. Our key insight is that DRL algorithms can be decomposed as the combination of some statistical estimator and a method for imputing a return distribution consistent with that set of statistics. With this new und…

    Submitted 21 February, 2019; originally announced February 2019.

  6. arXiv:1901.11530

    cs.LG cs.AI stat.ML

    A Geometric Perspective on Optimal Representations for Reinforcement Learning

    Authors: Marc G. Bellemare, Will Dabney, Robert Dadashi, Adrien Ali Taiga, Pablo Samuel Castro, Nicolas Le Roux, Dale Schuurmans, Tor Lattimore, Clare Lyle

    Abstract: We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value functions as auxiliary tasks. Our formulation considers adapting the representation to minimize the (linear) approximation of the value function of all stationary po…

    Submitted 25 June, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

  7. arXiv:1901.11524

    cs.LG cs.AI stat.ML

    The Value Function Polytope in Reinforcement Learning

    Authors: Robert Dadashi, Adrien Ali Taïga, Nicolas Le Roux, Dale Schuurmans, Marc G. Bellemare

    Abstract: We establish geometric and topological properties of the space of value functions in finite state-action Markov decision processes. Our main contribution is the characterization of the nature of its shape: a general polytope (Aigner et al., 2010). To demonstrate this result, we exhibit several properties of the structural relationship between policies and value functions including the line theorem…

    Submitted 15 May, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

  8. arXiv:1811.04911

    cs.LG stat.ML

    Boosting Model Performance through Differentially Private Model Aggregation

    Authors: Sophia Collet, Robert Dadashi, Zahi N. Karam, Chang Liu, Parinaz Sobhani, Yevgeniy Vahlis, Ji Chao Zhang

    Abstract: A key factor in developing high performing machine learning models is the availability of sufficiently large datasets. This work is motivated by applications arising in Software as a Service (SaaS) companies where there exist numerous similar yet disjoint datasets from multiple client companies. To overcome the challenges of insufficient data without explicitly aggregating the clients' datasets du…

    Submitted 4 December, 2018; v1 submitted 12 November, 2018; originally announced November 2018.