Showing 1–9 of 9 results for author: François-Lavet, V

Searching in archive stat.
  1. arXiv:2207.08457

    cs.LG cs.AI stat.ME

    A Meta-Reinforcement Learning Algorithm for Causal Discovery

    Authors: Andreas Sauter, Erman Acar, Vincent François-Lavet

    Abstract: Causal discovery is a major task with the utmost importance for machine learning since causal structures can enable models to go beyond pure correlation-based inference and significantly boost their performance. However, finding causal structures from data poses a significant challenge both in computational effort and accuracy, let alone its impossibility without interventions in general. In this…

    Submitted 21 February, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Camera-ready version for CLEAR23

  2. arXiv:2009.13579

    cs.LG stat.ML

    Novelty Search in Representational Space for Sample Efficient Exploration

    Authors: Ruo Yu Tao, Vincent François-Lavet, Joelle Pineau

    Abstract: We present a new approach for efficient exploration which leverages a low-dimensional encoding of the environment learned with a combination of model-based and model-free objectives. Our approach uses intrinsic rewards that are based on the distance of nearest neighbors in the low dimensional representational space to gauge novelty. We then leverage these intrinsic rewards for sample-efficient exp…

    Submitted 15 April, 2022; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 10 pages + references + appendix. Oral presentation at NeurIPS 2020

  3. arXiv:2003.01181

    cs.LG cs.CV stat.ML

    RandomNet: Towards Fully Automatic Neural Architecture Design for Multimodal Learning

    Authors: Stefano Alletto, Shenyang Huang, Vincent Francois-Lavet, Yohei Nakata, Guillaume Rabusseau

    Abstract: Almost all neural architecture search methods are evaluated in terms of performance (i.e. test accuracy) of the model structures that it finds. Should it be the only metric for a good autoML approach? To examine aspects beyond performance, we propose a set of criteria aimed at evaluating the core of autoML problem: the amount of human intervention required to deploy these methods into real world s…

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 1 figure

  4. arXiv:1909.06686

    cs.LG stat.ML

    Neural Architecture Search for Class-incremental Learning

    Authors: Shenyang Huang, Vincent François-Lavet, Guillaume Rabusseau

    Abstract: In class-incremental learning, a model learns continuously from a sequential data stream in which new classes occur. Existing methods often rely on static architectures that are manually crafted. These methods can be prone to capacity saturation because a neural network's ability to generalize to new concepts is limited by its fixed capacity. To understand how to expand a continual learner, we foc…

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: 8 pages, 10 figures

  5. arXiv:1811.12560

    cs.LG cs.AI stat.ML

    An Introduction to Deep Reinforcement Learning

    Authors: Vincent Francois-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare, Joelle Pineau

    Abstract: Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. This field of research has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. Thus, deep RL opens up many new applications in domains such as healthcare, robotics, smart grids, finance, and many more. This manuscript provides an introductio…

    Submitted 3 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

    Journal ref: Foundations and Trends in Machine Learning: Vol. 11, No. 3-4, 2018

  6. arXiv:1809.04506

    cs.LG cs.AI stat.ML

    Combined Reinforcement Learning via Abstract Representations

    Authors: Vincent François-Lavet, Yoshua Bengio, Doina Precup, Joelle Pineau

    Abstract: In the quest for efficient and robust reinforcement learning methods, both model-free and model-based approaches offer advantages. In this paper we propose a new way of explicitly bridging both approaches via a shared low-dimensional learned encoding of the environment, meant to capture summarizing abstractions. We show that the modularity brought by this approach leads to good generalization whil…

    Submitted 18 November, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Accepted to the Thirty-Third AAAI Conference On Artificial Intelligence, 2019

  7. arXiv:1805.03359

    cs.LG cs.AI stat.ML

    Reward Estimation for Variance Reduction in Deep Reinforcement Learning

    Authors: Joshua Romoff, Peter Henderson, Alexandre Piché, Vincent Francois-Lavet, Joelle Pineau

    Abstract: Reinforcement Learning (RL) agents require the specification of a reward signal for learning behaviours. However, introduction of corrupt or stochastic rewards can yield high variance in learning. Such corruption may be a direct result of goal misspecification, randomness in the reward signal, or correlation of the reward with external factors that are not known to the agent. Corruption or stochas…

    Submitted 7 November, 2018; v1 submitted 8 May, 2018; originally announced May 2018.

    Comments: Version 1 as appears in the International Conference on Learning Representations (ICLR) 2018 Workshop Track; Version 2 as appears in the Proceedings of The 2nd Conference on Robot Learning

  8. arXiv:1709.07796

    stat.ML cs.AI cs.LG

    On overfitting and asymptotic bias in batch reinforcement learning with partial observability

    Authors: Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau

    Abstract: This paper provides an analysis of the tradeoff between asymptotic bias (suboptimality with unlimited data) and overfitting (additional suboptimality due to limited data) in the context of reinforcement learning with partial observability. Our theoretical analysis formally characterizes that while potentially increasing the asymptotic bias, a smaller state representation decreases the risk of over…

    Submitted 6 February, 2019; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: Accepted at the Journal of Artificial Intelligence Research (JAIR) - 31 pages

  9. arXiv:1406.7865

    stat.ML cs.CE cs.LG

    Simple connectome inference from partial correlation statistics in calcium imaging

    Authors: Antonio Sutera, Arnaud Joly, Vincent François-Lavet, Zixiao Aaron Qiu, Gilles Louppe, Damien Ernst, Pierre Geurts

    Abstract: In this work, we propose a simple yet effective solution to the problem of connectome inference in calcium imaging data. The proposed algorithm consists of two steps. First, processing the raw signals to detect neural peak activities. Second, inferring the degree of association between neurons from partial correlation statistics. This paper summarises the methodology that led us to win the Connect…

    Submitted 18 November, 2014; v1 submitted 30 June, 2014; originally announced June 2014.

    Journal ref: JMLR: Workshop and Conference Proceedings 46:23-35, 2015