Skip to main content

Showing 1–14 of 14 results for author: Durand, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.24019  [pdf, other

    stat.ML cs.LG stat.AP

    AutoML Algorithms for Online Generalized Additive Model Selection: Application to Electricity Demand Forecasting

    Authors: Keshav Das, Julie Keisler, Margaux Brégère, Amaury Durand

    Abstract: Electricity demand forecasting is key to ensuring that supply meets demand lest the grid would blackout. Reliable short-term forecasts may be obtained by combining a Generalized Additive Models (GAM) with a State-Space model (Obst et al., 2021), leading to an adaptive (or online) model. A GAM is an over-parameterized linear model defined by a formula and a state-space model involves hyperparameter… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: 13 pages, 1 figure

  2. arXiv:2501.03999  [pdf, other

    cs.LG stat.ML

    Adaptive Experiments Under High-Dimensional and Data Sparse Settings: Applications for Educational Platforms

    Authors: Haochen Song, Ilya Musabirov, Ananya Bhattacharjee, Audrey Durand, Meredith Franklin, Anna Rafferty, Joseph Jay Williams

    Abstract: In online educational platforms, adaptive experiment designs play a critical role in personalizing learning pathways, instructional sequencing, and content recommendations. Traditional adaptive policies, such as Thompson Sampling, struggle with scalability in high-dimensional and sparse settings such as when there are large amount of treatments (arms) and limited resources such as funding and time… ▽ More

    Submitted 24 February, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

  3. arXiv:2112.08507  [pdf, other

    cs.LG stat.ML

    Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization

    Authors: Tong Li, Jacob Nogas, Haochen Song, Harsh Kumar, Audrey Durand, Anna Rafferty, Nina Deliu, Sofia S. Villar, Joseph J. Williams

    Abstract: Multi-armed bandit algorithms like Thompson Sampling (TS) can be used to conduct adaptive experiments, in which maximizing reward means that data is used to progressively assign participants to more effective arms. Such assignment strategies increase the risk of statistical hypothesis tests identifying a difference between arms when there is not one, and failing to conclude there is a difference i… ▽ More

    Submitted 23 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

  4. arXiv:2103.12198  [pdf

    cs.LG stat.AP

    Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments

    Authors: Joseph Jay Williams, Jacob Nogas, Nina Deliu, Hammad Shaikh, Sofia S. Villar, Audrey Durand, Anna Rafferty

    Abstract: Multi-armed bandit algorithms have been argued for decades as useful for adaptively randomized experiments. In such experiments, an algorithm varies which arms (e.g. alternative interventions to help students learn) are assigned to participants, with the goal of assigning higher-reward arms to as many participants as possible. We applied the bandit algorithm Thompson Sampling (TS) to run adaptive… ▽ More

    Submitted 26 March, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  5. arXiv:2007.01516  [pdf, other

    cs.LG q-bio.GN stat.AP stat.ML

    Deep interpretability for GWAS

    Authors: Deepak Sharma, Audrey Durand, Marc-André Legault, Louis-Philippe Lemieux Perreault, Audrey Lemaçon, Marie-Pierre Dubé, Joelle Pineau

    Abstract: Genome-Wide Association Studies are typically conducted using linear models to find genetic variants associated with common diseases. In these studies, association testing is done on a variant-by-variant basis, possibly missing out on non-linear interaction effects between variants. Deep networks can be used to model these interactions, but they are difficult to train and interpret on large geneti… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML 2020 workshop on ML Interpretability for Scientific Discovery

  6. arXiv:1912.01706  [pdf, ps, other

    cs.LG cs.CL stat.ML

    A Robust Self-Learning Method for Fully Unsupervised Cross-Lingual Mappings of Word Embeddings: Making the Method Robustly Reproducible as Well

    Authors: Nicolas Garneau, Mathieu Godbout, David Beauchemin, Audrey Durand, Luc Lamontagne

    Abstract: In this paper, we reproduce the experiments of Artetxe et al. (2018b) regarding the robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. We show that the reproduction of their method is indeed feasible with some minor assumptions. We further investigate the robustness of their model by introducing four new languages that are less similar to English than the… ▽ More

    Submitted 3 March, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Accept in REPROLANG@LREC2020

  7. arXiv:1910.04928  [pdf, other

    cs.LG stat.ML

    Old Dog Learns New Tricks: Randomized UCB for Bandit Problems

    Authors: Sharan Vaswani, Abbas Mehrabian, Audrey Durand, Branislav Kveton

    Abstract: We propose $\tt RandUCB$, a bandit strategy that builds on theoretically derived confidence intervals similar to upper confidence bound (UCB) algorithms, but akin to Thompson sampling (TS), it uses randomization to trade off exploration and exploitation. In the $K$-armed bandit setting, we show that there are infinitely many variants of $\tt RandUCB$, all of which achieve the minimax-optimal… ▽ More

    Submitted 22 March, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: AISTATS 2020

  8. arXiv:1909.07543  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning

    Authors: Thang Doan, Bogdan Mazoure, Moloud Abdar, Audrey Durand, Joelle Pineau, R Devon Hjelm

    Abstract: Continuous control tasks in reinforcement learning are important because they provide an important framework for learning in high-dimensional state spaces with deceptive rewards, where the agent can easily become trapped into suboptimal solutions. One way to avoid local optima is to use a population of agents to ensure coverage of the policy space, yet learning a population with the "best" coverag… ▽ More

    Submitted 9 July, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  9. arXiv:1905.06893  [pdf, other

    cs.LG stat.ML

    Leveraging exploration in off-policy algorithms via normalizing flows

    Authors: Bogdan Mazoure, Thang Doan, Audrey Durand, R Devon Hjelm, Joelle Pineau

    Abstract: The ability to discover approximately optimal policies in domains with sparse rewards is crucial to applying reinforcement learning (RL) in many real-world scenarios. Approaches such as neural density models and continuous exploration (e.g., Go-Explore) have been proposed to maintain the high exploration rate necessary to find high performing and generalizable policies. Soft actor-critic(SAC) is a… ▽ More

    Submitted 24 September, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted to 3rd Conference on Robot Learning (CoRL 2019); Keywords: Exploration, soft actor-critic, normalizing flow, off-policy; maximum entropy, reinforcement learning; deceptive reward; sparse reward; inverse autoregressive flow

  10. arXiv:1811.00429  [pdf, other

    cs.LG stat.ML

    Temporal Regularization in Markov Decision Process

    Authors: Pierre Thodoroff, Audrey Durand, Joelle Pineau, Doina Precup

    Abstract: Several applications of Reinforcement Learning suffer from instability due to high variance. This is especially prevalent in high dimensional domains. Regularization is a commonly used technique in machine learning to reduce variance, at the cost of introducing some bias. Most existing regularization techniques focus on spatial (perceptual) regularization. Yet in reinforcement learning, due to the… ▽ More

    Submitted 10 April, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Published as a conference paper at NIPS 2018

  11. arXiv:1808.00020  [pdf, other

    cs.LG stat.ML

    On-line Adaptative Curriculum Learning for GANs

    Authors: Thang Doan, Joao Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau, R Devon Hjelm

    Abstract: Generative Adversarial Networks (GANs) can successfully approximate a probability distribution and produce realistic samples. However, open questions such as sufficient convergence conditions and mode collapse still persist. In this paper, we build on existing work in the area by proposing a novel framework for training the generator against an ensemble of discriminator networks, which can be seen… ▽ More

    Submitted 11 March, 2019; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: Accepted to the Thirty-Third AAAI Conference On Artificial Intelligence, 2019 (Added 128x128 CelebA samples to the end of the appendix)

    Journal ref: Proceedings of 33rd AAAI Conference on Artificial Intelligence (AAAI 2019)

  12. arXiv:1803.10806  [pdf, other

    cs.CV cs.LG stat.ML

    Learning to Become an Expert: Deep Networks Applied To Super-Resolution Microscopy

    Authors: Louis-Émile Robitaille, Audrey Durand, Marc-André Gardner, Christian Gagné, Paul De Koninck, Flavie Lavoie-Cardinal

    Abstract: With super-resolution optical microscopy, it is now possible to observe molecular interactions in living cells. The obtained images have a very high spatial precision but their overall quality can vary a lot depending on the structure of interest and the imaging parameters. Moreover, evaluating this quality is often difficult for non-expert users. In this work, we tackle the problem of learning th… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: Accepted to the Thirtieth Innovative Applications of Artificial Intelligence Conference (IAAI), 2018

  13. arXiv:1708.00768  [pdf, other

    stat.ML cs.LG

    Streaming kernel regression with provably adaptive mean, variance, and regularization

    Authors: Audrey Durand, Odalric-Ambrym Maillard, Joelle Pineau

    Abstract: We consider the problem of streaming kernel regression, when the observations arrive sequentially and the goal is to recover the underlying mean function, assumed to belong to an RKHS. The variance of the noise is not assumed to be known. In this context, we tackle the problem of tuning the regularization parameter adaptively at each time step, while maintaining tight confidence bounds estimates o… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

  14. arXiv:1701.01095  [pdf, other

    cs.LG stat.ML

    Estimating Quality in Multi-Objective Bandits Optimization

    Authors: Audrey Durand, Christian Gagné

    Abstract: Many real-world applications are characterized by a number of conflicting performance measures. As optimizing in a multi-objective setting leads to a set of non-dominated solutions, a preference function is required for selecting the solution with the appropriate trade-off between the objectives. The question is: how good do estimations of these objectives have to be in order for the solution maxi… ▽ More

    Submitted 20 April, 2017; v1 submitted 4 January, 2017; originally announced January 2017.

    Comments: Submitted to ECML 2017