Skip to main content

Showing 1–6 of 6 results for author: Shilova, A

.
  1. arXiv:2506.13862  [pdf, ps, other

    cs.LG cs.AI

    StaQ it! Growing neural networks for Policy Mirror Descent

    Authors: Alena Shilova, Alex Davey, Brahim Driss, Riad Akrour

    Abstract: In Reinforcement Learning (RL), regularization has emerged as a popular tool both in theory and practice, typically based either on an entropy bonus or a Kullback-Leibler divergence that constrains successive policies. In practice, these approaches have been shown to improve exploration, robustness and stability, giving rise to popular Deep RL algorithms such as SAC and TRPO. Policy Mirror Descent… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 44 pages, 12 figures

  2. arXiv:2306.10882  [pdf, other

    cs.LG stat.ML

    AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents

    Authors: Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Medeiros Centa, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

    Abstract: Recently, the scientific community has questioned the statistical reproducibility of many empirical results, especially in the field of machine learning. To contribute to the resolution of this reproducibility crisis, we propose a theoretically sound methodology for comparing the performance of a set of algorithms. We exemplify our methodology in Deep Reinforcement Learning (Deep RL). The performa… ▽ More

    Submitted 12 December, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Journal ref: TMLR 2024

  3. arXiv:2210.08503  [pdf, other

    cs.LG

    Entropy Regularized Reinforcement Learning with Cascading Networks

    Authors: Riccardo Della Vecchia, Alena Shilova, Philippe Preux, Riad Akrour

    Abstract: Deep Reinforcement Learning (Deep RL) has had incredible achievements on high dimensional problems, yet its learning process remains unstable even on the simplest tasks. Deep RL uses neural networks as function approximators. These neural models are largely inspired by developments in the (un)supervised machine learning community. Compared to these learning frameworks, one of the major difficultie… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  4. arXiv:2202.10435  [pdf, ps, other

    cs.LG cs.AI

    Survey on Large Scale Neural Network Training

    Authors: Julia Gusak, Daria Cherniuk, Alena Shilova, Alexander Katrutsa, Daniel Bershatsky, Xunyi Zhao, Lionel Eyraud-Dubois, Oleg Shlyazhko, Denis Dimitrov, Ivan Oseledets, Olivier Beaumont

    Abstract: Modern Deep Neural Networks (DNNs) require significant memory to store weight, activations, and other intermediate tensors during training. Hence, many models do not fit one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNNs training. We analyze techniques that save memory and make good us… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  5. arXiv:1911.13214  [pdf, other

    cs.LG cs.DC cs.NE

    Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory

    Authors: Julien Herrmann, Olivier Beaumont, Lionel Eyraud-Dubois, Julien Hermann, Alexis Joly, Alena Shilova

    Abstract: This paper introduces a new activation checkpointing method which allows to significantly decrease memory usage when training Deep Neural Networks with the back-propagation algorithm. Similarly to checkpoint-ing techniques coming from the literature on Automatic Differentiation, it consists in dynamically selecting the forward activations that are saved during the training phase, and then automati… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

  6. arXiv:1903.03051  [pdf, other

    cs.DC

    Training on the Edge: The why and the how

    Authors: Navjot Kukreja, Alena Shilova, Olivier Beaumont, Jan Huckelheim, Nicola Ferrier, Paul Hovland, Gerard Gorman

    Abstract: Edge computing is the natural progression from Cloud computing, where, instead of collecting all data and processing it centrally, like in a cloud computing environment, we distribute the computing power and try to do as much processing as possible, close to the source of the data. There are various reasons this model is being adopted quickly, including privacy, and reduced power and bandwidth req… ▽ More

    Submitted 13 February, 2019; originally announced March 2019.

    Comments: Submitted to PAISE 2019