Skip to main content

Showing 1–1 of 1 results for author: Centa, M M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.10882  [pdf, other

    cs.LG stat.ML

    AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents

    Authors: Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Medeiros Centa, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

    Abstract: Recently, the scientific community has questioned the statistical reproducibility of many empirical results, especially in the field of machine learning. To contribute to the resolution of this reproducibility crisis, we propose a theoretically sound methodology for comparing the performance of a set of algorithms. We exemplify our methodology in Deep Reinforcement Learning (Deep RL). The performa… ▽ More

    Submitted 12 December, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Journal ref: TMLR 2024