Showing 1–2 of 2 results for author: Thune, T S

  1. arXiv:1906.00670  [pdf, other]

    cs.LG stat.ML

    Nonstochastic Multiarmed Bandits with Unrestricted Delays

    Authors: Tobias Sommer Thune, Nicolò Cesa-Bianchi, Yevgeny Seldin

    Abstract: We investigate multiarmed bandits with delayed feedback, where the delays need neither be identical nor bounded. We first prove that "delayed" Exp3 achieves the $O(\sqrt{(KT + D)\ln K})$ regret bound conjectured by Cesa-Bianchi et al. [2019] in the case of variable but bounded delays. Here, $K$ is the number of actions and $D$ is the total delay over $T$ rounds. We then introduce a new algorithm…

    Submitted 19 November, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 9 pages, NeurIPS camera-ready

  2. arXiv:1807.00636  [pdf, other]

    cs.LG stat.ML

    Adaptation to Easy Data in Prediction with Limited Advice

    Authors: Tobias Sommer Thune, Yevgeny Seldin

    Abstract: We derive an online learning algorithm with improved regret guarantees for 'easy' loss sequences. We consider two types of 'easiness': (a) stochastic loss sequences and (b) adversarial loss sequences with small effective range of the losses. While a number of algorithms have been proposed for exploiting small effective range in the full information setting, Gerchinovitz and Lattimore [2016] have s…

    Submitted 27 August, 2019; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: Fixed a mistake in the proof and statement of Theorem 3