Skip to main content

Showing 1–7 of 7 results for author: Hadiji, H

Searching in archive math. Search in all archives.
.
  1. arXiv:2406.14059  [pdf, other

    cs.GT cs.LG math.OC stat.ML

    Tracking solutions of time-varying variational inequalities

    Authors: Hédi Hadiji, Sarah Sachs, Cristóbal Guzmán

    Abstract: Tracking the solution of time-varying variational inequalities is an important problem with applications in game theory, optimization, and machine learning. Existing work considers time-varying games or time-varying optimization problems. For strongly convex optimization problems or strongly monotone games, these results provide tracking guarantees under the assumption that the variation of the ti… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2304.12768  [pdf, ps, other

    cs.GT math.OC stat.ML

    Towards Characterizing the First-order Query Complexity of Learning (Approximate) Nash Equilibria in Zero-sum Matrix Games

    Authors: Hédi Hadiji, Sarah Sachs, Tim van Erven, Wouter M. Koolen

    Abstract: In the first-order query model for zero-sum $K\times K$ matrix games, players observe the expected pay-offs for all their possible actions under the randomized action played by their opponent. This classical model has received renewed interest after the discovery by Rakhlin and Sridharan that $ε$-approximate Nash equilibria can be computed efficiently from $O(\frac{\ln K}ε)$ instead of… ▽ More

    Submitted 2 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  3. arXiv:2303.03272  [pdf, ps, other

    cs.LG math.OC stat.ML

    Accelerated Rates between Stochastic and Adversarial Online Convex Optimization

    Authors: Sarah Sachs, Hedi Hadiji, Tim van Erven, Cristobal Guzman

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Extended version of 'Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness' by the same authors. arXiv admin note: text overlap with arXiv:2202.07554

  4. arXiv:2202.07554  [pdf, ps, other

    cs.LG math.OC stat.ML

    Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness

    Authors: Sarah Sachs, Hédi Hadiji, Tim van Erven, Cristóbal Guzmán

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i… ▽ More

    Submitted 8 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  5. arXiv:2006.03378  [pdf, other

    math.ST stat.ML

    Adaptation to the Range in $K$-Armed Bandits

    Authors: Hédi Hadiji, Gilles Stoltz

    Abstract: We consider stochastic bandit problems with $K$ arms, each associated with a bounded distribution supported on the range $[m,M]$. We do not assume that the range $[m,M]$ is known and show that there is a cost for learning this range. Indeed, a new trade-off between distribution-dependent and distribution-free regret bounds arises, which prevents from simultaneously achieving the typical $\ln T$ an… ▽ More

    Submitted 15 June, 2022; v1 submitted 5 June, 2020; originally announced June 2020.

  6. arXiv:1905.10221  [pdf, other

    stat.ML cs.LG math.ST

    Polynomial Cost of Adaptation for X -Armed Bandits

    Authors: Hédi Hadiji

    Abstract: In the context of stochastic continuum-armed bandits, we present an algorithm that adapts to the unknown smoothness of the objective function. We exhibit and compute a polynomial cost of adaptation to the H{ö}lder regularity for regret minimization. To do this, we first reconsider the recent lower bound of Locatelli and Carpentier [20], and define and characterize admissible rate functions. Our ne… ▽ More

    Submitted 9 December, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: Thirty-third Conference on Neural Information Processing Systems, Dec 2019, Vancouver, France

  7. arXiv:1805.05071  [pdf, other

    stat.ML cs.LG math.ST

    KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints

    Authors: Aurélien Garivier, Hédi Hadiji, Pierre Menard, Gilles Stoltz

    Abstract: We consider $K$-armed stochastic bandits and consider cumulative regret bounds up to time $T$. We are interested in strategies achieving simultaneously a distribution-free regret bound of optimal order $\sqrt{KT}$ and a distribution-dependent regret that is asymptotically optimal, that is, matching the $κ\ln T$ lower bound by Lai and Robbins (1985) and Burnetas and Katehakis (1996), where $κ$ is t… ▽ More

    Submitted 1 July, 2022; v1 submitted 14 May, 2018; originally announced May 2018.