Showing 1–12 of 12 results for author: Hadiji, H

  1. arXiv:2502.17175  [pdf, other]

    stat.ML cs.LG

    Linear Bandits on Ellipsoids: Minimax Optimal Algorithms

    Authors: Raymond Zhang, Hedi Hadiji, Richard Combes

    Abstract: We consider linear stochastic bandits where the set of actions is an ellipsoid. We provide the first known minimax optimal algorithm for this problem. We first derive a novel information-theoretic lower bound on the regret of any algorithm, which must be at least $Ω(\min(d σ\sqrt{T} + d \|θ\|_{A}, \|θ\|_{A} T))$ where $d$ is the dimension, $T$ the time horizon, $σ^2$ the noise variance, $A$ a matr…

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: 20 pages, 3 figures

  2. arXiv:2410.02400  [pdf, ps, other]

    cs.LG

    An Online Feasible Point Method for Benign Generalized Nash Equilibrium Problems

    Authors: Sarah Sachs, Hedi Hadiji, Tim van Erven, Mathias Staudigl

    Abstract: We consider a repeatedly played generalized Nash equilibrium game. This induces a multi-agent online learning problem with joint constraints. An important challenge in this setting is that the feasible set for each agent depends on the simultaneous moves of the other agents and, therefore, varies over time. As a consequence, the agents face time-varying constraints, which are not adversarial but r…

    Submitted 3 October, 2024; originally announced October 2024.

  3. arXiv:2406.14059  [pdf, other]

    cs.GT cs.LG math.OC stat.ML

    Tracking solutions of time-varying variational inequalities

    Authors: Hédi Hadiji, Sarah Sachs, Cristóbal Guzmán

    Abstract: Tracking the solution of time-varying variational inequalities is an important problem with applications in game theory, optimization, and machine learning. Existing work considers time-varying games or time-varying optimization problems. For strongly convex optimization problems or strongly monotone games, these results provide tracking guarantees under the assumption that the variation of the ti…

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2304.12768  [pdf, ps, other]

    cs.GT math.OC stat.ML

    Towards Characterizing the First-order Query Complexity of Learning (Approximate) Nash Equilibria in Zero-sum Matrix Games

    Authors: Hédi Hadiji, Sarah Sachs, Tim van Erven, Wouter M. Koolen

    Abstract: In the first-order query model for zero-sum $K\times K$ matrix games, players observe the expected pay-offs for all their possible actions under the randomized action played by their opponent. This classical model has received renewed interest after the discovery by Rakhlin and Sridharan that $ε$-approximate Nash equilibria can be computed efficiently from $O(\frac{\ln K}{ε})$ instead of…

    Submitted 2 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  5. arXiv:2303.03272  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Accelerated Rates between Stochastic and Adversarial Online Convex Optimization

    Authors: Sarah Sachs, Hedi Hadiji, Tim van Erven, Cristobal Guzman

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Extended version of 'Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness' by the same authors. arXiv admin note: text overlap with arXiv:2202.07554

  6. arXiv:2202.07554  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness

    Authors: Sarah Sachs, Hédi Hadiji, Tim van Erven, Cristóbal Guzmán

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 8 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  7. arXiv:2202.05630  [pdf, ps, other]

    cs.LG

    Scale-free Unconstrained Online Learning for Curved Losses

    Authors: Jack J. Mayo, Hédi Hadiji, Tim van Erven

    Abstract: A sequence of works in unconstrained online convex optimisation have investigated the possibility of adapting simultaneously to the norm $U$ of the comparator and the maximum norm $G$ of the gradients. In full generality, matching upper and lower bounds are known which show that this comes at the unavoidable cost of an additive $G U^3$, which is not needed when either $G$ or $U$ is known in advanc…

    Submitted 15 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: 34 pages

  8. arXiv:2102.07521  [pdf, ps, other]

    cs.LG stat.ML

    Distributed Online Learning for Joint Regret with Communication Constraints

    Authors: Dirk van der Hoeven, Hédi Hadiji, Tim van Erven

    Abstract: We consider distributed online learning for joint regret with communication constraints. In this setting, there are multiple agents that are connected in a graph. Each round, an adversary first activates one of the agents to issue a prediction and provides a corresponding gradient, and then the agents are allowed to send a $b$-bit message to their neighbors in the graph. All agents cooperate to co…

    Submitted 25 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  9. arXiv:2010.01874  [pdf, other]

    stat.ML cs.LG

    Diversity-Preserving K-Armed Bandits, Revisited

    Authors: Hédi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz

    Abstract: We consider the bandit-based framework for diversity-preserving recommendations introduced by Celis et al. (2019), who approached it in the case of a polytope mainly by a reduction to the setting of linear bandits. We design a UCB algorithm using the specific structure of the setting and show that it enjoys a bounded distribution-dependent regret in the natural cases when the optimal mixed actions…

    Submitted 24 July, 2024; v1 submitted 5 October, 2020; originally announced October 2020.

    Journal ref: Transactions on Machine Learning Research Journal, 2024, July

  10. arXiv:2006.03378  [pdf, other]

    math.ST stat.ML

    Adaptation to the Range in $K$-Armed Bandits

    Authors: Hédi Hadiji, Gilles Stoltz

    Abstract: We consider stochastic bandit problems with $K$ arms, each associated with a bounded distribution supported on the range $[m,M]$. We do not assume that the range $[m,M]$ is known and show that there is a cost for learning this range. Indeed, a new trade-off between distribution-dependent and distribution-free regret bounds arises, which prevents simultaneously achieving the typical $\ln T$ an…

    Submitted 15 June, 2022; v1 submitted 5 June, 2020; originally announced June 2020.

  11. arXiv:1905.10221  [pdf, other]

    stat.ML cs.LG math.ST

    Polynomial Cost of Adaptation for X-Armed Bandits

    Authors: Hédi Hadiji

    Abstract: In the context of stochastic continuum-armed bandits, we present an algorithm that adapts to the unknown smoothness of the objective function. We exhibit and compute a polynomial cost of adaptation to the Hölder regularity for regret minimization. To do this, we first reconsider the recent lower bound of Locatelli and Carpentier [20], and define and characterize admissible rate functions. Our ne…

    Submitted 9 December, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: Thirty-third Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada

  12. arXiv:1805.05071  [pdf, other]

    stat.ML cs.LG math.ST

    KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints

    Authors: Aurélien Garivier, Hédi Hadiji, Pierre Menard, Gilles Stoltz

    Abstract: We consider $K$-armed stochastic bandits and consider cumulative regret bounds up to time $T$. We are interested in strategies achieving simultaneously a distribution-free regret bound of optimal order $\sqrt{KT}$ and a distribution-dependent regret that is asymptotically optimal, that is, matching the $κ\ln T$ lower bound by Lai and Robbins (1985) and Burnetas and Katehakis (1996), where $κ$ is t…

    Submitted 1 July, 2022; v1 submitted 14 May, 2018; originally announced May 2018.