Showing 1–29 of 29 results for author: van Erven, T

  1. arXiv:2502.06536  [pdf, ps, other]

    stat.ML cs.LG

    Sample-efficient Learning of Concepts with Theoretical Guarantees: from Data to Concepts without Interventions

    Authors: Hidde Fokkema, Tim van Erven, Sara Magliacane

    Abstract: Machine learning is a vital part of many real-world systems, but several concerns remain about the lack of interpretability, explainability and robustness of black-box AI systems. Concept Bottleneck Models (CBM) address some of these challenges by learning interpretable concepts from high-dimensional data, e.g. images, which are used to predict labels. An important issue in CBMs are spurious corre…

    Submitted 3 June, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 55 pages, 20 figures, 12 Tables, Preprint

  2. arXiv:2410.02400  [pdf, ps, other]

    cs.LG

    An Online Feasible Point Method for Benign Generalized Nash Equilibrium Problems

    Authors: Sarah Sachs, Hedi Hadiji, Tim van Erven, Mathias Staudigl

    Abstract: We consider a repeatedly played generalized Nash equilibrium game. This induces a multi-agent online learning problem with joint constraints. An important challenge in this setting is that the feasible set for each agent depends on the simultaneous moves of the other agents and, therefore, varies over time. As a consequence, the agents face time-varying constraints, which are not adversarial but r…

    Submitted 3 October, 2024; originally announced October 2024.

  3. arXiv:2307.02501  [pdf, ps, other]

    stat.ML cs.LG

    Generalization Guarantees via Algorithm-dependent Rademacher Complexity

    Authors: Sarah Sachs, Tim van Erven, Liam Hodgkinson, Rajiv Khanna, Umut Simsekli

    Abstract: Algorithm- and data-dependent generalization bounds are required to explain the generalization behavior of modern machine learning algorithms. In this context, there exist information-theoretic generalization bounds that involve (various forms of) mutual information, as well as bounds based on hypothesis set stability. We propose a conceptually related, but technically distinct complexity measure…

    Submitted 4 July, 2023; originally announced July 2023.

  4. arXiv:2306.00497  [pdf, other]

    cs.LG cs.CY stat.ML

    The Risks of Recourse in Binary Classification

    Authors: Hidde Fokkema, Damien Garreau, Tim van Erven

    Abstract: Algorithmic recourse provides explanations that help users overturn an unfavorable decision by a machine learning system. But so far very little attention has been paid to whether providing recourse is beneficial or not. We introduce an abstract learning-theoretic framework that compares the risks (i.e., expected losses) for classification with and without algorithmic recourse. This allows us to a…

    Submitted 1 March, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 24 pages, 8 figures, 5 tables

  5. arXiv:2305.00832  [pdf, ps, other]

    cs.LG stat.ML

    First- and Second-Order Bounds for Adversarial Linear Contextual Bandits

    Authors: Julia Olkhovskaya, Jack Mayo, Tim van Erven, Gergely Neu, Chen-Yu Wei

    Abstract: We consider the adversarial linear contextual bandit setting, which allows for the loss functions associated with each of $K$ arms to change over time without restriction. Assuming the $d$-dimensional contexts are drawn from a fixed known distribution, the worst-case expected regret over the course of $T$ rounds is known to scale as $\tilde O(\sqrt{Kd T})$. Under the additional assumption that the…

    Submitted 24 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  6. arXiv:2304.12768  [pdf, ps, other]

    cs.GT math.OC stat.ML

    Towards Characterizing the First-order Query Complexity of Learning (Approximate) Nash Equilibria in Zero-sum Matrix Games

    Authors: Hédi Hadiji, Sarah Sachs, Tim van Erven, Wouter M. Koolen

    Abstract: In the first-order query model for zero-sum $K\times K$ matrix games, players observe the expected pay-offs for all their possible actions under the randomized action played by their opponent. This classical model has received renewed interest after the discovery by Rakhlin and Sridharan that $\varepsilon$-approximate Nash equilibria can be computed efficiently from $O(\frac{\ln K}{\varepsilon})$ instead of…

    Submitted 2 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  7. arXiv:2303.03272  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Accelerated Rates between Stochastic and Adversarial Online Convex Optimization

    Authors: Sarah Sachs, Hedi Hadiji, Tim van Erven, Cristobal Guzman

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Extended version of 'Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness' by the same authors. arXiv admin note: text overlap with arXiv:2202.07554

  8. arXiv:2302.08397  [pdf, ps, other]

    stat.ML cs.LG

    Adaptive Selective Sampling for Online Prediction with Experts

    Authors: Rui M. Castro, Fredrik Hellström, Tim van Erven

    Abstract: We consider online prediction of a binary sequence with expert advice. For this setting, we devise label-efficient forecasting algorithms, which use a selective sampling scheme that enables collecting much fewer labels than standard procedures, while still retaining optimal worst-case regret guarantees. These algorithms are based on exponentially weighted forecasters, suitable for settings with an…

    Submitted 20 October, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Journal ref: NeurIPS 2023

  9. arXiv:2209.06826  [pdf, other]

    cs.LG

    Modifying Squint for Prediction with Expert Advice in a Changing Environment

    Authors: Thom Neuteboom, Tim van Erven

    Abstract: We provide a new method for online learning, specifically prediction with expert advice, in a changing environment. In a non-changing environment the Squint algorithm has been designed to always function at least as well as other known algorithms and in specific cases it functions much better. However, when using a conventional black-box algorithm to make Squint suitable for a changing environment…

    Submitted 14 September, 2022; originally announced September 2022.

  10. arXiv:2205.15834  [pdf, other]

    stat.ML cs.LG

    Attribution-based Explanations that Provide Recourse Cannot be Robust

    Authors: Hidde Fokkema, Rianne de Heide, Tim van Erven

    Abstract: Different users of machine learning methods require different explanations, depending on their goals. To make machine learning accountable to society, one important goal is to get actionable options for recourse, which allow an affected user to change the decision $f(x)$ of a machine learning system by making limited changes to its input $x$. We formalize this by providing a general definition of…

    Submitted 20 December, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 32 pages, 6 figures

  11. arXiv:2202.07554  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness

    Authors: Sarah Sachs, Hédi Hadiji, Tim van Erven, Cristóbal Guzmán

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 8 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  12. arXiv:2202.05630  [pdf, ps, other]

    cs.LG

    Scale-free Unconstrained Online Learning for Curved Losses

    Authors: Jack J. Mayo, Hédi Hadiji, Tim van Erven

    Abstract: A sequence of works in unconstrained online convex optimisation have investigated the possibility of adapting simultaneously to the norm $U$ of the comparator and the maximum norm $G$ of the gradients. In full generality, matching upper and lower bounds are known which show that this comes at the unavoidable cost of an additive $G U^3$, which is not needed when either $G$ or $U$ is known in advanc…

    Submitted 15 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: 34 pages

  13. arXiv:2107.01881  [pdf, ps, other]

    cs.LG

    Robust Online Convex Optimization in the Presence of Outliers

    Authors: Tim van Erven, Sarah Sachs, Wouter M. Koolen, Wojciech Kotłowski

    Abstract: We consider online convex optimization when a number k of data points are outliers that may be corrupted. We model this by introducing the notion of robust regret, which measures the regret only on rounds that are not outliers. The aim for the learner is to achieve small robust regret, without knowing where the outliers are. If the outliers are chosen adversarially, we show that a simple filtering…

    Submitted 5 July, 2021; originally announced July 2021.

    Journal ref: Proceedings of Thirty Fourth Conference on Learning Theory, PMLR 134:4174-4194, 2021

  14. arXiv:2102.07521  [pdf, ps, other]

    cs.LG stat.ML

    Distributed Online Learning for Joint Regret with Communication Constraints

    Authors: Dirk van der Hoeven, Hédi Hadiji, Tim van Erven

    Abstract: We consider distributed online learning for joint regret with communication constraints. In this setting, there are multiple agents that are connected in a graph. Each round, an adversary first activates one of the agents to issue a prediction and provides a corresponding gradient, and then the agents are allowed to send a $b$-bit message to their neighbors in the graph. All agents cooperate to co…

    Submitted 25 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  15. arXiv:2102.06622  [pdf, other]

    cs.LG stat.ML

    MetaGrad: Adaptation using Multiple Learning Rates in Online Learning

    Authors: Tim van Erven, Wouter M. Koolen, Dirk van der Hoeven

    Abstract: We provide a new adaptive method for online convex optimization, MetaGrad, that is robust to general convex losses but achieves faster rates for a broad class of special functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. We prove this by drawing a connection to the Bernstein condition, which is kn…

    Submitted 30 August, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Journal ref: Journal of Machine Learning Research 22(161):1-61, 2021

  16. arXiv:2006.07985  [pdf, other]

    cs.LG stat.ML

    Explaining Predictions by Approximating the Local Decision Boundary

    Authors: Georgios Vlassopoulos, Tim van Erven, Henry Brighton, Vlado Menkovski

    Abstract: Constructing accurate model-agnostic explanations for opaque machine learning models remains a challenging task. Classification models for high-dimensional data, like images, are often inherently complex. To reduce this complexity, individual predictions may be explained locally, either in terms of a simpler local surrogate model or by communicating how the predictions contrast with those of anoth…

    Submitted 22 October, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

  17. arXiv:1902.10797  [pdf, ps, other]

    cs.LG stat.ML

    Lipschitz Adaptivity with Multiple Learning Rates in Online Learning

    Authors: Zakaria Mhammedi, Wouter M. Koolen, Tim van Erven

    Abstract: We aim to design adaptive online learning algorithms that take advantage of any special structure that might be present in the learning task at hand, with as little manual tuning by the user as possible. A fundamental obstacle that comes up in the design of such adaptive algorithms is to calibrate a so-called step-size or learning rate hyperparameter depending on variance, gradient norms, etc. A r…

    Submitted 30 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: 22 pages. To appear in COLT 2019

  18. arXiv:1802.07543  [pdf, ps, other]

    stat.ML cs.LG

    The Many Faces of Exponential Weights in Online Learning

    Authors: Dirk van der Hoeven, Tim van Erven, Wojciech Kotłowski

    Abstract: A standard introduction to online learning might place Online Gradient Descent at its center and then proceed to develop generalizations and extensions like Online Mirror Descent and second-order methods. Here we explore the alternative approach of putting Exponential Weights (EW) first. We show that many standard methods and their regret bounds then follow as a special case by plugging in suitabl…

    Submitted 5 June, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

    Journal ref: Proceedings of the 31st Conference On Learning Theory, PMLR 75:2067-2092, 2018

  19. arXiv:1605.06439  [pdf, ps, other]

    cs.LG

    Combining Adversarial Guarantees and Stochastic Fast Rates in Online Learning

    Authors: Wouter M. Koolen, Peter Grünwald, Tim van Erven

    Abstract: We consider online learning algorithms that guarantee worst-case regret rates in adversarial environments (so they can be deployed safely and will perform robustly), yet adapt optimally to favorable stochastic environments (so they will perform well in a variety of settings of practical importance). We quantify the friendliness of stochastic environments by means of the well-known Bernstein (a.k.a…

    Submitted 20 May, 2016; originally announced May 2016.

    Journal ref: Advances in Neural Information Processing Systems 29 (NeurIPS), 4457-4465, 2016

  20. arXiv:1604.08740  [pdf, ps, other]

    cs.LG

    MetaGrad: Multiple Learning Rates in Online Learning

    Authors: Tim van Erven, Wouter M. Koolen

    Abstract: In online convex optimization it is well known that certain subclasses of objective functions are much easier than arbitrary convex functions. We are interested in designing adaptive methods that can automatically get fast rates in as many such subclasses as possible, without any manual tuning. Previous adaptive methods are able to interpolate between strongly convex and general convex functions.…

    Submitted 1 November, 2016; v1 submitted 29 April, 2016; originally announced April 2016.

    Journal ref: Advances in Neural Information Processing Systems 29 (NeurIPS), 3666-3674, 2016

  21. arXiv:1507.02592  [pdf, other]

    cs.LG stat.ML

    Fast rates in statistical and online learning

    Authors: Tim van Erven, Peter D. Grünwald, Nishant A. Mehta, Mark D. Reid, Robert C. Williamson

    Abstract: The speed with which a learning algorithm converges as it is presented with more data is a central problem in machine learning --- a fast rate of convergence means less data is needed for the same level of performance. The pursuit of fast rates in online and statistical learning has led to the discovery of many conditions in learning theory under which fast learning is possible. We show that most…

    Submitted 1 September, 2015; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: 69 pages, 3 figures

    Journal ref: Journal of Machine Learning Research 16(54):1793-1861, 2015

  22. arXiv:1502.08009  [pdf, ps, other]

    cs.LG stat.ML

    Second-order Quantile Methods for Experts and Combinatorial Games

    Authors: Wouter M. Koolen, Tim van Erven

    Abstract: We aim to design strategies for sequential decision making that adjust to the difficulty of the learning problem. We study this question both in the setting of prediction with expert advice, and for more general combinatorial decision tasks. We are not satisfied with just guaranteeing minimax regret rates, but we want our algorithms to perform significantly better on easy data. Two popular ways to…

    Submitted 27 February, 2015; originally announced February 2015.

  23. arXiv:1402.2044  [pdf, ps, other]

    stat.ML cs.LG math.ST

    A Second-order Bound with Excess Losses

    Authors: Pierre Gaillard, Gilles Stoltz, Tim Van Erven

    Abstract: We study online aggregation of the predictions of experts, and first show new second-order regret bounds in the standard setting, which are obtained via a version of the Prod algorithm (and also a version of the polynomially weighted average algorithm) with multiple learning rates. These bounds are in terms of excess losses, the differences between the instantaneous losses suffered by the algorith…

    Submitted 10 February, 2014; originally announced February 2014.

  24. arXiv:1301.0534  [pdf, ps, other]

    cs.LG stat.ML

    Follow the Leader If You Can, Hedge If You Must

    Authors: Steven de Rooij, Tim van Erven, Peter D. Grünwald, Wouter M. Koolen

    Abstract: Follow-the-Leader (FTL) is an intuitive sequential prediction strategy that guarantees constant regret in the stochastic setting, but has terrible performance for worst-case data. Other hedging strategies have better worst-case guarantees but may perform much worse than FTL if the data are not maximally adversarial. We introduce the FlipFlop algorithm, which is the first method that provably combi…

    Submitted 17 January, 2013; v1 submitted 3 January, 2013; originally announced January 2013.

    Comments: under submission

    Journal ref: Journal of Machine Learning Research 15(37):1281-1316, 2014

  25. arXiv:1206.2459  [pdf, other]

    cs.IT math.ST stat.ML

    Rényi Divergence and Kullback-Leibler Divergence

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like Kullback-Leibler divergence is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as Kullback-Leibler divergence, and depends on a parameter that is called its order. In particular, the Rényi divergence of order 1 equals the Kullback-Leibler…

    Submitted 24 April, 2014; v1 submitted 12 June, 2012; originally announced June 2012.

    Comments: To appear in IEEE Transactions on Information Theory

  26. arXiv:1008.4654  [pdf, other]

    cs.LG

    Freezing and Sleeping: Tracking Experts that Learn by Evolving Past Posteriors

    Authors: Wouter M. Koolen, Tim van Erven

    Abstract: A problem posed by Freund is how to efficiently track a small pool of experts out of a much larger set. This problem was solved when Bousquet and Warmuth introduced their mixing past posteriors (MPP) algorithm in 2001. In Freund's problem the experts would normally be considered black boxes. However, in this paper we re-examine Freund's problem in case the experts have internal structure that en…

    Submitted 27 August, 2010; originally announced August 2010.

  27. arXiv:1008.4532  [pdf, other]

    cs.LG

    Switching between Hidden Markov Models using Fixed Share

    Authors: Wouter M. Koolen, Tim van Erven

    Abstract: In prediction with expert advice the goal is to design online prediction algorithms that achieve small regret (additional loss on the whole data) compared to a reference scheme. In the simplest such scheme one compares to the loss of the best expert in hindsight. A more ambitious goal is to split the data into segments and compare to the best expert on each segment. This is appropriate if the natu…

    Submitted 26 August, 2010; originally announced August 2010.

  28. arXiv:1001.4448  [pdf, ps, other]

    cs.IT

    Rényi Divergence and Majorization

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like information divergence (also called Kullback-Leibler divergence or relative entropy) is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as information divergence. We review the most important properties of Rényi divergence, including its r…

    Submitted 27 May, 2010; v1 submitted 25 January, 2010; originally announced January 2010.

    MSC Class: 94A17

  29. arXiv:0807.1005  [pdf, ps, other]

    math.ST cs.IT cs.LG stat.ME stat.ML

    Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma

    Authors: Tim van Erven, Peter Grunwald, Steven de Rooij

    Abstract: Bayesian model averaging, model selection and its approximations such as BIC are generally statistically consistent, but sometimes achieve slower rates of convergence than other methods such as AIC and leave-one-out cross-validation. On the other hand, these other methods can be inconsistent. We identify the "catch-up phenomenon" as a novel explanation for the slow convergence of Bayesian method…

    Submitted 7 July, 2008; originally announced July 2008.

    Comments: A preliminary version of a part of this paper appeared at the NIPS 2007 conference

    MSC Class: 62G99; 94A99