Skip to main content

Showing 1–23 of 23 results for author: Bengs, V

.
  1. arXiv:2502.17077  [pdf, other

    cs.LG stat.ML

    A comparative analysis of rank aggregation methods for the partial label ranking problem

    Authors: Jiayi Wang, Juan C. Alfaro, Viktor Bengs

    Abstract: The label ranking problem is a supervised learning scenario in which the learner predicts a total order of the class labels for a given input instance. Recently, research has increasingly focused on the partial label ranking problem, a generalization of the label ranking problem that allows ties in the predicted orders. So far, most existing learning approaches for the partial label ranking proble… ▽ More

    Submitted 25 February, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

  2. arXiv:2502.16299  [pdf, other

    cs.LG cs.AI stat.ML

    A calibration test for evaluating set-based epistemic uncertainty representations

    Authors: Mira Jürgens, Thomas Mortier, Eyke Hüllermeier, Viktor Bengs, Willem Waegeman

    Abstract: The accurate representation of epistemic uncertainty is a challenging yet essential task in machine learning. A widely used representation corresponds to convex sets of probabilistic predictors, also known as credal sets. One popular way of constructing these credal sets is via ensembling or specialized supervised learning methods, where the epistemic uncertainty can be quantified through measures… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  3. arXiv:2402.09056  [pdf, other

    cs.AI cs.LG

    Is Epistemic Uncertainty Faithfully Represented by Evidential Deep Learning Methods?

    Authors: Mira Jürgens, Nis Meinert, Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: Trustworthy ML systems should not only return accurate predictions, but also a reliable representation of their uncertainty. Bayesian methods are commonly used to quantify both aleatoric and epistemic uncertainty, but alternative approaches, such as evidential deep learning methods, have become popular in recent years. The latter group of methods in essence extends empirical risk minimization (ERM… ▽ More

    Submitted 9 September, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024, pp. 22624--22642

  4. arXiv:2312.14925  [pdf, ps, other

    cs.LG

    A Survey of Reinforcement Learning from Human Feedback

    Authors: Timo Kaufmann, Paul Weng, Viktor Bengs, Eyke Hüllermeier

    Abstract: Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning (RL) that learns from human feedback instead of relying on an engineered reward function. Building on prior work on the related setting of preference-based reinforcement learning (PbRL), it stands at the intersection of artificial intelligence and human-computer interaction. This positioning offers a promising… ▽ More

    Submitted 30 April, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    ACM Class: I.2.6

  5. arXiv:2312.00995  [pdf, other

    cs.LG stat.ML

    Second-Order Uncertainty Quantification: A Distance-Based Approach

    Authors: Yusuf Sale, Viktor Bengs, Michele Caprio, Eyke Hüllermeier

    Abstract: In the past couple of years, various approaches to representing and quantifying different types of predictive uncertainty in machine learning, notably in the setting of classification, have been proposed on the basis of second-order probability distributions, i.e., predictions in the form of distributions on probability distributions. A completely conclusive solution has not yet been found, howeve… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 16 pages, 2 figures

  6. arXiv:2310.00750  [pdf, ps, other

    cs.LG stat.ML

    Identifying Copeland Winners in Dueling Bandits with Indifferences

    Authors: Viktor Bengs, Björn Haddenhorst, Eyke Hüllermeier

    Abstract: We consider the task of identifying the Copeland winner(s) in a dueling bandits problem with ternary feedback. This is an underexplored but practically relevant variant of the conventional dueling bandits problem, in which, in addition to strict preference between two arms, one may observe feedback in the form of an indifference. We provide a lower bound on the sample complexity for any learning a… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    MSC Class: 68W27 (Primary) 68T05 (Secondary)

  7. arXiv:2302.00736  [pdf, other

    cs.LG cs.GT

    Approximating the Shapley Value without Marginal Contributions

    Authors: Patrick Kolpaczki, Viktor Bengs, Maximilian Muschalik, Eyke Hüllermeier

    Abstract: The Shapley value, which is arguably the most popular approach for assigning a meaningful contribution value to players in a cooperative game, has recently been used intensively in explainable artificial intelligence. Its meaningfulness is due to axiomatic properties that only the Shapley value satisfies, which, however, comes at the expense of an exact computation growing exponentially with the n… ▽ More

    Submitted 30 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

  8. arXiv:2302.00511  [pdf, other

    cs.LG cs.AI

    Iterative Deepening Hyperband

    Authors: Jasmin Brandt, Marcel Wever, Dimitrios Iliadis, Viktor Bengs, Eyke Hüllermeier

    Abstract: Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however, has its own parameters that influence its performance. One of these parameters, the maximal budget, is especially problematic: If chosen too small, the budget n… ▽ More

    Submitted 6 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

  9. arXiv:2301.12736  [pdf, ps, other

    cs.LG stat.ML

    On Second-Order Scoring Rules for Epistemic Uncertainty Quantification

    Authors: Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: It is well known that accurate probabilistic predictors can be trained through empirical risk minimisation with proper scoring rules as loss functions. While such learners capture so-called aleatoric uncertainty of predictions, various machine learning methods have recently been developed with the goal to let the learner also represent its epistemic uncertainty, i.e., the uncertainty caused by a l… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    MSC Class: 68T37 (Primary) 68T30 (Secondary)

  10. arXiv:2212.00333  [pdf, other

    cs.LG cs.DS

    AC-Band: A Combinatorial Bandit-Based Approach to Algorithm Configuration

    Authors: Jasmin Brandt, Elias Schede, Viktor Bengs, Björn Haddenhorst, Eyke Hüllermeier, Kevin Tierney

    Abstract: We study the algorithm configuration (AC) problem, in which one seeks to find an optimal parameter configuration of a given target algorithm in an automated way. Recently, there has been significant progress in designing AC approaches that satisfy strong theoretical guarantees. However, a significant gap still remains between the practical performance of these approaches and state-of-the-art heuri… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  11. arXiv:2205.10082  [pdf, other

    stat.ML cs.LG

    On the Calibration of Probabilistic Classifier Sets

    Authors: Thomas Mortier, Viktor Bengs, Eyke Hüllermeier, Stijn Luca, Willem Waegeman

    Abstract: Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble learning methods, are able to model aleatoric and epistemic uncertainty. Aleatoric uncertainty is then typically quantified via the Bayes error, and epistemic uncertainty via the size of the set. In this paper, we extend the notion of calibration, which is commonly used to evaluate the validity of t… ▽ More

    Submitted 19 April, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  12. arXiv:2203.06102  [pdf, other

    cs.LG stat.ML

    Pitfalls of Epistemic Uncertainty Quantification through Loss Minimisation

    Authors: Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: Uncertainty quantification has received increasing attention in machine learning in the recent past. In particular, a distinction between aleatoric and epistemic uncertainty has been found useful in this regard. The latter refers to the learner's (lack of) knowledge and appears to be especially difficult to measure and quantify. In this paper, we analyse a recent proposal based on the idea of a se… ▽ More

    Submitted 13 October, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    MSC Class: 68T37 (Primary) 68T30 (Secondary)

  13. arXiv:2202.04593  [pdf, other

    cs.LG stat.ML

    Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models

    Authors: Viktor Bengs, Aadirupa Saha, Eyke Hüllermeier

    Abstract: We consider the regret minimization task in a dueling bandits problem with context information. In every round of the sequential decision problem, the learner makes a context-dependent selection of two choice alternatives (arms) to be compared with each other and receives feedback in the form of noisy preference information. We assume that the feedback process is determined by a linear stochastic… ▽ More

    Submitted 13 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    MSC Class: 68W27 (Primary) 68T05 (Secondary)

    Journal ref: Proceedings of the 39th International Conference on Machine Learning (ICML), PMLR 162:1764-1786, 2022

  14. arXiv:2202.04487  [pdf, other

    cs.LG stat.ML

    Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget

    Authors: Jasmin Brandt, Viktor Bengs, Björn Haddenhorst, Eyke Hüllermeier

    Abstract: We consider the combinatorial bandits problem with semi-bandit feedback under finite sampling budget constraints, in which the learner can carry out its action only for a limited number of times specified by an overall budget. The action is to choose a set of arms, whereupon feedback for each arm in the chosen set is received. Unlike existing works, we study this problem in a non-stochastic settin… ▽ More

    Submitted 14 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    MSC Class: 68Q32 (Primary) 68T05; 68W27 (Secondary)

  15. A Survey of Methods for Automated Algorithm Configuration

    Authors: Elias Schede, Jasmin Brandt, Alexander Tornede, Marcel Wever, Viktor Bengs, Eyke Hüllermeier, Kevin Tierney

    Abstract: Algorithm configuration (AC) is concerned with the automated search of the most suitable parameter configuration of a parametrized algorithm. There is currently a wide variety of AC problem variants and methods proposed in the literature. Existing reviews do not take into account all derivatives of the AC problem, nor do they offer a complete classification scheme. To this end, we introduce taxono… ▽ More

    Submitted 13 October, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    ACM Class: I.2.6

    Journal ref: Journal of Artificial Intelligence Research (JAIR) 75 (2022) 425-487

  16. arXiv:2202.00935  [pdf, other

    cs.LG stat.ML

    Non-Stationary Dueling Bandits

    Authors: Patrick Kolpaczki, Viktor Bengs, Eyke Hüllermeier

    Abstract: We study the non-stationary dueling bandits problem with $K$ arms, where the time horizon $T$ consists of $M$ stationary segments, each of which is associated with its own preference matrix. The learner repeatedly selects a pair of arms and observes a binary preference between them as feedback. To minimize the accumulated regret, the learner needs to pick the Condorcet winner of each stationary se… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

    MSC Class: 68W27 (Primary) 68T37; 62F07 (Secondary)

  17. arXiv:2109.06234  [pdf, other

    cs.LG cs.AI

    Machine Learning for Online Algorithm Selection under Censored Feedback

    Authors: Alexander Tornede, Viktor Bengs, Eyke Hüllermeier

    Abstract: In online algorithm selection (OAS), instances of an algorithmic problem class are presented to an agent one after another, and the agent has to quickly select a presumably best algorithm from a fixed set of candidate algorithms. For decision problems such as satisfiability (SAT), quality typically refers to the algorithm's runtime. As the latter is known to exhibit a heavy-tail distribution, an a… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

  18. arXiv:2011.00813  [pdf, other

    cs.LG stat.ML

    Multi-Armed Bandits with Censored Consumption of Resources

    Authors: Viktor Bengs, Eyke Hüllermeier

    Abstract: We consider a resource-aware variant of the classical multi-armed bandit problem: In each round, the learner selects an arm and determines a resource limit. It then observes a corresponding (random) reward, provided the (random) amount of consumed resources remains below the limit. Otherwise, the observation is censored, i.e., no reward is obtained. For this problem setting, we introduce a measure… ▽ More

    Submitted 17 October, 2022; v1 submitted 2 November, 2020; originally announced November 2020.

    MSC Class: 68W27 (Primary) 68T05 (Secondary)

  19. arXiv:2002.04275  [pdf, other

    cs.LG stat.ML

    Online Preselection with Context Information under the Plackett-Luce Model

    Authors: Adil El Mesaoudi-Paul, Viktor Bengs, Eyke Hüllermeier

    Abstract: We consider an extension of the contextual multi-armed bandit problem, in which, instead of selecting a single alternative (arm), a learner is supposed to make a preselection in the form of a subset of alternatives. More specifically, in each iteration, the learner is presented a set of arms and a context, both described in terms of feature vectors. The task of the learner is to preselect $k$ of t… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  20. arXiv:1907.06123  [pdf, other

    cs.LG stat.ML

    Preselection Bandits

    Authors: Viktor Bengs, Eyke Hüllermeier

    Abstract: In this paper, we introduce the Preselection Bandit problem, in which the learner preselects a subset of arms (choice alternatives) for a user, which then chooses the final arm from this subset. The learner is not aware of the user's preferences, but can learn them from observed choices. In our concrete setting, we allow these choices to be stochastic and model the user's actions by means of the P… ▽ More

    Submitted 22 December, 2021; v1 submitted 13 July, 2019; originally announced July 2019.

    Journal ref: Proceedings of the 37th International Conference on Machine Learning (ICML 2020). Volume 119 of the Proceedings of Machine Learning Research (PMLR), pages 778-787

  21. arXiv:1903.09864  [pdf, ps, other

    math.PR

    Uniform approximation in classical weak convergence theory

    Authors: Viktor Bengs, Hajo Holzmann

    Abstract: A common statistical task lies in showing asymptotic normality of certain statistics. In many of these situations, classical textbook results on weak convergence theory suffice for the problem at hand. However, there are quite some scenarios where stronger results are needed in order to establish an asymptotic normal approximation uniformly over a family of probability measures. In this note we co… ▽ More

    Submitted 23 March, 2019; originally announced March 2019.

    MSC Class: 60F05; 60B10

  22. Asymptotic confidence sets for the jump curve in bivariate regression problems

    Authors: Viktor Bengs, Matthias Eulert, Hajo Holzmann

    Abstract: We construct uniform and point-wise asymptotic confidence sets for the single edge in an otherwise smooth image function which are based on rotated differences of two one-sided kernel estimators. Using methods from M-estimation, we show consistency of the estimators of location, slope and height of the edge function and develop a uniform linearization of the contrast process. The uniform confidenc… ▽ More

    Submitted 23 March, 2019; originally announced March 2019.

    MSC Class: 62G15; 62G08; 62H12; 62H35

    Journal ref: Journal of Multivariate Analysis, 173, 291-312, 2019

  23. arXiv:1807.11398  [pdf, ps, other

    cs.LG stat.ML

    Preference-based Online Learning with Dueling Bandits: A Survey

    Authors: Viktor Bengs, Robert Busa-Fekete, Adil El Mesaoudi-Paul, Eyke Hüllermeier

    Abstract: In machine learning, the notion of multi-armed bandits refers to a class of online learning problems, in which an agent is supposed to simultaneously explore and exploit a given set of choice alternatives in the course of a sequential decision process. In the standard setting, the agent learns from stochastic feedback in the form of real-valued rewards. In many applications, however, numerical rew… ▽ More

    Submitted 12 July, 2021; v1 submitted 30 July, 2018; originally announced July 2018.

    Comments: 108 pages

    Journal ref: Journal of Machine Learning Research, 22(7):1-108, 2021