Skip to main content

Showing 1–45 of 45 results for author: Bogunovic, I

.
  1. arXiv:2503.08796  [pdf, other

    cs.LG cs.AI

    Robust Multi-Objective Controlled Decoding of Large Language Models

    Authors: Seongho Son, William Bankes, Sangwoong Yoon, Shyam Sundhar Ramesh, Xiaohang Tang, Ilija Bogunovic

    Abstract: Test-time alignment of Large Language Models (LLMs) to human preferences offers a flexible way to generate responses aligned to diverse objectives without extensive retraining of LLMs. Existing methods achieve alignment to multiple objectives simultaneously (e.g., instruction-following, helpfulness, conciseness) by optimizing their corresponding reward functions. However, they often rely on predef… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 24 pages, 9 figures

  2. arXiv:2503.05856  [pdf, other

    cs.CL cs.AI

    This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs

    Authors: Lorenz Wolf, Sangwoong Yoon, Ilija Bogunovic

    Abstract: Mixture of large language model (LLMs) Agents (MoA) architectures achieve state-of-the-art performance on prominent benchmarks like AlpacaEval 2.0 by leveraging the collaboration of multiple LLMs at inference time. Despite these successes, an evaluation of the safety and reliability of MoA is missing. We present the first comprehensive study of MoA's robustness against deceptive LLM agents that de… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 35 pages, 9 figures, 16 tables

  3. arXiv:2503.00030  [pdf, other

    cs.LG cs.AI

    Game-Theoretic Regularized Self-Play Alignment of Large Language Models

    Authors: Xiaohang Tang, Sangwoong Yoon, Seongho Son, Huizhuo Yuan, Quanquan Gu, Ilija Bogunovic

    Abstract: Self-play alignment algorithms have been developed as effective methods for fine-tuning large language models (LLMs), formulating preference optimization as a two-player game. However, the regularization with respect to the reference policy, which is crucial for mitigating over-optimization, has been insufficiently investigated in self-play alignment. In this paper, we show that our regularization… ▽ More

    Submitted 24 February, 2025; originally announced March 2025.

    Comments: Preprint

  4. arXiv:2502.12315  [pdf, other

    cs.LG cs.MA

    Mean-Field Bayesian Optimisation

    Authors: Petar Steinberg, Juliusz Ziomek, Matej Jusup, Ilija Bogunovic

    Abstract: We address the problem of optimising the average payoff for a large number of cooperating agents, where the payoff function is unknown and treated as a black box. While standard Bayesian Optimisation (BO) methods struggle with the scalability required for high-dimensional input spaces, we demonstrate how leveraging the mean-field assumption on the black-box function can transform BO into an effici… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 16 pages, 5 figures, 2 tables

  5. arXiv:2502.01208  [pdf, other

    cs.LG cs.CL

    Almost Surely Safe Alignment of Large Language Models at Inference-Time

    Authors: Xiaotong Ji, Shyam Sundhar Ramesh, Matthieu Zimmer, Ilija Bogunovic, Jun Wang, Haitham Bou Ammar

    Abstract: Even highly capable large language models (LLMs) can produce biased or unsafe responses, and alignment techniques, such as RLHF, aimed at mitigating this issue, are expensive and prone to overfitting as they retrain the LLM. This paper introduces a novel inference-time alignment approach that ensures LLMs generate safe responses almost surely, i.e., with a probability approaching one. We achieve t… ▽ More

    Submitted 5 February, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  6. arXiv:2501.05361  [pdf, other

    cs.LG

    No-Regret Linear Bandits under Gap-Adjusted Misspecification

    Authors: Chong Liu, Dan Qiao, Ming Yin, Ilija Bogunovic, Yu-Xiang Wang

    Abstract: This work studies linear bandits under a new notion of gap-adjusted misspecification and is an extension of Liu et al. (2023). When the underlying reward function is not linear, existing linear bandits work usually relies on a uniform misspecification parameter $ε$ that measures the sup-norm error of the best linear approximation. This results in an unavoidable linear regret whenever $ε> 0$. We pr… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.13252

  7. arXiv:2411.00954  [pdf, other

    cs.GT

    Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games

    Authors: Xiaohang Tang, Chiyuan Wang, Chengdong Ma, Ilija Bogunovic, Stephen McAleer, Yaodong Yang

    Abstract: Extensive-Form Game (EFG) represents a fundamental model for analyzing sequential interactions among multiple agents and the primary challenge to solve it lies in mitigating sample complexity. Existing research indicated that Double Oracle (DO) can reduce the sample complexity dependence on the information set number $|S|$ to the final restricted game size $X$ in solving EFG. This is attributed to… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: Preprint

  8. arXiv:2410.16972  [pdf, other

    cs.LG

    Sample-efficient Bayesian Optimisation Using Known Invariances

    Authors: Theodore Brown, Alexandru Cioba, Ilija Bogunovic

    Abstract: Bayesian optimisation (BO) is a powerful framework for global optimisation of costly functions, using predictions from Gaussian process models (GPs). In this work, we apply BO to functions that exhibit invariance to a known group of transformations. We show that vanilla and constrained BO algorithms are inefficient when optimising such invariant objectives, and provide a method for incorporating g… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: Accepted as a poster at NeurIPS 2024

  9. arXiv:2407.18676  [pdf, other

    cs.LG

    Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

    Authors: Seongho Son, William Bankes, Sayak Ray Chowdhury, Brooks Paige, Ilija Bogunovic

    Abstract: Reinforcement learning from human feedback (RLHF) aligns Large Language Models (LLMs) with human preferences. However, these preferences can often change over time due to external factors (e.g. environment change and societal influence). Consequently, what was wrong then might be right now. Current preference optimization algorithms do not account for temporal preference drift in their modeling, w… ▽ More

    Submitted 25 May, 2025; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: 30 pages, 9 figures. Accepted to ICML 2025

  10. arXiv:2407.18414  [pdf, other

    cs.LG cs.AI

    Adversarially Robust Decision Transformer

    Authors: Xiaohang Tang, Afonso Marques, Parameswaran Kamalaruban, Ilija Bogunovic

    Abstract: Decision Transformer (DT), as one of the representative Reinforcement Learning via Supervised Learning (RvS) methods, has achieved strong performance in offline learning tasks by leveraging the powerful Transformer architecture for sequential decision-making. However, in adversarial environments, these methods can be non-robust, since the return is dependent on the strategies of both the decision-… ▽ More

    Submitted 1 November, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted to NeurIPS 2024

  11. arXiv:2405.20304  [pdf, other

    cs.CL cs.LG

    Group Robust Preference Optimization in Reward-free RLHF

    Authors: Shyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas, Viraj Mehta, Pier Giuseppe Sessa, Haitham Bou Ammar, Ilija Bogunovic

    Abstract: Adapting large language models (LLMs) for specific tasks usually involves fine-tuning through reinforcement learning with human feedback (RLHF) on preference data. While these data often come from diverse labelers' groups (e.g., different demographics, ethnicities, company teams, etc.), traditional RLHF approaches adopt a "one-size-fits-all" approach, i.e., they indiscriminately assume and optimiz… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Preprint

  12. arXiv:2402.02629  [pdf, other

    cs.LG cs.CR

    PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks

    Authors: Chen Feng, Ziquan Liu, Zhuo Zhi, Ilija Bogunovic, Carsten Gerner-Beuerle, Miguel Rodrigues

    Abstract: It is widely known that state-of-the-art machine learning models, including vision and language models, can be seriously compromised by adversarial perturbations. It is therefore increasingly relevant to develop capabilities to certify their performance in the presence of the most effective adversarial attacks. Our paper offers a new approach to certify the performance of machine learning models i… ▽ More

    Submitted 17 December, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted to AAAI2025

  13. arXiv:2312.00486  [pdf, other

    cs.LG

    REDUCR: Robust Data Downsampling Using Class Priority Reweighting

    Authors: William Bankes, George Hughes, Ilija Bogunovic, Zi Wang

    Abstract: Modern machine learning models are becoming increasingly expensive to train for real-world image and text classification tasks, where massive web-scale data is collected in a streaming fashion. To reduce the training cost, online batch selection techniques have been developed to choose the most informative datapoints. However, these techniques can suffer from poor worst-class generalization perfor… ▽ More

    Submitted 25 November, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Preprint

  14. arXiv:2312.00267  [pdf, other

    cs.LG cs.AI stat.ML

    Sample Efficient Preference Alignment in LLMs via Active Exploration

    Authors: Viraj Mehta, Syrine Belakaria, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Barbara Engelhardt, Stefano Ermon, Jeff Schneider, Willie Neiswanger

    Abstract: Preference-based feedback is important for many applications in machine learning where evaluation of a reward function is not feasible. Notable recent examples arise in preference alignment for large language models, including in reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO). For many applications of preference alignment, the cost of acquiring human fee… ▽ More

    Submitted 20 March, 2025; v1 submitted 30 November, 2023; originally announced December 2023.

  15. arXiv:2311.04731  [pdf, other

    cs.LG stat.ML

    Robust Best-arm Identification in Linear Bandits

    Authors: Wei Wang, Sattar Vakili, Ilija Bogunovic

    Abstract: We study the robust best-arm identification problem (RBAI) in the case of linear rewards. The primary objective is to identify a near-optimal robust arm, which involves selecting arms at every round and assessing their robustness by exploring potential adversarial actions. This approach is particularly relevant when utilizing a simulator and seeking to identify a robust solution for real-world tra… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  16. arXiv:2309.02236  [pdf, other

    cs.LG cs.AI stat.ML

    Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

    Authors: Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Yifan Hu, Andreas Krause, Ilija Bogunovic

    Abstract: Three major challenges in reinforcement learning are the complex dynamical systems with large state spaces, the costly data acquisition processes, and the deviation of real-world dynamics from the training environment deployment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback-Leibler, chi-square, and… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Journal ref: AISTATS 2024

  17. arXiv:2306.17052  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Matej Jusup, Barna Pásztor, Tadeusz Janik, Kenan Zhang, Francesco Corman, Andreas Krause, Ilija Bogunovic

    Abstract: Many applications, e.g., in shared mobility, require coordinating a large number of agents. Mean-field reinforcement learning addresses the resulting scalability challenge by optimizing the policy of a representative agent interacting with the infinite population of identical agents instead of considering individual pairwise interactions. In this paper, we address an important generalization where… ▽ More

    Submitted 27 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 23 pages, 26 figures, 6 tables

  18. arXiv:2302.04376  [pdf, other

    cs.LG cs.AI stat.ML

    Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

    Authors: Volodymyr Tkachuk, Seyed Alireza Bakhtiari, Johannes Kirschner, Matej Jusup, Ilija Bogunovic, Csaba Szepesvári

    Abstract: A practical challenge in reinforcement learning are combinatorial action spaces that make planning computationally demanding. For example, in cooperative multi-agent reinforcement learning, a potentially large number of agents jointly optimize a global reward function, which leads to a combinatorial blow-up in the action space by the number of agents. As a minimal requirement, we assume access to… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  19. arXiv:2212.09510  [pdf, other

    stat.ML cs.AI cs.LG

    Near-optimal Policy Identification in Active Reinforcement Learning

    Authors: Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

    Abstract: Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces. In cases where the transition dynamics can be readily evaluated at specified states (e.g., via a simulator), agents can operate in what is often referred to as planning with a \emph{generative model}. We propose the AE-LSVI algorithm… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  20. arXiv:2210.08087  [pdf, other

    stat.ML cs.LG

    Movement Penalized Bayesian Optimization with Application to Wind Energy Systems

    Authors: Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Andreas Krause, Ilija Bogunovic

    Abstract: Contextual Bayesian optimization (CBO) is a powerful framework for sequential decision-making given side information, with important applications, e.g., in wind energy systems. In this setting, the learner receives context (e.g., weather conditions) at each round, and has to choose an action (e.g., turbine parameters). Standard algorithms assume no cost for switching their decisions at every round… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  21. arXiv:2207.06456  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Neural Network Bandits

    Authors: Parnian Kassraie, Andreas Krause, Ilija Bogunovic

    Abstract: We consider the bandit optimization problem with the reward function defined over graph-structured data. This problem has important applications in molecule design and drug discovery, where the reward is naturally invariant to graph permutations. The key challenges in this setting are scaling to large domains, and to graphs with many nodes. We resolve these challenges by embedding the permutation… ▽ More

    Submitted 11 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted to Neurips2022, 37 pages, 8 figures

  22. arXiv:2202.01850  [pdf, other

    stat.ML cs.AI cs.LG

    A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits

    Authors: Ilija Bogunovic, Zihan Li, Andreas Krause, Jonathan Scarlett

    Abstract: We consider the sequential optimization of an unknown, continuous, and expensive to evaluate reward function, from noisy and adversarially corrupted observed rewards. When the corruption attacks are subject to a suitable budget $C$ and the function lives in a Reproducing Kernel Hilbert Space (RKHS), the problem can be posed as corrupted Gaussian process (GP) bandit optimization. We propose a novel… ▽ More

    Submitted 28 March, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Added references

  23. arXiv:2111.05008  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Misspecified Gaussian Process Bandit Optimization

    Authors: Ilija Bogunovic, Andreas Krause

    Abstract: We consider the problem of optimizing a black-box function based on noisy bandit feedback. Kernelized bandit algorithms have shown strong empirical and theoretical performance for this problem. They heavily rely on the assumption that the model is well-specified, however, and can fail without it. Instead, we introduce a \emph{misspecified} kernelized bandit setting where the unknown function can b… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted to NeurIPS 2021

  24. arXiv:2111.03637  [pdf, other

    cs.LG

    Risk-averse Heteroscedastic Bayesian Optimization

    Authors: Anastasiia Makarova, Ilnura Usmanova, Ilija Bogunovic, Andreas Krause

    Abstract: Many black-box optimization tasks arising in high-stakes applications require risk-averse decisions. The standard Bayesian optimization (BO) paradigm, however, optimizes the expected value only. We generalize BO to trade mean and input-dependent variance of the objective, both of which we assume to be unknown a priori. In particular, we propose a novel risk-averse heteroscedastic Bayesian optimiza… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Journal ref: Advances in Neural Information Processing Systems, 2021

  25. arXiv:2107.06327  [pdf, other

    cs.GT cs.LG

    Contextual Games: Multi-Agent Learning with Side Information

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour

    Abstract: We formulate the novel class of contextual games, a type of repeated games driven by contextual information at each round. By means of kernel-based regularity assumptions, we model the correlation between different contexts and game outcomes and propose a novel online (meta) algorithm that exploits such correlations to minimize the contextual regret of individual players. We define game-theoretic… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Journal ref: Proc. of Neural Information Processing Systems (NeurIPS), 2020

  26. arXiv:2107.04050  [pdf, other

    stat.ML cs.LG cs.MA

    Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Barna Pásztor, Ilija Bogunovic, Andreas Krause

    Abstract: Learning in multi-agent systems is highly challenging due to several factors including the non-stationarity introduced by agents' interactions and the combinatorial nature of their state and action spaces. In particular, we consider the Mean-Field Control (MFC) problem which assumes an asymptotically infinite population of identical agents that aim to collaboratively maximize the collective reward… ▽ More

    Submitted 9 May, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Journal ref: Pásztor, B., Krause, A., & Bogunovic, I. (2023). Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning. Transactions on Machine Learning Research

  27. arXiv:2103.10369  [pdf, other

    cs.LG cs.AI stat.ML

    Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning

    Authors: Sebastian Curi, Ilija Bogunovic, Andreas Krause

    Abstract: In real-world tasks, reinforcement learning (RL) agents frequently encounter situations that are not present during training time. To ensure reliable performance, the RL agents need to exhibit robustness against worst-case situations. The robust RL framework addresses this challenge via a worst-case optimization between an agent and an adversary. Previous robust RL algorithms are either sample ine… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  28. arXiv:2101.06590  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Cost-Efficient Online Hyperparameter Optimization

    Authors: Jingkang Wang, Mengye Ren, Ilija Bogunovic, Yuwen Xiong, Raquel Urtasun

    Abstract: Recent work on hyperparameters optimization (HPO) has shown the possibility of training certain hyperparameters together with regular parameters. However, these online HPO algorithms still require running evaluation on a set of validation examples at each training step, steeply increasing the training cost. To decide when to query the validation loss, we model online HPO as a time-varying Bayesian… ▽ More

    Submitted 16 January, 2021; originally announced January 2021.

  29. arXiv:2007.05271  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to Play Sequential Games versus Unknown Opponents

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider a repeated sequential game between a learner, who plays first, and an opponent who responds to the chosen action. We seek to design strategies for the learner to successfully interact with the opponent. While most previous approaches consider known opponent models, we focus on the setting in which the opponent's model is unknown. To this end, we use kernel-based regularity assumptions… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

  30. arXiv:2007.03285  [pdf, other

    stat.ML cs.LG

    Stochastic Linear Bandits Robust to Adversarial Attacks

    Authors: Ilija Bogunovic, Arpan Losalka, Andreas Krause, Jonathan Scarlett

    Abstract: We consider a stochastic linear bandit problem in which the rewards are not only subject to random noise, but also adversarial attacks subject to a suitable budget $C$ (i.e., an upper bound on the sum of corruption magnitudes across the time horizon). We provide two variants of a Robust Phased Elimination algorithm, one that knows $C$ and one that does not. Both variants are shown to attain near-o… ▽ More

    Submitted 27 October, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

  31. arXiv:2003.01971  [pdf, other

    stat.ML cs.LG

    Corruption-Tolerant Gaussian Process Bandit Optimization

    Authors: Ilija Bogunovic, Andreas Krause, Jonathan Scarlett

    Abstract: We consider the problem of optimizing an unknown (typically non-convex) function with a bounded norm in some Reproducing Kernel Hilbert Space (RKHS), based on noisy bandit feedback. We consider a novel variant of this problem in which the point evaluations are not only corrupted by random noise, but also adversarial corruptions. We introduce an algorithm Fast-Slow GP-UCB based on Gaussian process… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: Accepted to AISTATS 2020

  32. arXiv:2002.12613  [pdf, other

    cs.LG stat.ML

    Mixed Strategies for Robust Optimization of Unknown Objectives

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider robust optimization problems, where the goal is to optimize an unknown objective function against the worst-case realization of an uncertain parameter. For this setting, we design a novel sample-efficient algorithm GP-MRO, which sequentially learns about the unknown objective from noisy point evaluations. GP-MRO seeks to discover a robust and randomized mixed strategy, that maximizes t… ▽ More

    Submitted 2 March, 2020; v1 submitted 28 February, 2020; originally announced February 2020.

  33. arXiv:2002.09038  [pdf, other

    stat.ML cs.LG

    Distributionally Robust Bayesian Optimization

    Authors: Johannes Kirschner, Ilija Bogunovic, Stefanie Jegelka, Andreas Krause

    Abstract: Robustness to distributional shift is one of the key challenges of contemporary machine learning. Attaining such robustness is the goal of distributionally robust optimization, which seeks a solution to an optimization problem that is worst-case robust under a specified distributional shift of an uncontrolled covariate. In this paper, we study such a problem when the distributional shift is measur… ▽ More

    Submitted 22 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted at AISTATS 2020

  34. arXiv:1909.08540  [pdf, other

    cs.LG cs.GT cs.MA stat.ML

    No-Regret Learning in Unknown Games with Correlated Payoffs

    Authors: Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

    Abstract: We consider the problem of learning to play a repeated multi-agent game with an unknown reward function. Single player online learning algorithms attain strong regret bounds when provided with full information feedback, which unfortunately is unavailable in many real-world scenarios. Bandit feedback alone, i.e., observing outcomes only for the selected action, yields substantially worse performanc… ▽ More

    Submitted 28 October, 2019; v1 submitted 18 September, 2019; originally announced September 2019.

  35. arXiv:1810.10775  [pdf, other

    stat.ML cs.LG

    Adversarially Robust Optimization with Gaussian Processes

    Authors: Ilija Bogunovic, Jonathan Scarlett, Stefanie Jegelka, Volkan Cevher

    Abstract: In this paper, we consider the problem of Gaussian process (GP) optimization with an added robustness requirement: The returned point may be perturbed by an adversary, and we require the function value to remain as high as possible even after this perturbation. This problem is motivated by settings in which the underlying functions during optimization and implementation stages are different, or wh… ▽ More

    Submitted 1 November, 2018; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: Corrected typos

  36. arXiv:1802.07073  [pdf, ps, other

    stat.ML cs.AI cs.DS cs.LG

    Robust Maximization of Non-Submodular Objectives

    Authors: Ilija Bogunovic, Junyao Zhao, Volkan Cevher

    Abstract: We study the problem of maximizing a monotone set function subject to a cardinality constraint $k$ in the setting where some number of elements $τ$ is deleted from the returned set. The focus of this work is on the worst-case adversarial setting. While there exist constant-factor guarantees when the function is submodular, there are no guarantees for non-submodular objectives. In this work, we pre… ▽ More

    Submitted 2 May, 2020; v1 submitted 20 February, 2018; originally announced February 2018.

    Comments: Revision of Section 4.2

  37. arXiv:1802.07028  [pdf, ps, other

    cs.LG stat.ML

    High-Dimensional Bayesian Optimization via Additive Models with Overlapping Groups

    Authors: Paul Rolland, Jonathan Scarlett, Ilija Bogunovic, Volkan Cevher

    Abstract: Bayesian optimization (BO) is a popular technique for sequential black-box function optimization, with applications including parameter tuning, robotics, environmental monitoring, and more. One of the most important challenges in BO is the development of algorithms that scale to high dimensions, which remains a key open problem despite recent progress. In this paper, we consider the approach of Ka… ▽ More

    Submitted 28 March, 2018; v1 submitted 20 February, 2018; originally announced February 2018.

  38. arXiv:1711.02598  [pdf, other

    cs.DS stat.ML

    Streaming Robust Submodular Maximization: A Partitioned Thresholding Approach

    Authors: Slobodan Mitrović, Ilija Bogunovic, Ashkan Norouzi-Fard, Jakub Tarnawski, Volkan Cevher

    Abstract: We study the classical problem of maximizing a monotone submodular function subject to a cardinality constraint k, with two additional twists: (i) elements arrive in a streaming fashion, and (ii) m items from the algorithm's memory are removed after the stream is finished. We develop a robust submodular algorithm STAR-T. It is based on a novel partitioning structure and an exponentially decreasing… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: To appear in NIPS 2017

    Journal ref: Proc. of 30th Advances in Neural Information Processing Systems (NIPS) 2017, pages 4558-4567

  39. arXiv:1706.04918  [pdf, other

    stat.ML cs.LG

    Robust Submodular Maximization: A Non-Uniform Partitioning Approach

    Authors: Ilija Bogunovic, Slobodan Mitrović, Jonathan Scarlett, Volkan Cevher

    Abstract: We study the problem of maximizing a monotone submodular function subject to a cardinality constraint $k$, with the added twist that a number of items $τ$ from the returned set may be removed. We focus on the worst-case setting considered in (Orlin et al., 2016), in which a constant-factor approximation guarantee was given for $τ= o(\sqrt{k})$. In this paper, we solve a key open problem raised the… ▽ More

    Submitted 15 June, 2017; originally announced June 2017.

    Comments: Accepted to ICML 2017

  40. arXiv:1706.00090  [pdf, other

    stat.ML cs.IT cs.LG

    Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization

    Authors: Jonathan Scarlett, Ilijia Bogunovic, Volkan Cevher

    Abstract: In this paper, we consider the problem of sequentially optimizing a black-box function $f$ based on noisy samples and bandit feedback. We assume that $f$ is smooth in the sense of having a bounded norm in some reproducing kernel Hilbert space (RKHS), yielding a commonly-considered non-Bayesian form of Gaussian process bandit optimization. We provide algorithm-independent lower bounds on the simple… ▽ More

    Submitted 31 May, 2018; v1 submitted 31 May, 2017; originally announced June 2017.

    Comments: Appearing in COLT 2017. This version corrects a few minor mistakes in Table I, which summarizes the new and existing regret bounds

  41. arXiv:1611.08574  [pdf, other

    cs.DS

    An Efficient Streaming Algorithm for the Submodular Cover Problem

    Authors: Ashkan Norouzi-Fard, Abbas Bazzi, Marwa El Halabi, Ilija Bogunovic, Ya-Ping Hsieh, Volkan Cevher

    Abstract: We initiate the study of the classical Submodular Cover (SC) problem in the data streaming model which we refer to as the Streaming Submodular Cover (SSC). We show that any single pass streaming algorithm using sublinear memory in the size of the stream will fail to provide any non-trivial approximation guarantees for SSC. Hence, we consider a relaxed version of SSC, where we only seek to find a p… ▽ More

    Submitted 25 November, 2016; originally announced November 2016.

    Comments: To appear in NIPS'16

  42. arXiv:1610.07379  [pdf, other

    stat.ML cs.IT cs.LG

    Truncated Variance Reduction: A Unified Approach to Bayesian Optimization and Level-Set Estimation

    Authors: Ilija Bogunovic, Jonathan Scarlett, Andreas Krause, Volkan Cevher

    Abstract: We present a new algorithm, truncated variance reduction (TruVaR), that treats Bayesian optimization (BO) and level-set estimation (LSE) with Gaussian processes in a unified fashion. The algorithm greedily shrinks a sum of truncated variances within a set of potential maximizers (BO) or unclassified points (LSE), which is updated based on confidence bounds. TruVaR is effective in several important… ▽ More

    Submitted 24 October, 2016; originally announced October 2016.

    Comments: Accepted to NIPS 2016

  43. arXiv:1601.06650  [pdf, other

    stat.ML cs.LG

    Time-Varying Gaussian Process Bandit Optimization

    Authors: Ilija Bogunovic, Jonathan Scarlett, Volkan Cevher

    Abstract: We consider the sequential Bayesian optimization problem with bandit feedback, adopting a formulation that allows for the reward function to vary with time. We model the reward function using a Gaussian process whose evolution obeys a simple Markov model. We introduce two natural extensions of the classical Gaussian process upper confidence bound (GP-UCB) algorithm. The first, R-GP-UCB, resets GP-… ▽ More

    Submitted 25 January, 2016; originally announced January 2016.

    Comments: To appear in AISTATS 2016

  44. arXiv:1510.06188  [pdf, other

    cs.IT cs.LG stat.ML

    Learning-based Compressive Subsampling

    Authors: Luca Baldassarre, Yen-Huan Li, Jonathan Scarlett, Baran Gözcü, Ilija Bogunovic, Volkan Cevher

    Abstract: The problem of recovering a structured signal $\mathbf{x} \in \mathbb{C}^p$ from a set of dimensionality-reduced linear measurements $\mathbf{b} = \mathbf {A}\mathbf {x}$ arises in a variety of applications, such as medical imaging, spectroscopy, Fourier optics, and computerized tomography. Due to computational and storage complexity or physical constraints imposed by the problem, the measurement… ▽ More

    Submitted 28 March, 2016; v1 submitted 21 October, 2015; originally announced October 2015.

    Comments: Submitted to IEEE Journal on Selected Topics in Signal Processing

  45. arXiv:1402.2092  [pdf, other

    cs.LG

    Near-Optimally Teaching the Crowd to Classify

    Authors: Adish Singla, Ilija Bogunovic, Gábor Bartók, Amin Karbasi, Andreas Krause

    Abstract: How should we present training examples to learners to teach them classification rules? This is a natural problem when training workers for crowdsourcing labeling tasks, and is also motivated by challenges in data-driven online education. We propose a natural stochastic model of the learners, modeling them as randomly switching among hypotheses based on observed feedback. We then develop STRICT, a… ▽ More

    Submitted 7 March, 2014; v1 submitted 10 February, 2014; originally announced February 2014.