Skip to main content

Showing 1–50 of 130 results for author: Krause, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.14689  [pdf, other

    stat.ML cs.LG

    Confidence Estimation via Sequential Likelihood Mixing

    Authors: Johannes Kirschner, Andreas Krause, Michele Meziu, Mojmir Mutny

    Abstract: We present a universal framework for constructing confidence sets based on sequential likelihood mixing. Building upon classical results from sequential analysis, we provide a unifying perspective on several recent lines of work, and establish fundamental connections between sequential mixing, Bayesian inference and regret inequalities from online estimation. The framework applies to any realizabl… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  2. Toward a Principled Framework for Disclosure Avoidance

    Authors: Michael B Hawes, Evan M Brassell, Anthony Caruso, Ryan Cumings-Menon, Jason Devine, Cassandra Dorius, David Evans, Kenneth Haase, Michele C Hedrick, Alexandra Krause, Philip Leclerc, James Livsey, Rolando A Rodriguez, Luke T Rogers, Matthew Spence, Victoria Velkoff, Michael Walsh, James Whitehorne, Sallie Ann Keller

    Abstract: Responsible disclosure limitation is an iterative exercise in risk assessment and mitigation. From time to time, as disclosure risks grow and evolve and as data users' needs change, agencies must consider redesigning the disclosure avoidance system(s) they use. Discussions about candidate systems often conflate inherent features of those systems with implementation decisions independent of those s… ▽ More

    Submitted 29 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  3. arXiv:2501.13535  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    LITE: Efficiently Estimating Gaussian Probability of Maximality

    Authors: Nicolas Menet, Jonas Hübotter, Parnian Kassraie, Andreas Krause

    Abstract: We consider the problem of computing the probability of maximality (PoM) of a Gaussian random vector, i.e., the probability for each dimension to be maximal. This is a key challenge in applications ranging from Bayesian optimization to reinforcement learning, where the PoM not only helps with finding an optimal action, but yields a fine-grained analysis of the action domain, crucial in tasks such… ▽ More

    Submitted 15 February, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: accepted in AISTATS 2025

  4. arXiv:2411.14003  [pdf, other

    cs.LG stat.ML

    Generative Intervention Models for Causal Perturbation Modeling

    Authors: Nora Schneider, Lars Lorch, Niki Kilbertus, Bernhard Schölkopf, Andreas Krause

    Abstract: We consider the problem of predicting perturbation effects via causal models. In many applications, it is a priori unknown which mechanisms of a system are modified by an external perturbation, even though the features of the perturbation are available. For example, in genomics, some properties of a drug may be known, but not their causal effects on the regulatory pathways of cells. We propose a g… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  5. arXiv:2411.00161  [pdf, other

    stat.ML cs.LG

    Residual Deep Gaussian Processes on Manifolds

    Authors: Kacper Wyrwal, Andreas Krause, Viacheslav Borovitskiy

    Abstract: We propose practical deep Gaussian process models on Riemannian manifolds, similar in spirit to residual neural networks. With manifold-to-manifold hidden layers and an arbitrary last layer, they can model manifold- and scalar-valued functions, as well as vector fields. We target data inherently supported on manifolds, which is too complex for shallow Gaussian processes thereon. For example, while… ▽ More

    Submitted 27 February, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

  6. arXiv:2406.16745  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Bandits with Preference Feedback: A Stackelberg Game Perspective

    Authors: Barna Pásztor, Parnian Kassraie, Andreas Krause

    Abstract: Bandits with preference feedback present a powerful tool for optimizing unknown target functions when only pairwise comparisons are allowed instead of direct value queries. This model allows for incorporating human feedback into online inference and optimization and has been employed in systems for fine-tuning large language models. The problem is well understood in simplified settings with linear… ▽ More

    Submitted 30 October, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 38th Conference on Neural Information Processing Systems (NeurIPS), 30 pages, 8 figures

  7. arXiv:2406.11601  [pdf, other

    cs.LG stat.ML

    Standardizing Structural Causal Models

    Authors: Weronika Ormaniec, Scott Sussex, Lars Lorch, Bernhard Schölkopf, Andreas Krause

    Abstract: Synthetic datasets generated by structural causal models (SCMs) are commonly used for benchmarking causal structure learning algorithms. However, the variances and pairwise correlations in SCM data tend to increase along the causal ordering. Several popular algorithms exploit these artifacts, possibly leading to conclusions that do not generalize to real-world settings. Existing metrics like… ▽ More

    Submitted 17 March, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Added additional benchmarks, including PC algorithm, GES, GOLEM. Evaluated Var-sortability and R2-sortability of the heuristics for mitigating variance accumulation

  8. arXiv:2406.01575  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Contextual Bilevel Reinforcement Learning for Incentive Alignment

    Authors: Vinzenz Thoma, Barna Pasztor, Andreas Krause, Giorgia Ramponi, Yifan Hu

    Abstract: The optimal policy in various real-world strategic decision-making problems depends both on the environmental configuration and exogenous events. For these settings, we introduce Contextual Bilevel Reinforcement Learning (CB-RL), a stochastic bilevel decision-making model, where the lower level consists of solving a contextual Markov Decision Process (CMDP). CB-RL can be viewed as a Stackelberg Ga… ▽ More

    Submitted 8 December, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 60 pages, 21 Figures

  9. arXiv:2402.05724  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL

    Authors: Jiawei Huang, Niao He, Andreas Krause

    Abstract: We study the sample complexity of reinforcement learning (RL) in Mean-Field Games (MFGs) with model-based function approximation that requires strategic exploration to find a Nash Equilibrium policy. We introduce the Partial Model-Based Eluder Dimension (P-MBED), a more effective notion to characterize the model class complexity. Notably, P-MBED measures the complexity of the single-agent model cl… ▽ More

    Submitted 3 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: ICML 2024; 55 Pages

  10. arXiv:2311.16706  [pdf, ps, other

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  11. arXiv:2311.04402  [pdf, other

    cs.LG stat.ML

    Likelihood Ratio Confidence Sets for Sequential Decision Making

    Authors: Nicolas Emmenegger, Mojmír Mutný, Andreas Krause

    Abstract: Certifiable, adaptive uncertainty estimates for unknown quantities are an essential ingredient of sequential decision-making algorithms. Standard approaches rely on problem-dependent concentration results and are limited to a specific combination of parameterization, noise family, and estimator. In this paper, we revisit the likelihood-based inference principle and propose to use likelihood ratios… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  12. arXiv:2310.19390  [pdf, other

    stat.ML cs.LG

    Implicit Manifold Gaussian Process Regression

    Authors: Bernardo Fichera, Viacheslav Borovitskiy, Andreas Krause, Aude Billard

    Abstract: Gaussian process regression is widely used because of its ability to provide well-calibrated uncertainty estimates and handle small or sparse datasets. However, it struggles with high-dimensional data. One possible way to scale this technique to higher dimensions is to leverage the implicit low-dimensional manifold upon which the data actually lies, as postulated by the manifold hypothesis. Prior… ▽ More

    Submitted 1 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  13. arXiv:2310.18824  [pdf, other

    stat.ML cs.LG

    Intrinsic Gaussian Vector Fields on Manifolds

    Authors: Daniel Robert-Nicoud, Andreas Krause, Viacheslav Borovitskiy

    Abstract: Various applications ranging from robotics to climate science require modeling signals on non-Euclidean domains, such as the sphere. Gaussian process models on manifolds have recently been proposed for such tasks, in particular when uncertainty quantification is needed. In the manifold setting, vector-valued signals can behave very differently from scalar-valued ones, with much of the progress so… ▽ More

    Submitted 31 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Version accepted at AISTATS 2024

  14. arXiv:2309.02236  [pdf, other

    cs.LG cs.AI stat.ML

    Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

    Authors: Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Yifan Hu, Andreas Krause, Ilija Bogunovic

    Abstract: Three major challenges in reinforcement learning are the complex dynamical systems with large state spaces, the costly data acquisition processes, and the deviation of real-world dynamics from the training environment deployment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback-Leibler, chi-square, and… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Journal ref: AISTATS 2024

  15. arXiv:2307.16625  [pdf, other

    cs.LG stat.ML

    Adversarial Causal Bayesian Optimization

    Authors: Scott Sussex, Pier Giuseppe Sessa, Anastasiia Makarova, Andreas Krause

    Abstract: In Causal Bayesian Optimization (CBO), an agent intervenes on an unknown structural causal model to maximize a downstream reward variable. In this paper, we consider the generalization where other agents or external events also intervene on the system, which is key for enabling adaptiveness to non-stationarities such as weather changes, market forces, or adversaries. We formalize this generalizati… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 21 pages, 8 figures

  16. arXiv:2307.12897  [pdf, other

    stat.ML cs.AI cs.LG

    Anytime Model Selection in Linear Bandits

    Authors: Parnian Kassraie, Nicolas Emmenegger, Andreas Krause, Aldo Pacchiano

    Abstract: Model selection in the context of bandit optimization is a challenging problem, as it requires balancing exploration and exploitation not only for action selection, but also for model selection. One natural approach is to rely on online learning algorithms that treat different models as experts. Existing methods, however, scale poorly ($\text{poly}M$) with the number of models $M$ in terms of thei… ▽ More

    Submitted 12 November, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023, 37 pages

  17. arXiv:2306.17052  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Matej Jusup, Barna Pásztor, Tadeusz Janik, Kenan Zhang, Francesco Corman, Andreas Krause, Ilija Bogunovic

    Abstract: Many applications, e.g., in shared mobility, require coordinating a large number of agents. Mean-field reinforcement learning addresses the resulting scalability challenge by optimizing the policy of a representative agent interacting with the infinite population of identical agents instead of considering individual pairwise interactions. In this paper, we address an important generalization where… ▽ More

    Submitted 27 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 23 pages, 26 figures, 6 tables

  18. arXiv:2305.16147  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Safety Constraints from Demonstrations with Unknown Rewards

    Authors: David Lindner, Xin Chen, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause

    Abstract: We propose Convex Constraint Learning for Reinforcement Learning (CoCoRL), a novel approach for inferring shared constraints in a Constrained Markov Decision Process (CMDP) from a set of safe demonstrations with possibly different reward functions. While previous work is limited to demonstrations with known rewards or fully known environment dynamics, CoCoRL can learn constraints from demonstratio… ▽ More

    Submitted 1 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Presented at the International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  19. arXiv:2303.01076  [pdf, other

    cs.LG cs.AI stat.ML

    Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

    Authors: Jonas Rothfuss, Bhavya Sukhija, Tobias Birchler, Parnian Kassraie, Andreas Krause

    Abstract: We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy's performance. This is crucial when deciding whether a given policy satisfies certain minimal performance/safety criteria before it can be deployed in the real world. To this end, we introduce HA… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI) 2023, first three authors contributed equally

  20. arXiv:2302.03683  [pdf, ps, other

    cs.LG stat.ML

    Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

    Authors: Johannes Kirschner, Tor Lattimore, Andreas Krause

    Abstract: Partial monitoring is an expressive framework for sequential decision-making with an abundance of applications, including graph-structured and dueling bandits, dynamic pricing and transductive feedback models. We survey and extend recent results on the linear formulation of partial monitoring that naturally generalizes the standard linear bandit setting. The main result is that a single algorithm,… ▽ More

    Submitted 13 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  21. arXiv:2212.09510  [pdf, other

    stat.ML cs.AI cs.LG

    Near-optimal Policy Identification in Active Reinforcement Learning

    Authors: Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

    Abstract: Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces. In cases where the transition dynamics can be readily evaluated at specified states (e.g., via a simulator), agents can operate in what is often referred to as planning with a \emph{generative model}. We propose the AE-LSVI algorithm… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  22. arXiv:2211.10257  [pdf, other

    cs.LG stat.ML

    Model-based Causal Bayesian Optimization

    Authors: Scott Sussex, Anastasiia Makarova, Andreas Krause

    Abstract: How should we intervene on an unknown structural equation model to maximize a downstream variable of interest? This setting, also known as causal Bayesian optimization (CBO), has important applications in medicine, ecology, and manufacturing. Standard Bayesian optimization algorithms fail to effectively leverage the underlying causal structure. Existing CBO approaches assume noiseless measurements… ▽ More

    Submitted 10 March, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: 24 pages, 8 figures, accepted at ICLR 2023

  23. arXiv:2211.07206  [pdf, other

    stat.ML cs.LG

    Scalable PAC-Bayesian Meta-Learning via the PAC-Optimal Hyper-Posterior: From Theory to Practice

    Authors: Jonas Rothfuss, Martin Josifoski, Vincent Fortuin, Andreas Krause

    Abstract: Meta-Learning aims to speed up the learning process on new tasks by acquiring useful inductive biases from datasets of related learning tasks. While, in practice, the number of related tasks available is often small, most of the existing approaches assume an abundance of tasks; making them unrealistic and prone to overfitting. A central question in the meta-learning literature is how to regularize… ▽ More

    Submitted 22 December, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: JMLR, 62 pages, text overlap with arXiv:2002.05551

    Journal ref: Journal of Machine Learning Research (24), 2023, 1-62

  24. arXiv:2211.01689  [pdf, other

    stat.ML cs.LG

    Isotropic Gaussian Processes on Finite Spaces of Graphs

    Authors: Viacheslav Borovitskiy, Mohammad Reza Karimi, Vignesh Ram Somnath, Andreas Krause

    Abstract: We propose a principled way to define Gaussian process priors on various sets of unweighted graphs: directed or undirected, with or without loops. We endow each of these sets with a geometric structure, inducing the notions of closeness and symmetries, by turning them into a vertex set of an appropriate metagraph. Building on this, we describe the class of priors that respect this structure and ar… ▽ More

    Submitted 25 February, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  25. arXiv:2211.01258  [pdf, other

    stat.ML cs.LG

    Instance-Dependent Generalization Bounds via Optimal Transport

    Authors: Songyan Hou, Parnian Kassraie, Anastasis Kratsios, Andreas Krause, Jonas Rothfuss

    Abstract: Existing generalization bounds fail to explain crucial factors that drive the generalization of modern neural networks. Since such bounds often hold uniformly over all parameters, they suffer from over-parametrization and fail to account for the strong inductive bias of initialization and stochastic gradient descent. As an alternative, we propose a novel optimal transport interpretation of the gen… ▽ More

    Submitted 13 November, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Journal of Machine Learning Research (JMLR), 51 pages

  26. arXiv:2210.15513  [pdf, other

    stat.ML cs.AI cs.LG

    Lifelong Bandit Optimization: No Prior and No Regret

    Authors: Felix Schur, Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Machine learning algorithms are often repeatedly applied to problems with similar structure over and over again. We focus on solving a sequence of bandit optimization tasks and develop LIBO, an algorithm which adapts to the environment by learning from past experience and becomes more sample-efficient in the process. We assume a kernelized structure where the kernel is unknown but shared across al… ▽ More

    Submitted 20 June, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 35 pages, 6 figures, In Proceedings of UAI 2023

  27. arXiv:2210.13319  [pdf, other

    cs.LG stat.ML

    MARS: Meta-Learning as Score Matching in the Function Space

    Authors: Krunoslav Lehman Pavasovic, Jonas Rothfuss, Andreas Krause

    Abstract: Meta-learning aims to extract useful inductive biases from a set of related datasets. In Bayesian meta-learning, this is typically achieved by constructing a prior distribution over neural network parameters. However, specifying families of computationally viable prior distributions over the high-dimensional neural network parameters is difficult. As a result, existing approaches resort to meta-le… ▽ More

    Submitted 10 June, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: In International Conference on Learning Representations (ICLR), 2023

  28. arXiv:2210.08087  [pdf, other

    stat.ML cs.LG

    Movement Penalized Bayesian Optimization with Application to Wind Energy Systems

    Authors: Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Andreas Krause, Ilija Bogunovic

    Abstract: Contextual Bayesian optimization (CBO) is a powerful framework for sequential decision-making given side information, with important applications, e.g., in wind energy systems. In this setting, the learner receives context (e.g., weather conditions) at each round, and has to choose an action (e.g., turbine parameters). Standard algorithms assume no cost for switching their decisions at every round… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  29. arXiv:2210.00762  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Meta-Learning Priors for Safe Bayesian Optimization

    Authors: Jonas Rothfuss, Christopher Koenig, Alisa Rupenyan, Andreas Krause

    Abstract: In robotics, optimizing controller parameters under safety constraints is an important challenge. Safe Bayesian optimization (BO) quantifies uncertainty in the objective and constraints to safely guide exploration in such settings. Hand-designing a suitable probabilistic model can be challenging, however. In the presence of unknown safety constraints, it is crucial to choose reliable model hyper-p… ▽ More

    Submitted 12 June, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Conference on Robot Learning (CoRL) 2022

  30. arXiv:2207.08645  [pdf, other

    cs.LG cs.AI stat.ML

    Active Exploration for Inverse Reinforcement Learning

    Authors: David Lindner, Andreas Krause, Giorgia Ramponi

    Abstract: Inverse Reinforcement Learning (IRL) is a powerful paradigm for inferring a reward function from expert demonstrations. Many IRL algorithms require a known transition model and sometimes even a known expert policy, or they at least require access to a generative model. However, these assumptions are too strong for many real-world applications, where the environment can be accessed only through seq… ▽ More

    Submitted 22 August, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Presented at Conference on Neural Information Processing Systems (NeurIPS), 2022

  31. arXiv:2207.06456  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Neural Network Bandits

    Authors: Parnian Kassraie, Andreas Krause, Ilija Bogunovic

    Abstract: We consider the bandit optimization problem with the reward function defined over graph-structured data. This problem has important applications in molecule design and drug discovery, where the reward is naturally invariant to graph permutations. The key challenges in this setting are scaling to large domains, and to graphs with many nodes. We resolve these challenges by embedding the permutation… ▽ More

    Submitted 11 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted to Neurips2022, 37 pages, 8 figures

  32. arXiv:2206.14332  [pdf, other

    cs.LG stat.ME stat.ML

    Active Exploration via Experiment Design in Markov Chains

    Authors: Mojmír Mutný, Tadeusz Janik, Andreas Krause

    Abstract: A key challenge in science and engineering is to design experiments to learn about some unknown quantity of interest. Classical experimental design optimally allocates the experimental budget to maximize a notion of utility (e.g., reduction in uncertainty about the unknown quantity). We consider a rich setting, where the experiments are associated with states in a {\em Markov chain}, and we can on… ▽ More

    Submitted 9 November, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

  33. arXiv:2206.13414  [pdf, other

    cs.LG math.OC stat.ML

    Learning To Cut By Looking Ahead: Cutting Plane Selection via Imitation Learning

    Authors: Max B. Paulus, Giulia Zarpellon, Andreas Krause, Laurent Charlin, Chris J. Maddison

    Abstract: Cutting planes are essential for solving mixed-integer linear problems (MILPs), because they facilitate bound improvements on the optimal solution value. For selecting cuts, modern solvers rely on manually designed heuristics that are tuned to gauge the potential effectiveness of cuts. We show that a greedy selection rule explicitly looking ahead to select cuts that yield the best bound improvemen… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  34. arXiv:2206.11646  [pdf, other

    cs.LG stat.ML

    Invariant Causal Mechanisms through Distribution Matching

    Authors: Mathieu Chevalley, Charlotte Bunne, Andreas Krause, Stefan Bauer

    Abstract: Learning representations that capture the underlying data generating process is a key problem for data efficient and robust use of neural networks. One key property for robustness which the learned representation should capture and which recently received a lot of attention is described by the notion of invariance. In this work we provide a causal perspective and new algorithm for learning invaria… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  35. arXiv:2206.05255  [pdf, other

    cs.LG cs.AI stat.ML

    Interactively Learning Preference Constraints in Linear Bandits

    Authors: David Lindner, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause

    Abstract: We study sequential decision-making with known rewards and unknown constraints, motivated by situations where the constraints represent expensive-to-evaluate human preferences, such as safe and comfortable driving behavior. We formalize the challenge of interactively learning about these constraints as a novel linear bandit problem which we call constrained linear best-arm identification. To solve… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted to International Conference on Machine Learning (ICML), 2022

  36. arXiv:2206.02063  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Active Bayesian Causal Inference

    Authors: Christian Toth, Lars Lorch, Christian Knoll, Andreas Krause, Franz Pernkopf, Robert Peharz, Julius von Kügelgen

    Abstract: Causal discovery and causal reasoning are classically treated as separate and consecutive tasks: one first infers the causal graph, and then uses it to estimate causal effects of interventions. However, such a two-stage approach is uneconomical, especially in terms of actively collected interventional data, since the causal query of interest may not require a fully-specified causal model. From a B… ▽ More

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. RP & JvK are shared last authors. 10 pages + Bibliography + Appendix (34 pages total)

  37. arXiv:2206.01665  [pdf, other

    cs.LG stat.ME stat.ML

    BaCaDI: Bayesian Causal Discovery with Unknown Interventions

    Authors: Alexander Hägele, Jonas Rothfuss, Lars Lorch, Vignesh Ram Somnath, Bernhard Schölkopf, Andreas Krause

    Abstract: Inferring causal structures from experimentation is a central task in many domains. For example, in biology, recent advances allow us to obtain single-cell expression data under multiple interventions such as drugs or gene knockouts. However, the targets of the interventions are often uncertain or unknown and the number of observations limited. As a result, standard causal discovery methods can no… ▽ More

    Submitted 23 February, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted to AISTATS 2023. 26 pages

  38. arXiv:2205.12934  [pdf, other

    cs.LG stat.ML

    Amortized Inference for Causal Structure Learning

    Authors: Lars Lorch, Scott Sussex, Jonas Rothfuss, Andreas Krause, Bernhard Schölkopf

    Abstract: Inferring causal structure poses a combinatorial search problem that typically involves evaluating structures with a score or independence test. The resulting search is costly, and designing suitable scores or tests that capture prior knowledge is difficult. In this work, we propose to amortize causal structure learning. Rather than searching over structures, we train a variational inference model… ▽ More

    Submitted 15 December, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022, fixed formatting of Figure 5

  39. arXiv:2202.01850  [pdf, other

    stat.ML cs.AI cs.LG

    A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits

    Authors: Ilija Bogunovic, Zihan Li, Andreas Krause, Jonathan Scarlett

    Abstract: We consider the sequential optimization of an unknown, continuous, and expensive to evaluate reward function, from noisy and adversarially corrupted observed rewards. When the corruption attacks are subject to a suitable budget $C$ and the function lives in a Reproducing Kernel Hilbert Space (RKHS), the problem can be posed as corrupted Gaussian process (GP) bandit optimization. We propose a novel… ▽ More

    Submitted 28 March, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Added references

  40. arXiv:2202.00602  [pdf, other

    stat.ML cs.AI cs.LG

    Meta-Learning Hypothesis Spaces for Sequential Decision-making

    Authors: Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Obtaining reliable, adaptive confidence sets for prediction functions (hypotheses) is a central challenge in sequential decision-making tasks, such as bandits and model-based reinforcement learning. These confidence sets typically rely on prior assumptions on the hypothesis space, e.g., the known kernel of a Reproducing Kernel Hilbert Space (RKHS). Hand-designing such kernels is error prone, and m… ▽ More

    Submitted 17 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 23 pages, 11 figures

  41. arXiv:2111.05008  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Misspecified Gaussian Process Bandit Optimization

    Authors: Ilija Bogunovic, Andreas Krause

    Abstract: We consider the problem of optimizing a black-box function based on noisy bandit feedback. Kernelized bandit algorithms have shown strong empirical and theoretical performance for this problem. They heavily rely on the assumption that the model is well-specified, however, and can fail without it. Instead, we introduce a \emph{misspecified} kernelized bandit setting where the unknown function can b… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted to NeurIPS 2021

  42. arXiv:2110.14296  [pdf, other

    cs.LG eess.SY math.DS stat.ML

    Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems

    Authors: Andreas Schlaginhaufen, Philippe Wenk, Andreas Krause, Florian Dörfler

    Abstract: Learning how complex dynamical systems evolve over time is a key challenge in system identification. For safety critical systems, it is often crucial that the learned model is guaranteed to converge to some equilibrium point. To this end, neural ODEs regularized with neural Lyapunov functions are a promising approach when states are fully observed. For practical applications however, partial obser… ▽ More

    Submitted 10 December, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems, 2021

  43. arXiv:2110.11665  [pdf, other

    cs.LG stat.ML

    Diversified Sampling for Batched Bayesian Optimization with Determinantal Point Processes

    Authors: Elvis Nava, Mojmír Mutný, Andreas Krause

    Abstract: In Bayesian Optimization (BO) we study black-box function optimization with noisy point evaluations and Bayesian priors. Convergence of BO can be greatly sped up by batching, where multiple evaluations of the black-box function are performed in a single round. The main difficulty in this setting is to propose at the same time diverse and informative batches of evaluation points. In this work, we i… ▽ More

    Submitted 8 February, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To be published in AISTATS 2022

  44. arXiv:2110.11181  [pdf, other

    cs.LG stat.ML

    Sensing Cox Processes via Posterior Sampling and Positive Bases

    Authors: Mojmír Mutný, Andreas Krause

    Abstract: We study adaptive sensing of Cox point processes, a widely used model from spatial statistics. We introduce three tasks: maximization of captured events, search for the maximum of the intensity function and learning level sets of the intensity function. We model the intensity function as a sample from a truncated Gaussian process, represented in a specially constructed positive basis. In this basi… ▽ More

    Submitted 29 March, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

  45. arXiv:2109.12534  [pdf, other

    cs.LG stat.ML

    Data Summarization via Bilevel Optimization

    Authors: Zalán Borsos, Mojmír Mutný, Marco Tagliasacchi, Andreas Krause

    Abstract: The increasing availability of massive data sets poses a series of challenges for machine learning. Prominent among these is the need to learn models under hardware or human resource constraints. In such resource-constrained settings, a simple yet powerful approach is to operate on small subsets of the data. Coresets are weighted subsets of the data that provide approximation guarantees for the op… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

  46. arXiv:2107.04050  [pdf, other

    stat.ML cs.LG cs.MA

    Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Barna Pásztor, Ilija Bogunovic, Andreas Krause

    Abstract: Learning in multi-agent systems is highly challenging due to several factors including the non-stationarity introduced by agents' interactions and the combinatorial nature of their state and action spaces. In particular, we consider the Mean-Field Control (MFC) problem which assumes an asymptotically infinite population of identical agents that aim to collaboratively maximize the collective reward… ▽ More

    Submitted 9 May, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Journal ref: Pásztor, B., Krause, A., & Bogunovic, I. (2023). Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning. Transactions on Machine Learning Research

  47. arXiv:2107.03144  [pdf, other

    stat.ML cs.AI cs.LG

    Neural Contextual Bandits without Regret

    Authors: Parnian Kassraie, Andreas Krause

    Abstract: Contextual bandits are a rich model for sequential decision making given side information, with important applications, e.g., in recommender systems. We propose novel algorithms for contextual bandits harnessing neural networks to approximate the unknown reward function. We resolve the open problem of proving sublinear regret bounds in this setting for general context sequences, considering both f… ▽ More

    Submitted 28 February, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: 37 pages, 6 figures

  48. arXiv:2106.11609  [pdf, other

    cs.LG math.DS stat.ML

    Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models

    Authors: Lenart Treven, Philippe Wenk, Florian Dörfler, Andreas Krause

    Abstract: Differential equations in general and neural ODEs in particular are an essential technique in continuous-time system identification. While many deterministic learning algorithms have been designed based on numerical integration via the adjoint method, many downstream tasks such as active learning, exploration in reinforcement learning, robust control, or filtering require accurate estimates of pre… ▽ More

    Submitted 15 October, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems, 2021

  49. arXiv:2106.07445  [pdf, other

    cs.LG cs.CR cs.CV math.OC stat.ML

    PopSkipJump: Decision-Based Attack for Probabilistic Classifiers

    Authors: Carl-Johann Simon-Gabriel, Noman Ahmed Sheikh, Andreas Krause

    Abstract: Most current classifiers are vulnerable to adversarial examples, small input perturbations that change the classification output. Many existing attack algorithms cover various settings, from white-box to black-box classifiers, but typically assume that the answers are deterministic and often fail when they are not. We therefore propose a new adversarial decision-based attack specifically designed… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: ICML'21. Code available at https://github.com/cjsg/PopSkipJump . 9 pages & 7 figures in main part, 14 pages & 10 figures in appendix

  50. arXiv:2106.03195  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-Learning Reliable Priors in the Function Space

    Authors: Jonas Rothfuss, Dominique Heyn, Jinfan Chen, Andreas Krause

    Abstract: When data are scarce meta-learning can improve a learner's accuracy by harnessing previous experience from related learning tasks. However, existing methods have unreliable uncertainty estimates which are often overconfident. Addressing these shortcomings, we introduce a novel meta-learning framework, called F-PACOH, that treats meta-learned priors as stochastic processes and performs meta-level r… ▽ More

    Submitted 11 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: In Advances of Neural Information Processing Systems (NeurIPS) 2021