Skip to main content

Showing 1–17 of 17 results for author: Kirschner, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.14689  [pdf, other

    stat.ML cs.LG

    Confidence Estimation via Sequential Likelihood Mixing

    Authors: Johannes Kirschner, Andreas Krause, Michele Meziu, Mojmir Mutny

    Abstract: We present a universal framework for constructing confidence sets based on sequential likelihood mixing. Building upon classical results from sequential analysis, we provide a unifying perspective on several recent lines of work, and establish fundamental connections between sequential mixing, Bayesian inference and regret inequalities from online estimation. The framework applies to any realizabl… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  2. arXiv:2312.00616  [pdf, other

    cs.LG stat.ME stat.ML

    Investigating a domain adaptation approach for integrating different measurement instruments in a longitudinal clinical registry

    Authors: Maren Hackenberg, Michelle Pfaffenlehner, Max Behrens, Astrid Pechmann, Janbernd Kirschner, Harald Binder

    Abstract: In a longitudinal clinical registry, different measurement instruments might have been used for assessing individuals at different time points. To combine them, we investigate deep learning techniques for obtaining a joint latent representation, to which the items of different measurement instruments are mapped. This corresponds to domain adaptation, an established concept in computer science for… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 18 pages, 4 figures

  3. arXiv:2311.16286  [pdf, other

    stat.ME cs.LG stat.AP stat.ML

    A statistical approach to latent dynamic modeling with differential equations

    Authors: Maren Hackenberg, Astrid Pechmann, Clemens Kreutz, Janbernd Kirschner, Harald Binder

    Abstract: Ordinary differential equations (ODEs) can provide mechanistic models of temporally local changes of processes, where parameters are often informed by external knowledge. While ODEs are popular in systems modeling, they are less established for statistical modeling of longitudinal cohort data, e.g., in a clinical setting. Yet, modeling of local changes could also be attractive for assessing the tr… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 29 pages, 6 figures

  4. arXiv:2302.04376  [pdf, other

    cs.LG cs.AI stat.ML

    Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

    Authors: Volodymyr Tkachuk, Seyed Alireza Bakhtiari, Johannes Kirschner, Matej Jusup, Ilija Bogunovic, Csaba Szepesvári

    Abstract: A practical challenge in reinforcement learning are combinatorial action spaces that make planning computationally demanding. For example, in cooperative multi-agent reinforcement learning, a potentially large number of agents jointly optimize a global reward function, which leads to a combinatorial blow-up in the action space by the number of agents. As a minimal requirement, we assume access to… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  5. arXiv:2302.03683  [pdf, ps, other

    cs.LG stat.ML

    Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

    Authors: Johannes Kirschner, Tor Lattimore, Andreas Krause

    Abstract: Partial monitoring is an expressive framework for sequential decision-making with an abundance of applications, including graph-structured and dueling bandits, dynamic pricing and transductive feedback models. We survey and extend recent results on the linear formulation of partial monitoring that naturally generalizes the standard linear bandit setting. The main result is that a single algorithm,… ▽ More

    Submitted 13 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  6. arXiv:2212.09510  [pdf, other

    stat.ML cs.AI cs.LG

    Near-optimal Policy Identification in Active Reinforcement Learning

    Authors: Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

    Abstract: Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces. In cases where the transition dynamics can be readily evaluated at specified states (e.g., via a simulator), agents can operate in what is often referred to as planning with a \emph{generative model}. We propose the AE-LSVI algorithm… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  7. arXiv:2212.08949  [pdf, other

    cs.LG eess.SY stat.ML

    Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

    Authors: Zichen Zhang, Johannes Kirschner, Junxi Zhang, Francesco Zanini, Alex Ayoub, Masood Dehghan, Dale Schuurmans

    Abstract: A default assumption in reinforcement learning (RL) and optimal control is that observations arrive at discrete time points on a fixed clock cycle. Yet, many applications involve continuous-time systems where the time discretization, in principle, can be managed. The impact of time discretization on RL methods has not been fully characterized in existing theory, but a more detailed analysis of its… ▽ More

    Submitted 16 January, 2024; v1 submitted 17 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  8. arXiv:2105.11802  [pdf, other

    stat.ML cs.LG

    Bias-Robust Bayesian Optimization via Dueling Bandits

    Authors: Johannes Kirschner, Andreas Krause

    Abstract: We consider Bayesian optimization in settings where observations can be adversarially biased, for example by an uncontrolled hidden confounder. Our first contribution is a reduction of the confounded setting to the dueling bandit model. Then we propose a novel approach for dueling bandits based on information-directed sampling (IDS). Thereby, we obtain the first efficient kernelized algorithm for… ▽ More

    Submitted 9 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  9. arXiv:2101.08534  [pdf, other

    stat.ML cs.LG

    Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback

    Authors: Marc Jourdan, Mojmír Mutný, Johannes Kirschner, Andreas Krause

    Abstract: Combinatorial bandits with semi-bandit feedback generalize multi-armed bandits, where the agent chooses sets of arms and observes a noisy reward for each arm contained in the chosen set. The action set satisfies a given structure such as forming a base of a matroid or a path in a graph. We focus on the pure-exploration problem of identifying the best arm with fixed confidence, as well as a more ge… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 45 pages. 3 tables. Appendices: from A to I. Figures: 1(a), 1(b), 2(a), 2(b), 3(a), 3(b), 3(c), 4(a), 4(b), 5(a), 5(b), 5(c), 5(d), 6(a), 6(b). To be published in the 32nd International Conference on Algorithmic Learning Theory and the Proceedings of Machine Learning Research vol 132:1-45, 2021

  10. arXiv:2012.00634  [pdf, other

    stat.ML cs.LG

    Deep dynamic modeling with just two time points: Can we still allow for individual trajectories?

    Authors: Maren Hackenberg, Philipp Harms, Michelle Pfaffenlehner, Astrid Pechmann, Janbernd Kirschner, Thorsten Schmidt, Harald Binder

    Abstract: Longitudinal biomedical data are often characterized by a sparse time grid and individual-specific development patterns. Specifically, in epidemiological cohort studies and clinical registries we are facing the question of what can be learned from the data in an early phase of the study, when only a baseline characterization and one follow-up measurement are available. Inspired by recent advances… ▽ More

    Submitted 20 December, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 23 pages, 7 figures

  11. arXiv:2011.05944  [pdf, other

    stat.ML cs.LG

    Asymptotically Optimal Information-Directed Sampling

    Authors: Johannes Kirschner, Tor Lattimore, Claire Vernade, Csaba Szepesvári

    Abstract: We introduce a simple and efficient algorithm for stochastic linear bandits with finitely many actions that is asymptotically optimal and (nearly) worst-case optimal in finite time. The approach is based on the frequentist information-directed sampling (IDS) framework, with a surrogate for the information gain that is informed by the optimization problem that defines the asymptotic lower bound. Ou… ▽ More

    Submitted 2 July, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted at COLT 2021

  12. arXiv:2002.11182  [pdf, other

    stat.ML cs.LG

    Information Directed Sampling for Linear Partial Monitoring

    Authors: Johannes Kirschner, Tor Lattimore, Andreas Krause

    Abstract: Partial monitoring is a rich framework for sequential decision making under uncertainty that generalizes many well known bandit models, including linear, combinatorial and dueling bandits. We introduce information directed sampling (IDS) for stochastic partial monitoring with a linear reward and observation structure. IDS achieves adaptive worst-case regret rates that depend on precise observabili… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  13. arXiv:2002.09038  [pdf, other

    stat.ML cs.LG

    Distributionally Robust Bayesian Optimization

    Authors: Johannes Kirschner, Ilija Bogunovic, Stefanie Jegelka, Andreas Krause

    Abstract: Robustness to distributional shift is one of the key challenges of contemporary machine learning. Attaining such robustness is the goal of distributionally robust optimization, which seeks a solution to an optimization problem that is worst-case robust under a specified distributional shift of an uncontrolled covariate. In this paper, we study such a problem when the distributional shift is measur… ▽ More

    Submitted 22 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted at AISTATS 2020

  14. arXiv:1906.02685  [pdf, other

    stat.ML cs.LG

    Stochastic Bandits with Context Distributions

    Authors: Johannes Kirschner, Andreas Krause

    Abstract: We introduce a stochastic contextual bandit model where at each time step the environment chooses a distribution over a context set and samples the context from this distribution. The learner observes only the context distribution while the exact context realization remains hidden. This allows for a broad range of applications where the context is stochastic or when the learner needs to predict th… ▽ More

    Submitted 14 November, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Accepted at NeurIPS 2019

  15. arXiv:1902.03229  [pdf, other

    cs.LG stat.ML

    Adaptive and Safe Bayesian Optimization in High Dimensions via One-Dimensional Subspaces

    Authors: Johannes Kirschner, Mojmír Mutný, Nicole Hiller, Rasmus Ischebeck, Andreas Krause

    Abstract: Bayesian optimization is known to be difficult to scale to high dimensions, because the acquisition step requires solving a non-convex optimization problem in the same search space. In order to scale the method and keep its benefits, we propose an algorithm (LineBO) that restricts the problem to a sequence of iteratively chosen one-dimensional sub-problems that can be solved efficiently. We show t… ▽ More

    Submitted 28 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

  16. arXiv:1812.07544  [pdf, other

    cs.LG cs.AI stat.ML

    Information-Directed Exploration for Deep Reinforcement Learning

    Authors: Nikolay Nikolov, Johannes Kirschner, Felix Berkenkamp, Andreas Krause

    Abstract: Efficient exploration remains a major challenge for reinforcement learning. One reason is that the variability of the returns often depends on the current state and action, and is therefore heteroscedastic. Classical exploration strategies such as upper confidence bound algorithms and Thompson sampling fail to appropriately account for heteroscedasticity, even in the bandit setting. Motivated by r… ▽ More

    Submitted 24 March, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

  17. arXiv:1801.09667  [pdf, other

    stat.ML

    Information Directed Sampling and Bandits with Heteroscedastic Noise

    Authors: Johannes Kirschner, Andreas Krause

    Abstract: In the stochastic bandit problem, the goal is to maximize an unknown function via a sequence of noisy evaluations. Typically, the observation noise is assumed to be independent of the evaluation point and to satisfy a tail bound uniformly on the domain; a restrictive assumption for many applications. In this work, we consider bandits with heteroscedastic noise, where we explicitly allow the noise… ▽ More

    Submitted 19 April, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

    Comments: Figure 1a,2a updated