Skip to main content

Showing 1–17 of 17 results for author: Jørgensen, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  2. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: AISTATS 238, 496-504, 2024

  3. arXiv:2303.08874  [pdf, other

    stat.ML cs.LG

    Bayesian Quadrature for Neural Ensemble Search

    Authors: Saad Hamid, Xingchen Wan, Martin Jørgensen, Binxin Ru, Michael Osborne

    Abstract: Ensembling can improve the performance of Neural Networks, but existing approaches struggle when the architecture likelihood surface has dispersed, narrow peaks. Furthermore, existing methods construct equally weighted ensembles, and this is likely to be vulnerable to the failure modes of the weaker architectures. By viewing ensembling as approximately marginalising over architectures we construct… ▽ More

    Submitted 17 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  4. arXiv:2303.05263  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Fast post-process Bayesian inference with Variational Sparse Bayesian Quadrature

    Authors: Chengkun Li, Grégoire Clarté, Martin Jørgensen, Luigi Acerbi

    Abstract: In applied Bayesian inference scenarios, users may have access to a large number of pre-existing model evaluations, for example from maximum-a-posteriori (MAP) optimization runs. However, traditional approximate inference techniques make little to no use of this available information. We propose the framework of post-process Bayesian inference as a means to obtain a quick posterior approximation f… ▽ More

    Submitted 29 November, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

  5. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  6. arXiv:2209.00343  [pdf, other

    stat.ML cs.LG

    Bézier Gaussian Processes for Tall and Wide Data

    Authors: Martin Jørgensen, Michael A. Osborne

    Abstract: Modern approximations to Gaussian processes are suitable for "tall data", with a cost that scales well in the number of observations, but under-performs on ``wide data'', scaling poorly in the number of input features. That is, as the number of input features grows, good predictive performance requires the number of summarising variables, and their associated cost, to grow rapidly. We introduce a… ▽ More

    Submitted 13 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  7. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  8. arXiv:2106.07512  [pdf, other

    stat.ML cs.LG

    Last Layer Marginal Likelihood for Invariance Learning

    Authors: Pola Schwöbel, Martin Jørgensen, Sebastian W. Ober, Mark van der Wilk

    Abstract: Data augmentation is often used to incorporate inductive biases into models. Traditionally, these are hand-crafted and tuned with cross validation. The Bayesian paradigm for model selection provides a path towards end-to-end learning of invariances using only the training data, by optimising the marginal likelihood. Computing the marginal likelihood is hard for neural networks, but success with tr… ▽ More

    Submitted 1 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: AISTATS '22

  9. arXiv:2101.10790  [pdf

    cs.LG cs.AI stat.ML

    The Consequences of the Framing of Machine Learning Risk Prediction Models: Evaluation of Sepsis in General Wards

    Authors: Simon Meyer Lauritsen, Bo Thiesson, Marianne Johansson Jørgensen, Anders Hammerich Riis, Ulrick Skipper Espelund, Jesper Bo Weile, Jeppe Lange

    Abstract: Objectives: To evaluate the consequences of the framing of machine learning risk prediction models. We evaluate how framing affects model performance and model learning in four different approaches previously applied in published artificial-intelligence (AI) models. Setting and participants: We analysed structured secondary healthcare data from 221,283 citizens from four Danish municipalities wh… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

  10. arXiv:2008.05552  [pdf, other

    stat.ML cs.LG

    Reparametrization Invariance in non-parametric Causal Discovery

    Authors: Martin Jørgensen, Søren Hauberg

    Abstract: Causal discovery estimates the underlying physical process that generates the observed data: does X cause Y or does Y cause X? Current methodologies use structural conditions to turn the causal query into a statistical query, when only observational data is available. But what if these statistical queries are sensitive to causal invariants? This study investigates one such invariant: the causal re… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

  11. arXiv:2006.14895  [pdf, other

    stat.ML cs.LG

    Stochastic Differential Equations with Variational Wishart Diffusions

    Authors: Martin Jørgensen, Marc Peter Deisenroth, Hugh Salimbeni

    Abstract: We present a Bayesian non-parametric way of inferring stochastic differential equations for both regression tasks and continuous-time dynamical modelling. The work has high emphasis on the stochastic part of the differential equation, also known as the diffusion, and modelling it by means of Wishart processes. Further, we present a semi-parametric approach that allows the framework to scale to hig… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: ICML 2020

  12. arXiv:2006.11741  [pdf, other

    stat.ML cs.LG

    Isometric Gaussian Process Latent Variable Model for Dissimilarity Data

    Authors: Martin Jørgensen, Søren Hauberg

    Abstract: We present a probabilistic model where the latent variable respects both the distances and the topology of the modeled data. The model leverages the Riemannian geometry of the generated manifold to endow the latent space with a well-defined stochastic distance measure, which is modeled locally as Nakagami distributions. These stochastic distances are sought to be as similar as possible to observed… ▽ More

    Submitted 8 June, 2021; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: ICML 2021

  13. arXiv:2004.03637  [pdf, other

    cs.LG stat.ML

    Probabilistic Spatial Transformer Networks

    Authors: Pola Schwöbel, Frederik Warburg, Martin Jørgensen, Kristoffer H. Madsen, Søren Hauberg

    Abstract: Spatial Transformer Networks (STNs) estimate image transformations that can improve downstream tasks by `zooming in' on relevant regions in an image. However, STNs are hard to train and sensitive to mis-predictions of transformations. To circumvent these limitations, we propose a probabilistic extension that estimates a stochastic transformation rather than a deterministic one. Marginalizing trans… ▽ More

    Submitted 15 June, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: UAI 2022

  14. arXiv:1912.01266  [pdf, other

    cs.AI cs.LG stat.AP stat.ML

    Explainable artificial intelligence model to predict acute critical illness from electronic health records

    Authors: Simon Meyer Lauritsen, Mads Kristensen, Mathias Vassard Olsen, Morten Skaarup Larsen, Katrine Meyer Lauritsen, Marianne Johansson Jørgensen, Jeppe Lange, Bo Thiesson

    Abstract: We developed an explainable artificial intelligence (AI) early warning score (xAI-EWS) system for early detection of acute critical illness. While maintaining a high predictive performance, our system explains to the clinician on which relevant electronic health records (EHRs) data the prediction is grounded. Acute critical illness is often preceded by deterioration of routinely measured clinical… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

  15. arXiv:1906.03260  [pdf, other

    stat.ML cs.LG

    Reliable training and estimation of variance networks

    Authors: Nicki S. Detlefsen, Martin Jørgensen, Søren Hauberg

    Abstract: We propose and investigate new complementary methodologies for estimating predictive variance networks in regression neural networks. We derive a locally aware mini-batching scheme that result in sparse robust gradients, and show how to make unbiased weight updates to a variance network. Further, we formulate a heuristic for robustly fitting both the mean and variance networks post hoc. Finally, w… ▽ More

    Submitted 4 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Appeared at NeurIPS 2019

  16. arXiv:1906.02956  [pdf, other

    cs.LG stat.AP stat.ML

    Early detection of sepsis utilizing deep learning on electronic health record event sequences

    Authors: Simon Meyer Lauritsen, Mads Ellersgaard Kalør, Emil Lund Kongsgaard, Katrine Meyer Lauritsen, Marianne Johansson Jørgensen, Jeppe Lange, Bo Thiesson

    Abstract: The timeliness of detection of a sepsis event in progress is a crucial factor in the outcome for the patient. Machine learning models built from data in electronic health records can be used as an effective tool for improving this timeliness, but so far the potential for clinical implementations has been largely limited to studies in intensive care units. This study will employ a richer data set t… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  17. arXiv:1902.10501  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph stat.ML

    Atomistic structure learning

    Authors: Mathias S. Jørgensen, Henrik L. Mortensen, Søren A. Meldgaard, Esben L. Kolsbjerg, Thomas L. Jacobsen, Knud H. Sørensen, Bjørk Hammer

    Abstract: One endeavour of modern physical chemistry is to use bottom-up approaches to design materials and drugs with desired properties. Here we introduce an atomistic structure learning algorithm (ASLA) that utilizes a convolutional neural network to build 2D compounds and layered structures atom by atom. The algorithm takes no prior data or knowledge on atomic interactions but inquires a first-principle… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.