Skip to main content

Showing 1–50 of 58 results for author: Deisenroth, M P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.07170  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Parameter Efficient Fine-tuning via Explained Variance Adaptation

    Authors: Fabian Paischer, Lukas Hauzenberger, Thomas Schmied, Benedikt Alkin, Marc Peter Deisenroth, Sepp Hochreiter

    Abstract: Foundation models (FMs) are pre-trained on large-scale datasets and then fine-tuned for a specific downstream task. The most common fine-tuning method is to update pretrained weights via low-rank adaptation (LoRA). Existing initialization strategies for LoRA often rely on singular value decompositions (SVD) of gradients or weight matrices. However, they do not provably maximize the expected gradie… ▽ More

    Submitted 21 May, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 9 pages + references and appendix, code available at https://github.com/ml-jku/EVA

  2. arXiv:2406.04759  [pdf, other

    cs.LG stat.ML

    Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks

    Authors: Joel Oskarsson, Tomas Landelius, Marc Peter Deisenroth, Fredrik Lindsten

    Abstract: In recent years, machine learning has established itself as a powerful tool for high-resolution weather forecasting. While most current machine learning models focus on deterministic forecasts, accurately capturing the uncertainty in the chaotic weather system calls for probabilistic modeling. We propose a probabilistic weather forecasting model called Graph-EFM, combining a flexible latent-variab… ▽ More

    Submitted 26 October, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 72 pages, 33 figures. NeurIPS 2024. Code is available at https://github.com/mllam/neural-lam/tree/prob_model_global (global forecasting) and https://github.com/mllam/neural-lam/tree/prob_model_lam (limited area modeling)

  3. arXiv:2404.12968  [pdf, other

    cs.LG cs.DC stat.AP

    Scalable Data Assimilation with Message Passing

    Authors: Oscar Key, So Takao, Daniel Giles, Marc Peter Deisenroth

    Abstract: Data assimilation is a core component of numerical weather prediction systems. The large quantity of data processed during assimilation requires the computation to be distributed across increasingly many compute nodes, yet existing approaches suffer from synchronisation overhead in this setting. In this paper, we exploit the formulation of data assimilation as a Bayesian inference problem and appl… ▽ More

    Submitted 1 October, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Journal ref: Environ. Data Science 4 (2025) e1

  4. arXiv:2402.17036  [pdf, other

    stat.ML cs.LG

    Iterated INLA for State and Parameter Estimation in Nonlinear Dynamical Systems

    Authors: Rafael Anderka, Marc Peter Deisenroth, So Takao

    Abstract: Data assimilation (DA) methods use priors arising from differential equations to robustly interpolate and extrapolate data. Popular techniques such as ensemble methods that handle high-dimensional, nonlinear PDE priors focus mostly on state estimation, however can have difficulty learning the parameters accurately. On the other hand, machine learning based approaches can naturally learn the state… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  5. arXiv:2311.01198  [pdf, other

    cs.LG stat.ML

    Gaussian Processes on Cellular Complexes

    Authors: Mathieu Alain, So Takao, Brooks Paige, Marc Peter Deisenroth

    Abstract: In recent years, there has been considerable interest in developing machine learning models on graphs to account for topological inductive biases. In particular, recent attention has been given to Gaussian processes on such structures since they can additionally account for uncertainty. However, graphs are limited to modelling relations between two vertices. In this paper, we go beyond this dyadic… ▽ More

    Submitted 16 August, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  6. arXiv:2310.11527  [pdf, other

    stat.ML cs.LG

    Thin and Deep Gaussian Processes

    Authors: Daniel Augusto de Souza, Alexander Nikitin, ST John, Magnus Ross, Mauricio A. Álvarez, Marc Peter Deisenroth, João P. P. Gomes, Diego Mesquita, César Lincoln C. Mattos

    Abstract: Gaussian processes (GPs) can provide a principled approach to uncertainty quantification with easy-to-interpret kernel hyperparameters, such as the lengthscale, which controls the correlation distance of function values. However, selecting an appropriate kernel can be challenging. Deep GPs avoid manual kernel engineering by successively parameterizing kernels with GP layers, allowing them to learn… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted at the Conference on Neural Information Processing Systems (NeurIPS) 2023

  7. arXiv:2308.10644  [pdf, other

    cs.LG math.NA stat.ML

    Faster Training of Neural ODEs Using Gauß-Legendre Quadrature

    Authors: Alexander Norcliffe, Marc Peter Deisenroth

    Abstract: Neural ODEs demonstrate strong performance in generative and time-series modelling. However, training them via the adjoint method is slow compared to discrete models due to the requirement of numerically solving ODEs. To speed neural ODEs up, a common approach is to regularise the solutions. However, this approach may affect the expressivity of the model; when the trajectory itself matters, this i… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 32 pages, 16 figures, 7 tables, published in TMLR 2023

  8. arXiv:2307.05789  [pdf, ps, other

    stat.ML cs.LG

    Implicit regularisation in stochastic gradient descent: from single-objective to two-player games

    Authors: Mihaela Rosca, Marc Peter Deisenroth

    Abstract: Recent years have seen many insights on deep learning optimisation being brought forward by finding implicit regularisation effects of commonly used gradient-based optimisers. Understanding implicit regularisation can not only shed light on optimisation dynamics, but it can also be used to improve performance and stability across problem domains, from supervised learning to two-player games such a… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  9. arXiv:2304.05091  [pdf, other

    stat.ML cs.LG

    Actually Sparse Variational Gaussian Processes

    Authors: Harry Jake Cunningham, Daniel Augusto de Souza, So Takao, Mark van der Wilk, Marc Peter Deisenroth

    Abstract: Gaussian processes (GPs) are typically criticised for their unfavourable scaling in both computational and memory requirements. For large datasets, sparse GPs reduce these demands by conditioning on a small set of inducing variables designed to summarise the data. In practice however, for large datasets requiring many inducing variables, such as low-lengthscale spatial data, even sparse GPs can be… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 14 pages, 5 figures, published in AISTATS 2023

  10. arXiv:2302.00388  [pdf, other

    cs.LG stat.AP

    Short-term Prediction and Filtering of Solar Power Using State-Space Gaussian Processes

    Authors: Sean Nassimiha, Peter Dudfield, Jack Kelly, Marc Peter Deisenroth, So Takao

    Abstract: Short-term forecasting of solar photovoltaic energy (PV) production is important for powerplant management. Ideally these forecasts are equipped with error bars, so that downstream decisions can account for uncertainty. To produce predictions with error bars in this setting, we consider Gaussian processes (GPs) for modelling and predicting solar photovoltaic energy production in the UK. A standard… ▽ More

    Submitted 30 March, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Workshop paper submitted to "Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022"

  11. arXiv:2110.14423  [pdf, other

    stat.ML cs.LG

    Vector-valued Gaussian Processes on Riemannian Manifolds via Gauge Independent Projected Kernels

    Authors: Michael Hutchinson, Alexander Terenin, Viacheslav Borovitskiy, So Takao, Yee Whye Teh, Marc Peter Deisenroth

    Abstract: Gaussian processes are machine learning models capable of learning unknown functions in a way that represents uncertainty, thereby facilitating construction of optimal decision-making systems. Motivated by a desire to deploy Gaussian processes in novel areas of science, a rapidly-growing line of research has focused on constructively extending these models to handle non-Euclidean domains, includin… ▽ More

    Submitted 25 November, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Journal ref: Advances in Neural Information Processing Systems, 2021

  12. arXiv:2110.12087  [pdf, other

    cs.LG stat.ML

    Gaussian Process Sampling and Optimization with Approximate Upper and Lower Bounds

    Authors: Vu Nguyen, Marc Peter Deisenroth, Michael A. Osborne

    Abstract: Many functions have approximately-known upper and/or lower bounds, potentially aiding the modeling of such functions. In this paper, we introduce Gaussian process models for functions where such bounds are (approximately) known. More specifically, we propose the first use of such bounds to improve Gaussian process (GP) posterior sampling and Bayesian optimization (BO). That is, we transform a GP m… ▽ More

    Submitted 19 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 20 pages

  13. arXiv:2105.12356  [pdf, other

    cs.LG stat.ML

    The Graph Cut Kernel for Ranked Data

    Authors: Michelangelo Conserva, Marc Peter Deisenroth, K S Sesh Kumar

    Abstract: Many algorithms for ranked data become computationally intractable as the number of objects grows due to the complex geometric structure induced by rankings. An additional challenge is posed by partial rankings, i.e. rankings in which the preference is only known for a subset of all objects. For these reasons, state-of-the-art methods cannot scale to real-world applications, such as recommender sy… ▽ More

    Submitted 17 July, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

    Journal ref: Transactions on Machine Learning Research (2022)

  14. arXiv:2104.05674  [pdf, ps, other

    stat.ML cs.LG

    GPflux: A Library for Deep Gaussian Processes

    Authors: Vincent Dutordoir, Hugh Salimbeni, Eric Hambro, John McLeod, Felix Leibfried, Artem Artemev, Mark van der Wilk, James Hensman, Marc P. Deisenroth, ST John

    Abstract: We introduce GPflux, a Python library for Bayesian deep learning with a strong emphasis on deep Gaussian processes (DGPs). Implementing DGPs is a challenging endeavour due to the various mathematical subtleties that arise when dealing with multivariate Gaussian distributions and the complex bookkeeping of indices. To date, there are no actively maintained, open-sourced and extendable libraries ava… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  15. arXiv:2102.11206  [pdf, other

    cs.LG cs.RO stat.ML

    Learning Contact Dynamics using Physically Structured Neural Networks

    Authors: Andreas Hochlehnert, Alexander Terenin, Steindór Sæmundsson, Marc Peter Deisenroth

    Abstract: Learning physically structured representations of dynamical systems that include contact between different objects is an important problem for learning-based approaches in robotics. Black-box neural networks can learn to approximately represent discontinuous dynamics, but they typically require large quantities of data and often suffer from pathological behaviour when forecasting for longer time h… ▽ More

    Submitted 15 August, 2022; v1 submitted 22 February, 2021; originally announced February 2021.

    Journal ref: Artificial Intelligence and Statistics, 2021

  16. arXiv:2102.07115  [pdf, other

    stat.ML cs.LG

    Sliced Multi-Marginal Optimal Transport

    Authors: Samuel Cohen, Alexander Terenin, Yannik Pitcan, Brandon Amos, Marc Peter Deisenroth, K S Sesh Kumar

    Abstract: Multi-marginal optimal transport enables one to compare multiple probability measures, which increasingly finds application in multi-task learning problems. One practical limitation of multi-marginal transport is computational scalability in the number of measures, samples and dimensionality. In this work, we propose a multi-marginal optimal transport paradigm based on random one-dimensional proje… ▽ More

    Submitted 23 November, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Journal ref: NeurIPS Workshop on Optimal Transport and Machine Learning, 2021

  17. arXiv:2102.07106  [pdf, other

    stat.ML cs.LG

    Healing Products of Gaussian Processes

    Authors: Samuel Cohen, Rendani Mbuvha, Tshilidzi Marwala, Marc Peter Deisenroth

    Abstract: Gaussian processes (GPs) are nonparametric Bayesian models that have been applied to regression and classification problems. One of the approaches to alleviate their cubic training cost is the use of local GP experts trained on subsets of the data. In particular, product-of-expert models combine the predictive distributions of local experts through a tractable product operation. While these expert… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: ICML 2020

  18. arXiv:2102.03782  [pdf, other

    cs.LG stat.AP

    Using Gaussian Processes to Design Dynamic Experiments for Black-Box Model Discrimination under Uncertainty

    Authors: Simon Olofsson, Eduardo S. Schultz, Adel Mhamdi, Alexander Mitsos, Marc Peter Deisenroth, Ruth Misener

    Abstract: Diverse domains of science and engineering use parameterised mechanistic models. Engineers and scientists can often hypothesise several rival models to explain a specific process or phenomenon. Consider a model discrimination setting where we wish to find the best mechanistic, dynamic model candidate and the best model parameter estimates. Typically, several rival mechanistic models can explain th… ▽ More

    Submitted 31 October, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

  19. arXiv:2011.04026  [pdf, other

    stat.ML cs.LG math.ST

    Pathwise Conditioning of Gaussian Processes

    Authors: James T. Wilson, Viacheslav Borovitskiy, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth

    Abstract: As Gaussian processes are used to answer increasingly complex questions, analytic solutions become scarcer and scarcer. Monte Carlo methods act as a convenient bridge for connecting intractable mathematical expressions with actionable estimates via sampling. Conventional approaches for simulating Gaussian process posteriors view samples as draws from marginal distributions of process values at fin… ▽ More

    Submitted 30 July, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

    Journal ref: Journal of Machine Learning Research, 22(105):1-47, 2021

  20. arXiv:2010.15538  [pdf, other

    stat.ML cs.LG

    Matérn Gaussian Processes on Graphs

    Authors: Viacheslav Borovitskiy, Iskander Azangulov, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth, Nicolas Durrande

    Abstract: Gaussian processes are a versatile framework for learning unknown functions in a manner that permits one to utilize prior information about their properties. Although many different Gaussian process models are readily available when the input space is Euclidean, the choice is much more limited for Gaussian processes whose input space is an undirected graph. In this work, we leverage the stochastic… ▽ More

    Submitted 9 April, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Journal ref: Artificial Intelligence and Statistics, 2021

  21. arXiv:2008.00546  [pdf, other

    cs.LG stat.ML

    A Foliated View of Transfer Learning

    Authors: Janith Petangoda, Nick A. M. Monk, Marc Peter Deisenroth

    Abstract: Transfer learning considers a learning process where a new task is solved by transferring relevant knowledge from known solutions to related tasks. While this has been studied experimentally, there lacks a foundational description of the transfer learning problem that exposes what related tasks are, and how they can be exploited. In this work, we present a definition for relatedness between tasks… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

    Comments: 14 pages, 6 figures

  22. arXiv:2007.08949  [pdf, other

    cs.LG stat.ML

    Probabilistic Active Meta-Learning

    Authors: Jean Kaddour, Steindór Sæmundsson, Marc Peter Deisenroth

    Abstract: Data-efficient learning algorithms are essential in many practical applications where data collection is expensive, e.g., in robotics due to the wear and tear. To address this problem, meta-learning algorithms use prior experience about tasks to learn new, related tasks efficiently. Typically, a set of training tasks is assumed given or randomly chosen. However, this setting does not take into acc… ▽ More

    Submitted 22 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020

  23. arXiv:2007.07105  [pdf, other

    stat.ML cs.LG

    Estimating Barycenters of Measures in High Dimensions

    Authors: Samuel Cohen, Michael Arbel, Marc Peter Deisenroth

    Abstract: Barycentric averaging is a principled way of summarizing populations of measures. Existing algorithms for estimating barycenters typically parametrize them as weighted sums of Diracs and optimize their weights and/or locations. However, these approaches do not scale to high-dimensional settings due to the curse of dimensionality. In this paper, we propose a scalable and general algorithm for estim… ▽ More

    Submitted 14 February, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: In submission

  24. arXiv:2006.14895  [pdf, other

    stat.ML cs.LG

    Stochastic Differential Equations with Variational Wishart Diffusions

    Authors: Martin Jørgensen, Marc Peter Deisenroth, Hugh Salimbeni

    Abstract: We present a Bayesian non-parametric way of inferring stochastic differential equations for both regression tasks and continuous-time dynamical modelling. The work has high emphasis on the stochastic part of the differential equation, also known as the diffusion, and modelling it by means of Wishart processes. Further, we present a semi-parametric approach that allows the framework to scale to hig… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: ICML 2020

  25. arXiv:2006.12648  [pdf, other

    cs.LG stat.ML

    Aligning Time Series on Incomparable Spaces

    Authors: Samuel Cohen, Giulia Luise, Alexander Terenin, Brandon Amos, Marc Peter Deisenroth

    Abstract: Dynamic time warping (DTW) is a useful method for aligning, comparing and combining time series, but it requires them to live in comparable spaces. In this work, we consider a setting in which time series live on different spaces without a sensible ground metric, causing DTW to become ill-defined. To alleviate this, we propose Gromov dynamic time warping (GDTW), a distance between time series on p… ▽ More

    Submitted 22 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Journal ref: Artificial Intelligence and Statistics, 2021

  26. arXiv:2006.10160  [pdf, other

    stat.ML cs.LG

    Matérn Gaussian processes on Riemannian manifolds

    Authors: Viacheslav Borovitskiy, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth

    Abstract: Gaussian processes are an effective model class for learning unknown functions, particularly in settings where accurately representing predictive uncertainty is of key importance. Motivated by applications in the physical sciences, the widely-used Matérn class of Gaussian processes has recently been generalized to model functions whose domains are Riemannian manifolds, by re-expressing said proces… ▽ More

    Submitted 17 April, 2023; v1 submitted 17 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems, 2020

  27. arXiv:2002.09309  [pdf, other

    stat.ML cs.LG stat.CO

    Efficiently Sampling Functions from Gaussian Process Posteriors

    Authors: James T. Wilson, Viacheslav Borovitskiy, Alexander Terenin, Peter Mostowsky, Marc Peter Deisenroth

    Abstract: Gaussian processes are the gold standard for many real-world modeling problems, especially in cases where a model's success hinges upon its ability to faithfully represent predictive uncertainty. These problems typically exist as parts of larger frameworks, wherein quantities of interest are ultimately defined by integrating over posterior distributions. These quantities are frequently intractable… ▽ More

    Submitted 16 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Journal ref: International Conference on Machine Learning, 2020

  28. arXiv:1910.09349  [pdf, other

    stat.ML cs.LG

    Variational Integrator Networks for Physically Structured Embeddings

    Authors: Steindor Saemundsson, Alexander Terenin, Katja Hofmann, Marc Peter Deisenroth

    Abstract: Learning workable representations of dynamical systems is becoming an increasingly important problem in a number of application areas. By leveraging recent work connecting deep neural networks to systems of differential equations, we propose \emph{variational integrator networks}, a class of neural network architectures designed to preserve the geometric structure of physical systems. This class o… ▽ More

    Submitted 2 March, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Journal ref: Artificial Intelligence and Statistics, 2020

  29. arXiv:1905.05435  [pdf, other

    stat.ML cs.LG

    Deep Gaussian Processes with Importance-Weighted Variational Inference

    Authors: Hugh Salimbeni, Vincent Dutordoir, James Hensman, Marc Peter Deisenroth

    Abstract: Deep Gaussian processes (DGPs) can model complex marginal densities as well as complex mappings. Non-Gaussian marginals are essential for modelling real-world data, and can be generated from the DGP by incorporating uncorrelated variables to the model. Previous work on DGP models has introduced noise additively and used variational inference with a combination of sparse Gaussian processes and mean… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Appearing ICML 2019

  30. arXiv:1905.04873  [pdf, ps, other

    cs.LG stat.ML

    Differentially Private Empirical Risk Minimization with Sparsity-Inducing Norms

    Authors: K S Sesh Kumar, Marc Peter Deisenroth

    Abstract: Differential privacy is concerned about the prediction quality while measuring the privacy impact on individuals whose information is contained in the data. We consider differentially private risk minimization problems with regularizers that induce structured sparsity. These regularizers are known to be convex but they are often non-differentiable. We analyze the standard differentially private al… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

  31. arXiv:1902.10675  [pdf, other

    stat.ML cs.LG

    High-dimensional Bayesian optimization using low-dimensional feature spaces

    Authors: Riccardo Moriconi, Marc P. Deisenroth, K. S. Sesh Kumar

    Abstract: Bayesian optimization (BO) is a powerful approach for seeking the global optimum of expensive black-box functions and has proven successful for fine tuning hyper-parameters of machine learning models. However, BO is practically limited to optimizing 10--20 parameters. To scale BO to high dimensions, we usually make structural assumptions on the decomposition of the objective and\slash or exploit t… ▽ More

    Submitted 25 September, 2020; v1 submitted 27 February, 2019; originally announced February 2019.

  32. GPdoemd: a Python package for design of experiments for model discrimination

    Authors: Simon Olofsson, Lukas Hebing, Sebastian Niedenführ, Marc Peter Deisenroth, Ruth Misener

    Abstract: Model discrimination identifies a mathematical model that usefully explains and predicts a given system's behaviour. Researchers will often have several models, i.e. hypotheses, about an underlying system mechanism, but insufficient experimental data to discriminate between the models, i.e. discard inaccurate models. Given rival mathematical models and an initial experimental data set, optimal des… ▽ More

    Submitted 8 March, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

    Journal ref: Computers & Chemical Engineering, Volume 125, 2019, Pages 54-70

  33. arXiv:1805.10196  [pdf, other

    stat.ML cs.LG

    Maximizing acquisition functions for Bayesian optimization

    Authors: James T. Wilson, Frank Hutter, Marc Peter Deisenroth

    Abstract: Bayesian optimization is a sample-efficient approach to global optimization that relies on theoretically motivated value heuristics (acquisition functions) to guide its search process. Fully maximizing acquisition functions produces the Bayes' decision rule, but this ideal is difficult to achieve since these functions are frequently non-trivial to optimize. This statement is especially true when e… ▽ More

    Submitted 2 December, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: Proceedings of the Thirty-second Conference on Neural Information Processing Systems, 2018

  34. arXiv:1803.07551  [pdf, other

    stat.ML cs.LG

    Meta Reinforcement Learning with Latent Variable Gaussian Processes

    Authors: Steindór Sæmundsson, Katja Hofmann, Marc Peter Deisenroth

    Abstract: Learning from small data sets is critical in many practical applications where data collection is time consuming or expensive, e.g., robotics, animal experiments or drug design. Meta learning is one way to increase the data efficiency of learning algorithms by generalizing learned concepts from a set of training tasks to unseen, but related, tasks. Often, this relationship between tasks is hard co… ▽ More

    Submitted 7 July, 2018; v1 submitted 20 March, 2018; originally announced March 2018.

    Comments: 11 pages, 7 figures

  35. arXiv:1802.04170  [pdf, other

    stat.AP stat.ML

    Design of Experiments for Model Discrimination Hybridising Analytical and Data-Driven Approaches

    Authors: Simon Olofsson, Marc Peter Deisenroth, Ruth Misener

    Abstract: Healthcare companies must submit pharmaceutical drugs or medical devices to regulatory bodies before marketing new technology. Regulatory bodies frequently require transparent and interpretable computational modelling to justify a new healthcare technology, but researchers may have several competing models for a biological system and too little data to discriminate between the models. In design of… ▽ More

    Submitted 31 May, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Journal ref: Proc.Mach.Learn.Res. 80 (2018) pp. 3908-3917

  36. arXiv:1712.00424  [pdf, other

    stat.ML cs.LG math.OC

    The reparameterization trick for acquisition functions

    Authors: James T. Wilson, Riccardo Moriconi, Frank Hutter, Marc Peter Deisenroth

    Abstract: Bayesian optimization is a sample-efficient approach to solving global optimization problems. Along with a surrogate model, this approach relies on theoretically motivated value heuristics (acquisition functions) to guide the search process. Maximizing acquisition functions yields the best performance; unfortunately, this ideal is difficult to achieve since optimizing acquisition functions per se… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: Accepted at the NIPS 2017 Workshop on Bayesian Optimization (BayesOpt 2017)

  37. arXiv:1708.05866  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Brief Survey of Deep Reinforcement Learning

    Authors: Kai Arulkumaran, Marc Peter Deisenroth, Miles Brundage, Anil Anthony Bharath

    Abstract: Deep reinforcement learning is poised to revolutionise the field of AI and represents a step towards building autonomous systems with a higher level understanding of the visual world. Currently, deep learning is enabling reinforcement learning to scale to problems that were previously intractable, such as learning to play video games directly from pixels. Deep reinforcement learning algorithms are… ▽ More

    Submitted 28 September, 2017; v1 submitted 19 August, 2017; originally announced August 2017.

    Comments: IEEE Signal Processing Magazine, Special Issue on Deep Learning for Image Understanding (arXiv extended version)

  38. arXiv:1706.06491  [pdf, other

    eess.SY stat.ML

    Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

    Authors: Sanket Kamthe, Marc Peter Deisenroth

    Abstract: Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey l… ▽ More

    Submitted 22 February, 2018; v1 submitted 20 June, 2017; originally announced June 2017.

    Comments: Accepted at AISTATS 2018,

  39. arXiv:1705.10888  [pdf, other

    stat.ML

    Identification of Gaussian Process State Space Models

    Authors: Stefanos Eleftheriadis, Thomas F. W. Nicholson, Marc Peter Deisenroth, James Hensman

    Abstract: The Gaussian process state space model (GPSSM) is a non-linear dynamical system, where unknown transition and/or measurement mappings are described by GPs. Most research in GPSSMs has focussed on the state estimation problem, i.e., computing a posterior of the latent state given the model. However, the key challenge in GPSSMs has not been satisfactorily addressed yet: system identification, i.e.,… ▽ More

    Submitted 7 November, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

  40. arXiv:1705.10359  [pdf, other

    stat.ML cs.LG

    Neural Embeddings of Graphs in Hyperbolic Space

    Authors: Benjamin Paul Chamberlain, James Clough, Marc Peter Deisenroth

    Abstract: Neural embeddings have been used with great success in Natural Language Processing (NLP). They provide compact representations that encapsulate word similarity and attain state-of-the-art performance in a range of linguistic tasks. The success of neural embeddings has prompted significant amounts of research into applications in domains other than language. One such domain is graph-structured data… ▽ More

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: 7 pages, 5 figures

    Journal ref: 13th international workshop on mining and learning from graphs held in conjunction with KDD, 2017

  41. arXiv:1703.02596  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    Customer Lifetime Value Prediction Using Embeddings

    Authors: Benjamin Paul Chamberlain, Angelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth

    Abstract: We describe the Customer LifeTime Value (CLTV) prediction system deployed at ASOS.com, a global online fashion retailer. CLTV prediction is an important problem in e-commerce where an accurate estimate of future value allows retailers to effectively allocate marketing spend, identify and nurture high value customers and mitigate exposure to losses. The system at ASOS provides daily estimates of th… ▽ More

    Submitted 6 July, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: 10 pages, 11 figures

    Journal ref: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pages 1753-1762, 2017

  42. arXiv:1611.02704  [pdf, other

    hep-ph stat.ML

    Accelerating the BSM interpretation of LHC data with machine learning

    Authors: Gianfranco Bertone, Marc Peter Deisenroth, Jong Soo Kim, Sebastian Liem, Roberto Ruiz de Austri, Max Welling

    Abstract: The interpretation of Large Hadron Collider (LHC) data in the framework of Beyond the Standard Model (BSM) theories is hampered by the need to run computationally expensive event generators and detector simulators. Performing statistically convergent scans of high-dimensional BSM theories is consequently challenging, and in practice unfeasible for very high-dimensional BSM theories. We present her… ▽ More

    Submitted 8 November, 2016; originally announced November 2016.

    Comments: 5 pages, 2 figures

  43. arXiv:1608.04664  [pdf, other

    stat.ML cs.CV

    Variational Gaussian Process Auto-Encoder for Ordinal Prediction of Facial Action Units

    Authors: Stefanos Eleftheriadis, Ognjen Rudovic, Marc P. Deisenroth, Maja Pantic

    Abstract: We address the task of simultaneous feature fusion and modeling of discrete ordinal outputs. We propose a novel Gaussian process(GP) auto-encoder modeling approach. In particular, we introduce GP encoders to project multiple observed features onto a latent space, while GP decoders are responsible for reconstructing the original features. Inference is performed in a novel variational framework, whe… ▽ More

    Submitted 5 September, 2016; v1 submitted 16 August, 2016; originally announced August 2016.

  44. arXiv:1604.02917  [pdf, other

    stat.ML cs.CV cs.LG

    Gaussian Process Domain Experts for Model Adaptation in Facial Behavior Analysis

    Authors: Stefanos Eleftheriadis, Ognjen Rudovic, Marc P. Deisenroth, Maja Pantic

    Abstract: We present a novel approach for supervised domain adaptation that is based upon the probabilistic framework of Gaussian processes (GPs). Specifically, we introduce domain-specific GPs as local experts for facial expression classification from face images. The adaptation of the classifier is facilitated in probabilistic fashion by conditioning the target expert on multiple source experts. Furthermo… ▽ More

    Submitted 2 May, 2016; v1 submitted 11 April, 2016; originally announced April 2016.

  45. arXiv:1601.04621  [pdf, other

    cs.SI stat.ML

    Probabilistic Inference of Twitter Users' Age based on What They Follow

    Authors: Benjamin Paul Chamberlain, Clive Humby, Marc Peter Deisenroth

    Abstract: Twitter provides an open and rich source of data for studying human behaviour at scale and is widely used in social and network sciences. However, a major criticism of Twitter data is that demographic information is largely absent. Enhancing Twitter data with user ages would advance our ability to study social network structures, information flows and the spread of contagions. Approaches toward ag… ▽ More

    Submitted 24 February, 2017; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 9 pages, 9 figures

  46. arXiv:1601.03958  [pdf, other

    cs.SI stat.ML

    Real-Time Community Detection in Large Social Networks on a Laptop

    Authors: Benjamin Paul Chamberlain, Josh Levy-Kramer, Clive Humby, Marc Peter Deisenroth

    Abstract: For a broad range of research, governmental and commercial applications it is important to understand the allegiances, communities and structure of key players in society. One promising direction towards extracting this information is to exploit the rich relational data in digital social networks (the social graph). As social media data sets are very large, most approaches make use of distributed… ▽ More

    Submitted 4 September, 2016; v1 submitted 15 January, 2016; originally announced January 2016.

  47. arXiv:1511.05385  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Bayesian Optimization with Dimension Scheduling: Application to Biological Systems

    Authors: Doniyor Ulmasov, Caroline Baroukh, Benoit Chachuat, Marc Peter Deisenroth, Ruth Misener

    Abstract: Bayesian Optimization (BO) is a data-efficient method for global black-box optimization of an expensive-to-evaluate fitness function. BO typically assumes that computation cost of BO is cheap, but experiments are time consuming or costly. In practice, this allows us to optimize ten or fewer critical parameters in up to 1,000 experiments. But experiments may be less expensive than BO methods assume… ▽ More

    Submitted 17 November, 2015; originally announced November 2015.

  48. arXiv:1510.02173  [pdf, other

    cs.AI cs.CV cs.LG stat.ML

    Data-Efficient Learning of Feedback Policies from Image Pixels using Deep Dynamical Models

    Authors: John-Alexander M. Assael, Niklas Wahlström, Thomas B. Schön, Marc Peter Deisenroth

    Abstract: Data-efficient reinforcement learning (RL) in continuous state-action spaces using very high-dimensional observations remains a key challenge in developing fully autonomous systems. We consider a particularly important instance of this challenge, the pixels-to-torques problem, where an RL agent learns a closed-loop control policy ("torques") from pixel information only. We introduce a data-efficie… ▽ More

    Submitted 9 October, 2015; v1 submitted 7 October, 2015; originally announced October 2015.

  49. arXiv:1502.02860  [pdf, other

    stat.ML cs.LG cs.RO eess.SY

    Gaussian Processes for Data-Efficient Learning in Robotics and Control

    Authors: Marc Peter Deisenroth, Dieter Fox, Carl Edward Rasmussen

    Abstract: Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required. However, autonomous reinforcement learning (RL) approaches typically require many interactions with the system to learn controllers, which is a practical limitation in real systems, such as ro… ▽ More

    Submitted 10 October, 2017; v1 submitted 10 February, 2015; originally announced February 2015.

    Comments: 20 pages, 29 figures; fixed a typo in equation on page 8

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, issue no 2, pages 408-423, February 2015

  50. arXiv:1502.02843  [pdf, other

    stat.ML

    Distributed Gaussian Processes

    Authors: Marc Peter Deisenroth, Jun Wei Ng

    Abstract: To scale Gaussian processes (GPs) to large data sets we introduce the robust Bayesian Committee Machine (rBCM), a practical and scalable product-of-experts model for large-scale distributed GP regression. Unlike state-of-the-art sparse GP approximations, the rBCM is conceptually simple and does not rely on inducing or variational parameters. The key idea is to recursively distribute computations t… ▽ More

    Submitted 22 May, 2015; v1 submitted 10 February, 2015; originally announced February 2015.

    Comments: 10 pages, 5 figures. Appears in Proceedings of ICML 2015

    Journal ref: JMLR W&CP, vol 37, 2015