Skip to main content

Showing 1–13 of 13 results for author: Artemev, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.09849  [pdf, other

    cs.LG stat.ML

    Recommendations for Baselines and Benchmarking Approximate Gaussian Processes

    Authors: Sebastian W. Ober, Artem Artemev, Marcel Wagenländer, Rudolfs Grobins, Mark van der Wilk

    Abstract: Gaussian processes (GPs) are a mature and widely-used component of the ML toolbox. One of their desirable qualities is automatic hyperparameter selection, which allows for training without user intervention. However, in many realistic settings, approximations are typically needed, which typically do require tuning. We argue that this requirement for tuning complicates evaluation, which has led to… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Preprint. 25 pages, 16 figures

  2. arXiv:2302.08436  [pdf, other

    stat.ML cs.LG

    Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlow

    Authors: Victor Picheny, Joel Berkeley, Henry B. Moss, Hrvoje Stojic, Uri Granta, Sebastian W. Ober, Artem Artemev, Khurram Ghani, Alexander Goodall, Andrei Paleyes, Sattar Vakili, Sergio Pascual-Diaz, Stratis Markou, Jixiang Qing, Nasrulloh R. B. S Loka, Ivo Couckuyt

    Abstract: We present Trieste, an open-source Python package for Bayesian optimization and active learning benefiting from the scalability and efficiency of TensorFlow. Our library enables the plug-and-play of popular TensorFlow-based models within sequential decision-making loops, e.g. Gaussian processes from GPflow or GPflux, or neural networks from Keras. This modular mindset is central to the package and… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  3. arXiv:2210.07893  [pdf, other

    stat.ML cs.LG

    Numerically Stable Sparse Gaussian Processes via Minimum Separation using Cover Trees

    Authors: Alexander Terenin, David R. Burt, Artem Artemev, Seth Flaxman, Mark van der Wilk, Carl Edward Rasmussen, Hong Ge

    Abstract: Gaussian processes are frequently deployed as part of larger machine learning and decision-making systems, for instance in geospatial modeling, Bayesian optimization, or in latent Gaussian models. Within a system, the Gaussian process model needs to perform in a stable and reliable manner to ensure it interacts correctly with other parts of the system. In this work, we study the numerical stabilit… ▽ More

    Submitted 16 January, 2024; v1 submitted 14 October, 2022; originally announced October 2022.

    Journal ref: Journal of Machine Learning Research, 2024

  4. arXiv:2206.14148  [pdf, other

    cs.LG cs.PL stat.ML

    Memory Safe Computations with XLA Compiler

    Authors: Artem Artemev, Tilman Roeder, Mark van der Wilk

    Abstract: Software packages like TensorFlow and PyTorch are designed to support linear algebra operations, and their speed and usability determine their success. However, by prioritising speed, they often neglect memory requirements. As a consequence, the implementations of memory-intensive algorithms that are convenient in terms of software design can often not be run for large problems due to memory overf… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Preprint

  5. arXiv:2109.09417  [pdf, other

    stat.ML cs.LG

    Barely Biased Learning for Gaussian Process Regression

    Authors: David R. Burt, Artem Artemev, Mark van der Wilk

    Abstract: Recent work in scalable approximate Gaussian process regression has discussed a bias-variance-computation trade-off when estimating the log marginal likelihood. We suggest a method that adaptively selects the amount of computation to use when estimating the log marginal likelihood so that the bias of the objective function is guaranteed to be small. While simple in principle, our current implement… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  6. arXiv:2104.05674  [pdf, ps, other

    stat.ML cs.LG

    GPflux: A Library for Deep Gaussian Processes

    Authors: Vincent Dutordoir, Hugh Salimbeni, Eric Hambro, John McLeod, Felix Leibfried, Artem Artemev, Mark van der Wilk, James Hensman, Marc P. Deisenroth, ST John

    Abstract: We introduce GPflux, a Python library for Bayesian deep learning with a strong emphasis on deep Gaussian processes (DGPs). Implementing DGPs is a challenging endeavour due to the various mathematical subtleties that arise when dealing with multivariate Gaussian distributions and the complex bookkeeping of indices. To date, there are no actively maintained, open-sourced and extendable libraries ava… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  7. arXiv:2102.08314  [pdf, other

    stat.ML cs.LG

    Tighter Bounds on the Log Marginal Likelihood of Gaussian Process Regression Using Conjugate Gradients

    Authors: Artem Artemev, David R. Burt, Mark van der Wilk

    Abstract: We propose a lower bound on the log marginal likelihood of Gaussian process regression models that can be computed without matrix factorisation of the full kernel matrix. We show that approximate maximum likelihood learning of model parameters by maximising our lower bound retains many of the sparse variational approach benefits while reducing the bias introduced into parameter learning. The basis… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: Preprint

  8. arXiv:2006.14376  [pdf, other

    stat.ML cs.LG math.OC

    Automatic Tuning of Stochastic Gradient Descent with Bayesian Optimisation

    Authors: Victor Picheny, Vincent Dutordoir, Artem Artemev, Nicolas Durrande

    Abstract: Many machine learning models require a training procedure based on running stochastic gradient descent. A key element for the efficiency of those algorithms is the choice of the learning rate schedule. While finding good learning rates schedules using Bayesian optimisation has been tackled by several authors, adapting it dynamically in a data-driven way is an open question. This is of high practic… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

  9. arXiv:2006.05356  [pdf, other

    stat.ML cs.LG

    Scalable Thompson Sampling using Sparse Gaussian Process Models

    Authors: Sattar Vakili, Henry Moss, Artem Artemev, Vincent Dutordoir, Victor Picheny

    Abstract: Thompson Sampling (TS) from Gaussian Process (GP) models is a powerful tool for the optimization of black-box functions. Although TS enjoys strong theoretical guarantees and convincing empirical performance, it incurs a large computational overhead that scales polynomially with the optimization budget. Recently, scalable TS methods based on sparse GP models have been proposed to increase the scope… ▽ More

    Submitted 5 November, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  10. arXiv:2003.01115  [pdf, other

    stat.ML cs.LG

    A Framework for Interdomain and Multioutput Gaussian Processes

    Authors: Mark van der Wilk, Vincent Dutordoir, ST John, Artem Artemev, Vincent Adam, James Hensman

    Abstract: One obstacle to the use of Gaussian processes (GPs) in large-scale problems, and as a component in deep learning system, is the need for bespoke derivations and implementations for small variations in the model or inference. In order to improve the utility of GPs we need a modular system that allows rapid implementation and testing, as seen in the neural network community. We present a mathematica… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

  11. arXiv:2001.05363  [pdf, other

    stat.ML cs.LG

    Doubly Sparse Variational Gaussian Processes

    Authors: Vincent Adam, Stefanos Eleftheriadis, Nicolas Durrande, Artem Artemev, James Hensman

    Abstract: The use of Gaussian process models is typically limited to datasets with a few tens of thousands of observations due to their complexity and memory footprint. The two most commonly used methods to overcome this limitation are 1) the variational sparse approximation which relies on inducing points and 2) the state-space equivalent formulation of Gaussian processes which can be seen as exploiting so… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: Accepted at AISTATS 2020

  12. arXiv:1912.02493  [pdf, other

    stat.ML cs.LG math.OC

    Ordinal Bayesian Optimisation

    Authors: Victor Picheny, Sattar Vakili, Artem Artemev

    Abstract: Bayesian optimisation is a powerful tool to solve expensive black-box problems, but fails when the stationary assumption made on the objective function is strongly violated, which is the case in particular for ill-conditioned or discontinuous objectives. We tackle this problem by proposing a new Bayesian optimisation framework that only considers the ordering of variables, both in the input and ou… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  13. arXiv:1902.05888  [pdf, other

    stat.ML cs.LG

    Bayesian Image Classification with Deep Convolutional Gaussian Processes

    Authors: Vincent Dutordoir, Mark van der Wilk, Artem Artemev, James Hensman

    Abstract: In decision-making systems, it is important to have classifiers that have calibrated uncertainties, with an optimisation objective that can be used for automated model selection and training. Gaussian processes (GPs) provide uncertainty estimates and a marginal likelihood objective, but their weak inductive biases lead to inferior accuracy. This has limited their applicability in certain tasks (e.… ▽ More

    Submitted 4 March, 2020; v1 submitted 15 February, 2019; originally announced February 2019.

    Comments: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020, PMLR: Volume 108