Skip to main content

Showing 1–12 of 12 results for author: Bouchard-Côté, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.21651  [pdf, ps, other

    cs.LG math.OC stat.CO stat.ML

    AutoSGD: Automatic Learning Rate Selection for Stochastic Gradient Descent

    Authors: Nikola Surjanovic, Alexandre Bouchard-Côté, Trevor Campbell

    Abstract: The learning rate is an important tuning parameter for stochastic gradient descent (SGD) and can greatly influence its performance. However, appropriate selection of a learning rate schedule across all iterations typically requires a non-trivial amount of user tuning effort. To address this, we introduce AutoSGD: an SGD method that automatically determines whether to increase or decrease the learn… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2502.15110  [pdf, other

    stat.ML cs.LG stat.AP

    Variational phylogenetic inference with products over bipartitions

    Authors: Evan Sidrow, Alexandre Bouchard-Côté, Lloyd T. Elliott

    Abstract: Bayesian phylogenetics requires accurate and efficient approximation of posterior distributions over trees. In this work, we develop a variational Bayesian approach for ultrametric phylogenetic trees. We present a novel variational family based on coalescent times of a single-linkage clustering and derive a closed-form density of the resulting distribution over trees. Unlike existing methods for u… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: 20 pages, 5 figures

  3. arXiv:2410.18929  [pdf, other

    stat.CO cs.LG stat.ML

    AutoStep: Locally adaptive involutive MCMC

    Authors: Tiange Liu, Nikola Surjanovic, Miguel Biron-Lattes, Alexandre Bouchard-Côté, Trevor Campbell

    Abstract: Many common Markov chain Monte Carlo (MCMC) kernels can be formulated using a deterministic involutive proposal with a step size parameter. Selecting an appropriate step size is often a challenging task in practice; and for complex multiscale targets, there may not be one choice of step size that works well globally. In this work, we address this problem with a novel class of involutive MCMC metho… ▽ More

    Submitted 20 May, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  4. arXiv:2402.09598  [pdf, other

    stat.ML cs.LG math.ST stat.CO

    MCMC-driven learning

    Authors: Alexandre Bouchard-Côté, Trevor Campbell, Geoff Pleiss, Nikola Surjanovic

    Abstract: This paper is intended to appear as a chapter for the Handbook of Markov Chain Monte Carlo. The goal of this chapter is to unify various problems at the intersection of Markov chain Monte Carlo (MCMC) and machine learning$\unicode{x2014}$which includes black-box variational inference, adaptive MCMC, normalizing flow construction and transport-assisted MCMC, surrogate-likelihood MCMC, coreset const… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2308.09769  [pdf, other

    stat.CO cs.DC

    Pigeons.jl: Distributed Sampling From Intractable Distributions

    Authors: Nikola Surjanovic, Miguel Biron-Lattes, Paul Tiede, Saifuddin Syed, Trevor Campbell, Alexandre Bouchard-Côté

    Abstract: We introduce a software package, Pigeons.jl, that provides a way to leverage distributed computation to obtain samples from complicated probability distributions, such as multimodal posteriors arising in Bayesian inference and high-dimensional distributions in statistical mechanics. Pigeons.jl provides simple APIs to perform such computations single-threaded, multi-threaded, and/or distributed ove… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  6. arXiv:2006.13925  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Slice Sampling for General Completely Random Measures

    Authors: Peiyuan Zhu, Alexandre Bouchard-Côté, Trevor Campbell

    Abstract: Completely random measures provide a principled approach to creating flexible unsupervised models, where the number of latent features is infinite and the number of features that influence the data grows with the size of the data set. Due to the infinity the latent features, posterior inference requires either marginalization---resulting in dependence structures that prevent efficient computation… ▽ More

    Submitted 25 June, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

  7. arXiv:2001.09367  [pdf, other

    stat.CO cs.LG stat.ML

    Particle-Gibbs Sampling For Bayesian Feature Allocation Models

    Authors: Alexandre Bouchard-Côté, Andrew Roth

    Abstract: Bayesian feature allocation models are a popular tool for modelling data with a combinatorial latent structure. Exact inference in these models is generally intractable and so practitioners typically apply Markov Chain Monte Carlo (MCMC) methods for posterior inference. The most widely used MCMC strategies rely on an element wise Gibbs update of the feature allocation matrix. These element wise up… ▽ More

    Submitted 25 January, 2020; originally announced January 2020.

  8. arXiv:1905.13120  [pdf, other

    stat.ML cs.LG stat.CO

    Analysis of high-dimensional Continuous Time Markov Chains using the Local Bouncy Particle Sampler

    Authors: Tingting Zhao, Alexandre Bouchard-Côté

    Abstract: Sampling the parameters of high-dimensional Continuous Time Markov Chains (CTMC) is a challenging problem with important applications in many fields of applied statistics. In this work a recently proposed type of non-reversible rejection-free Markov Chain Monte Carlo (MCMC) sampler, the Bouncy Particle Sampler (BPS), is brought to bear to this problem. BPS has demonstrated its favorable computatio… ▽ More

    Submitted 29 May, 2021; v1 submitted 30 May, 2019; originally announced May 2019.

  9. arXiv:1901.09881  [pdf, other

    stat.ML cs.LG

    Scalable Metropolis-Hastings for Exact Bayesian Inference with Large Datasets

    Authors: Robert Cornish, Paul Vanetti, Alexandre Bouchard-Côté, George Deligiannidis, Arnaud Doucet

    Abstract: Bayesian inference via standard Markov Chain Monte Carlo (MCMC) methods is too computationally intensive to handle large datasets, since the cost per step usually scales like $Θ(n)$ in the number of data points $n$. We propose the Scalable Metropolis-Hastings (SMH) kernel that exploits Gaussian concentration of the posterior to require processing on average only $O(1)$ or even $O(1/\sqrt{n})$ data… ▽ More

    Submitted 10 June, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  10. arXiv:1406.4625  [pdf, other

    stat.ML cs.LG

    An Entropy Search Portfolio for Bayesian Optimization

    Authors: Bobak Shahriari, Ziyu Wang, Matthew W. Hoffman, Alexandre Bouchard-Côté, Nando de Freitas

    Abstract: Bayesian optimization is a sample-efficient method for black-box global optimization. How- ever, the performance of a Bayesian optimization method very much depends on its exploration strategy, i.e. the choice of acquisition function, and it is not clear a priori which choice will result in superior performance. While portfolio methods provide an effective, principled way of combining a collection… ▽ More

    Submitted 4 March, 2015; v1 submitted 18 June, 2014; originally announced June 2014.

    Comments: 10 pages, 5 figures

  11. arXiv:1301.5054  [pdf, other

    q-bio.PE cs.FL stat.CO

    A Note on Probabilistic Models over Strings: the Linear Algebra Approach

    Authors: Alexandre Bouchard-Côté

    Abstract: Probabilistic models over strings have played a key role in developing methods allowing indels to be treated as phylogenetically informative events. There is an extensive literature on using automata and transducers on phylogenies to do inference on these probabilistic models, in which an important theoretical question in the field is the complexity of computing the normalization of a class of str… ▽ More

    Submitted 11 July, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

    Comments: 17 pages, 7 figures

  12. arXiv:1205.2658  [pdf

    stat.ML cs.LG

    Optimization of Structured Mean Field Objectives

    Authors: Alexandre Bouchard-Cote, Michael I. Jordan

    Abstract: In intractable, undirected graphical models, an intuitive way of creating structured mean field approximations is to select an acyclic tractable subgraph. We show that the hardness of computing the objective function and gradient of the mean field objective qualitatively depends on a simple graph property. If the tractable subgraph has this property- we call such subgraphs v-acyclic-a very fast bl… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-67-74