Showing 1–50 of 67 results for author: Wood, F

Searching in archive stat.
  1. arXiv:2501.12408

    cs.AI cs.LG cs.RO eess.SY stat.ML

    Control-ITRA: Controlling the Behavior of a Driving Model

    Authors: Vasileios Lioutas, Adam Scibior, Matthew Niedoba, Berend Zwartsenberg, Frank Wood

    Abstract: Simulating realistic driving behavior is crucial for developing and testing autonomous systems in complex traffic environments. Equally important is the ability to control the behavior of simulated agents to tailor scenarios to specific research needs and safety considerations. This paper extends the general-purpose multi-agent driving behavior model ITRA (Scibior et al., 2021), by introducing a m…

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: 16 pages, 2 figures

  2. arXiv:2404.09636

    cs.LG cs.AI stat.ML

    All-in-one simulation-based inference

    Authors: Manuel Gloeckler, Michael Deistler, Christian Weilbach, Frank Wood, Jakob H. Macke

    Abstract: Amortized Bayesian inference trains neural networks to solve stochastic inference problems using model simulations, thereby making it possible to rapidly perform Bayesian inference for any newly observed data. However, current simulation-based amortized inference methods are simulation-hungry and inflexible: They require the specification of a fixed parametric prior, simulator, and inference tasks…

    Submitted 15 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: To be published in the proceedings of the 41st International Conference on Machine Learning (ICML 2024), Vienna, Austria. PMLR 235, 2024

  3. arXiv:2402.08018

    cs.LG cs.CV stat.ML

    Nearest Neighbour Score Estimators for Diffusion Generative Models

    Authors: Matthew Niedoba, Dylan Green, Saeid Naderiparizi, Vasileios Lioutas, Jonathan Wilder Lavington, Xiaoxuan Liang, Yunpeng Liu, Ke Zhang, Setareh Dabiri, Adam Ścibior, Berend Zwartsenberg, Frank Wood

    Abstract: Score function estimation is the cornerstone of both training and sampling from diffusion generative models. Despite this fact, the most commonly used estimators are either biased neural network approximations or high variance Monte Carlo estimators based on the conditional score. We introduce a novel nearest neighbour score function estimator which utilizes multiple samples from the training set…

    Submitted 16 July, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 25 pages, 9 figures. To be published in ICML 2024

  4. arXiv:2307.16463

    cs.LG stat.ML

    Don't be so negative! Score-based Generative Modeling with Oracle-assisted Guidance

    Authors: Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg, Frank Wood

    Abstract: The maximum likelihood principle advocates parameter estimation via optimization of the data likelihood function. Models estimated in this way can exhibit a variety of generalization characteristics dictated by, e.g. architecture, parameterization, and optimization bias. This work addresses model learning in a setting where there further exists side-information in the form of an oracle that can la…

    Submitted 31 July, 2023; originally announced July 2023.

  5. arXiv:2210.12236

    stat.ML cs.LG

    Uncertain Evidence in Probabilistic Models and Stochastic Simulators

    Authors: Andreas Munk, Alexander Mead, Frank Wood

    Abstract: We consider the problem of performing Bayesian inference in probabilistic models where observations are accompanied by uncertainty, referred to as "uncertain evidence." We explore how to interpret uncertain evidence, and by extension the importance of proper interpretation as it pertains to inference about latent variables. We consider a recently-proposed method "distributional evidence" as well a…

    Submitted 26 January, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  6. arXiv:2206.09021

    stat.ML cs.LG

    Conditional Permutation Invariant Flows

    Authors: Berend Zwartsenberg, Adam Ścibior, Matthew Niedoba, Vasileios Lioutas, Yunpeng Liu, Justice Sefas, Setareh Dabiri, Jonathan Wilder Lavington, Trevor Campbell, Frank Wood

    Abstract: We present a novel, conditional generative probabilistic model of set-valued data with a tractable log density. This model is a continuous normalizing flow governed by permutation equivariant dynamics. These dynamics are driven by a learnable per-set-element term and pairwise interactions, both parametrized by deep neural networks. We illustrate the utility of this model via applications including…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 20 pages, 10 figures

    ACM Class: I.2.0

  7. arXiv:2205.15460

    stat.ML cs.LG

    Critic Sequential Monte Carlo

    Authors: Vasileios Lioutas, Jonathan Wilder Lavington, Justice Sefas, Matthew Niedoba, Yunpeng Liu, Berend Zwartsenberg, Setareh Dabiri, Frank Wood, Adam Scibior

    Abstract: We introduce CriticSMC, a new algorithm for planning as inference built from a composition of sequential Monte Carlo with learned Soft-Q function heuristic factors. These heuristic factors, obtained from parametric approximations of the marginal likelihood ahead, more effectively guide SMC towards the desired target distribution, which is particularly helpful for planning in environments with hard…

    Submitted 21 January, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICLR 2023

  8. arXiv:2202.08587

    cs.LG stat.ML

    Gradients without Backpropagation

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr

    Abstract: Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic differentiation algorithms that also includes the forward mode. We present a method to compute gradients based solely on the directional derivative that one can comp…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 10 pages, 6 figures

    MSC Class: 68T07

    ACM Class: I.2.6; I.2.5

  9. arXiv:2107.00745

    cs.LG cs.AI stat.ML

    q-Paths: Generalizing the Geometric Annealing Path using Power Means

    Authors: Vaden Masrani, Rob Brekelmans, Thang Bui, Frank Nielsen, Aram Galstyan, Greg Ver Steeg, Frank Wood

    Abstract: Many common machine learning methods involve the geometric annealing path, a sequence of intermediate densities between two distributions of interest constructed using the geometric average. While alternatives such as the moment-averaging path have demonstrated performance gains in some settings, their practical applicability remains limited by exponential family endpoint assumptions and a lack of…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.07823

  10. arXiv:2106.10314

    stat.ML cs.LG

    Differentiable Particle Filtering without Modifying the Forward Pass

    Authors: Adam Ścibior, Frank Wood

    Abstract: Particle filters are not compatible with automatic differentiation due to the presence of discrete resampling steps. While known estimators for the score function, based on Fisher's identity, can be computed using particle filters, up to this point they required manual implementation. In this paper we show that such estimators can be computed using automatic differentiation, after introducing a si…

    Submitted 19 October, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 24 pages, 3 figures

  11. arXiv:2104.11212

    stat.ML cs.LG

    Imagining The Road Ahead: Multi-Agent Trajectory Prediction via Differentiable Simulation

    Authors: Adam Scibior, Vasileios Lioutas, Daniele Reda, Peyman Bateni, Frank Wood

    Abstract: We develop a deep generative model built on a fully differentiable simulator for multi-agent trajectory prediction. Agents are modeled with conditional recurrent variational neural networks (CVRNNs), which take as input an ego-centric birdview image representing the current state of the world and output an action, consisting of steering and acceleration, which is used to derive the subsequent agen…

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 10 pages, 8 figures

  12. arXiv:2012.15566

    cs.LG stat.ML

    Robust Asymmetric Learning in POMDPs

    Authors: Andrew Warrington, J. Wilder Lavington, Adam Ścibior, Mark Schmidt, Frank Wood

    Abstract: Policies for partially observed Markov decision processes can be efficiently learned by imitating policies for the corresponding fully observed Markov decision processes. Unfortunately, existing approaches for this kind of imitation learning have a serious flaw: the expert does not know what the trainee cannot see, and so may encourage actions that are sub-optimal, even unsafe, under partial infor…

    Submitted 1 July, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: ICML 2021

  13. arXiv:2010.03753

    cs.LG stat.ML

    Uncertainty in Neural Processes

    Authors: Saeid Naderiparizi, Kenny Chiu, Benjamin Bloem-Reddy, Frank Wood

    Abstract: We explore the effects of architecture and training objective choice on amortized posterior predictive inference in probabilistic conditional generative models. We aim this work to be a counterpoint to a recent trend in the literature that stresses achieving good samples when the amount of conditioning data is large. We instead focus our attention on the case where the amount of conditioning data…

    Submitted 8 October, 2020; originally announced October 2020.

  14. arXiv:2010.01274

    cs.LG stat.ML

    Assisting the Adversary to Improve GAN Training

    Authors: Andreas Munk, William Harvey, Frank Wood

    Abstract: Some of the most popular methods for improving the stability and performance of GANs involve constraining or regularizing the discriminator. In this paper we consider a largely overlooked regularization technique which we refer to as the Adversary's Assistant (AdvAs). We motivate this using a different perspective to that of prior work. Specifically, we consider a common mismatch between theoretic…

    Submitted 8 December, 2020; v1 submitted 3 October, 2020; originally announced October 2020.

  15. arXiv:2007.00642

    cs.LG stat.ML

    All in the Exponential Family: Bregman Duality in Thermodynamic Variational Inference

    Authors: Rob Brekelmans, Vaden Masrani, Frank Wood, Greg Ver Steeg, Aram Galstyan

    Abstract: The recently proposed Thermodynamic Variational Objective (TVO) leverages thermodynamic integration to provide a family of variational inference objectives, which both tighten and generalize the ubiquitous Evidence Lower Bound (ELBO). However, the tightness of TVO bounds was not previously known, an expensive grid search was used to choose a "schedule" of intermediate distributions, and model lear…

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: ICML 2020

  16. arXiv:2007.00155

    cs.LG stat.ML

    Semi-supervised Sequential Generative Models

    Authors: Michael Teng, Tuan Anh Le, Adam Scibior, Frank Wood

    Abstract: We introduce a novel objective for training deep generative time-series models with discrete latent variables for which supervision is only sparsely available. This instance of semi-supervised learning is challenging for existing methods, because the exponential number of possible discrete latent configurations results in high variance gradient estimators. We first overcome this problem by extendi…

    Submitted 30 June, 2020; originally announced July 2020.

    Comments: Accepted to Uncertainty in Artificial Intelligence 2020

  17. arXiv:2006.12245

    cs.CV cs.LG stat.ML

    Enhancing Few-Shot Image Classification with Unlabelled Examples

    Authors: Peyman Bateni, Jarred Barber, Jan-Willem van de Meent, Frank Wood

    Abstract: We develop a transductive meta-learning method that uses unlabelled instances to improve few-shot image classification performance. Our approach combines a regularized Mahalanobis-distance-based soft k-means clustering procedure with a modified state of the art neural adaptive feature extractor to achieve improved test-time classification accuracy using unlabelled data. We evaluate our method on t…

    Submitted 21 October, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

  18. arXiv:2003.13221

    q-bio.PE cs.LG stat.ML

    Planning as Inference in Epidemiological Models

    Authors: Frank Wood, Andrew Warrington, Saeid Naderiparizi, Christian Weilbach, Vaden Masrani, William Harvey, Adam Scibior, Boyan Beronov, John Grefenstette, Duncan Campbell, Ali Nasseri

    Abstract: In this work we demonstrate how to automate parts of the infectious disease-control policy-making process via performing inference in existing epidemiological models. The kinds of inference tasks undertaken include computing the posterior distribution over controllable, via direct policy-making choices, simulation model parameters that give rise to acceptable disease progression outcomes. Among oth…

    Submitted 15 September, 2021; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Revisions

    Journal ref: Front Artif Intell. 2021; 4: 550603

  19. arXiv:2003.12908

    cs.LG stat.ML

    Coping With Simulators That Don't Always Return

    Authors: Andrew Warrington, Saeid Naderiparizi, Frank Wood

    Abstract: Deterministic models are approximations of reality that are easy to interpret and often easier to build than stochastic alternatives. Unfortunately, as nature is capricious, observational data can never be fully explained by deterministic models in practice. Observation and process noise need to be added to adapt deterministic models to behave stochastically, such that they are capable of explaini…

    Submitted 28 March, 2020; originally announced March 2020.

    Comments: AISTATS 2020 camera ready, version 1.0

  20. arXiv:1910.11961

    cs.LG stat.ML

    Attention for Inference Compilation

    Authors: William Harvey, Andreas Munk, Atılım Güneş Baydin, Alexander Bergholm, Frank Wood

    Abstract: We present a new approach to automatic amortized inference in universal probabilistic programs which improves performance compared to current methods. Our approach is a variation of inference compilation (IC) which leverages deep neural networks to approximate a posterior distribution over latent variables in a probabilistic program. A challenge with existing IC network architectures is that they…

    Submitted 25 October, 2019; originally announced October 2019.

  21. arXiv:1910.11950

    cs.LG stat.ML

    Probabilistic Surrogate Networks for Simulators with Unbounded Randomness

    Authors: Andreas Munk, Berend Zwartsenberg, Adam Ścibior, Atılım Güneş Baydin, Andrew Stewart, Goran Fernlund, Anoush Poursartip, Frank Wood

    Abstract: We present a framework for automatically structuring and training fast, approximate, deep neural surrogates of stochastic simulators. Unlike traditional approaches to surrogate modeling, our surrogates retain the interpretable structure and control flow of the reference simulator. Our surrogates target stochastic simulators where the number of random variables itself can be stochastic and potentia…

    Submitted 20 January, 2023; v1 submitted 25 October, 2019; originally announced October 2019.

  22. arXiv:1910.09056

    cs.LG cs.AI stat.ML

    Amortized Rejection Sampling in Universal Probabilistic Programming

    Authors: Saeid Naderiparizi, Adam Ścibior, Andreas Munk, Mehrdad Ghadiri, Atılım Güneş Baydin, Bradley Gram-Hansen, Christian Schroeder de Witt, Robert Zinkov, Philip H. S. Torr, Tom Rainforth, Yee Whye Teh, Frank Wood

    Abstract: Naive approaches to amortized inference in probabilistic programs with unbounded loops can produce estimators with infinite variance. This is particularly true of importance sampling inference in programs that explicitly include rejection sampling as part of the user-programmed generative procedure. In this paper we develop a new and efficient amortized importance sampling estimator. We prove fini…

    Submitted 28 March, 2022; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: AISTATS 2022 camera ready

  23. arXiv:1907.11075

    q-bio.NC cs.AI cs.LG stat.ML

    The Virtual Patch Clamp: Imputing C. elegans Membrane Potentials from Calcium Imaging

    Authors: Andrew Warrington, Arthur Spencer, Frank Wood

    Abstract: We develop a stochastic whole-brain and body simulator of the nematode roundworm Caenorhabditis elegans (C. elegans) and show that it is sufficiently regularizing to allow imputation of latent membrane potentials from partial calcium fluorescence imaging observations. This is the first attempt we know of to "complete the circle," where an anatomically grounded whole-connectome simulator is used to…

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: Includes Supplementary Materials

  24. arXiv:1907.08082

    stat.ML cs.LG stat.CO

    Amortized Monte Carlo Integration

    Authors: Adam Goliński, Frank Wood, Tom Rainforth

    Abstract: Current approaches to amortizing Bayesian inference focus solely on approximating the posterior distribution. Typically, this approximation is, in turn, used to calculate expectations for one or more target functions - a computational pipeline which is inefficient when the target function(s) are known upfront. In this paper, we address this inefficiency by introducing AMCI, a method for amortizing…

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: Awarded Best Paper Honourable Mention at International Conference on Machine Learning (ICML) 2019

  25. arXiv:1907.03382

    cs.LG cs.PF stat.ML

    Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

    Authors: Atılım Güneş Baydin, Lei Shao, Wahid Bhimji, Lukas Heinrich, Lawrence Meadows, Jialin Liu, Andreas Munk, Saeid Naderiparizi, Bradley Gram-Hansen, Gilles Louppe, Mingfei Ma, Xiaohui Zhao, Philip Torr, Victor Lee, Kyle Cranmer, Prabhat, Frank Wood

    Abstract: Probabilistic programming languages (PPLs) are receiving widespread attention for performing Bayesian inference in complex generative models. However, applications to science remain limited because of the impracticability of rewriting complex scientific simulators in a PPL, the computational cost of inference, and the lack of scalable implementations. To address these, we present a novel PPL frame…

    Submitted 27 August, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

    Comments: 14 pages, 8 figures

    MSC Class: 68T37; 68T05; 62P35

    ACM Class: G.3; I.2.6; J.2

    Journal ref: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC19), November 17-22, 2019

  26. arXiv:1907.00031

    cs.LG stat.ML

    The Thermodynamic Variational Objective

    Authors: Vaden Masrani, Tuan Anh Le, Frank Wood

    Abstract: We introduce the thermodynamic variational objective (TVO) for learning in both continuous and discrete deep generative models. The TVO arises from a key connection between variational inference and thermodynamic integration that results in a tighter lower bound to the log marginal likelihood than the standard variational evidence lower bound (ELBO) while remaining as broadly applicabl…

    Submitted 7 April, 2021; v1 submitted 28 June, 2019; originally announced July 2019.

  27. arXiv:1906.05462

    cs.LG stat.ML

    Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training

    Authors: William Harvey, Michael Teng, Frank Wood

    Abstract: Hard visual attention is a promising approach to reduce the computational burden of modern computer vision methodologies. Hard attention mechanisms are typically non-differentiable. They can be trained with reinforcement learning but the high-variance training this entails hinders more widespread application. We show how hard attention for image classification can be framed as a Bayesian optimal e…

    Submitted 14 June, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: 11 pages, 6 figures + appendix with 9 pages, 7 figures. Submitted to NeurIPS 2020

  28. arXiv:1903.02482

    cs.LG cs.PL stat.ML

    LF-PPL: A Low-Level First Order Probabilistic Programming Language for Non-Differentiable Models

    Authors: Yuan Zhou, Bradley J. Gram-Hansen, Tobias Kohn, Tom Rainforth, Hongseok Yang, Frank Wood

    Abstract: We develop a new Low-level, First-order Probabilistic Programming Language (LF-PPL) suited for models containing a mix of continuous, discrete, and/or piecewise-continuous variables. The key success of this language and its compilation scheme is in its ability to automatically distinguish parameters the density function is discontinuous with respect to, while further providing runtime checks for b…

    Submitted 6 March, 2019; originally announced March 2019.

    Comments: Published in the proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS)

  29. arXiv:1809.10756

    stat.ML cs.AI cs.LG cs.PL

    An Introduction to Probabilistic Programming

    Authors: Jan-Willem van de Meent, Brooks Paige, Hongseok Yang, Frank Wood

    Abstract: This book is a graduate-level introduction to probabilistic programming. It not only provides a thorough background for anyone wishing to use a probabilistic programming system, but also introduces the techniques needed to design and build these systems. It is aimed at people who have an undergraduate-level understanding of either or, ideally, both probabilistic machine learning and programming la…

    Submitted 19 October, 2021; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: Under review at Foundations and Trends in Machine Learning

  30. arXiv:1807.07706

    cs.LG hep-ph physics.data-an stat.ML

    Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

    Authors: Atılım Güneş Baydin, Lukas Heinrich, Wahid Bhimji, Lei Shao, Saeid Naderiparizi, Andreas Munk, Jialin Liu, Bradley Gram-Hansen, Gilles Louppe, Lawrence Meadows, Philip Torr, Victor Lee, Prabhat, Kyle Cranmer, Frank Wood

    Abstract: We present a novel probabilistic programming framework that couples directly to existing large-scale simulators through a cross-platform probabilistic execution protocol, which allows general-purpose inference engines to record and control random number draws within simulators in a language-agnostic way. The execution of existing simulators as probabilistic programs enables highly interpretable po…

    Submitted 17 February, 2020; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: 20 pages, 9 figures

    MSC Class: 68T37; 68T05; 62P35

    ACM Class: G.3; I.2.6; J.2

    Journal ref: In Advances in Neural Information Processing Systems 33 (NeurIPS), Vancouver, Canada, 2019

  31. arXiv:1806.09550

    stat.CO stat.ML

    Inference Trees: Adaptive Inference with Exploration

    Authors: Tom Rainforth, Yuan Zhou, Xiaoyu Lu, Yee Whye Teh, Frank Wood, Hongseok Yang, Jan-Willem van de Meent

    Abstract: We introduce inference trees (ITs), a new class of inference methods that build on ideas from Monte Carlo tree search to perform adaptive sampling in a manner that balances exploration with exploitation, ensures consistency, and alleviates pathologies in existing adaptive methods. ITs adaptively sample from hierarchical partitions of the parameter space, while simultaneously learning these partiti…

    Submitted 25 June, 2018; originally announced June 2018.

  32. arXiv:1806.02426

    cs.LG stat.ML

    Deep Variational Reinforcement Learning for POMDPs

    Authors: Maximilian Igl, Luisa Zintgraf, Tuan Anh Le, Frank Wood, Shimon Whiteson

    Abstract: Many real-world sequential decision making problems are partially observable by nature, and the environment model is typically unknown. Consequently, there is great need for reinforcement learning methods that can tackle such problems given only a stream of incomplete and noisy observations. In this paper, we propose deep variational reinforcement learning (DVRL), which introduces an inductive bia…

    Submitted 6 June, 2018; originally announced June 2018.

  33. arXiv:1805.10469

    stat.ML cs.LG

    Revisiting Reweighted Wake-Sleep for Models with Stochastic Control Flow

    Authors: Tuan Anh Le, Adam R. Kosiorek, N. Siddharth, Yee Whye Teh, Frank Wood

    Abstract: Stochastic control-flow models (SCFMs) are a class of generative models that involve branching on choices from discrete random variables. Amortized gradient-based learning of SCFMs is challenging as most approaches targeting discrete variables rely on their continuous relaxations, which can be intractable in SCFMs, as branching on relaxations requires evaluating all (exponentially many) branching…

    Submitted 16 September, 2019; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: Tuan Anh Le and Adam R. Kosiorek contributed equally; accepted to Uncertainty in Artificial Intelligence 2019

  34. arXiv:1804.03523

    stat.CO cs.PL stat.ML

    Hamiltonian Monte Carlo for Probabilistic Programs with Discontinuities

    Authors: Bradley Gram-Hansen, Yuan Zhou, Tobias Kohn, Tom Rainforth, Hongseok Yang, Frank Wood

    Abstract: Hamiltonian Monte Carlo (HMC) is arguably the dominant statistical inference algorithm used in most popular "first-order differentiable" Probabilistic Programming Languages (PPLs). However, the fact that HMC uses derivative information causes complications when the target distribution is non-differentiable with respect to one or more of the latent variables. In this paper, we show how to use exten…

    Submitted 2 January, 2019; v1 submitted 7 April, 2018; originally announced April 2018.

    Comments: 4 pages, 2 figures

    Journal ref: Inaugural Conference on Probabilistic Programming, 2018

  35. arXiv:1803.04209

    cs.DC cs.LG stat.ML

    High Throughput Synchronous Distributed Stochastic Gradient Descent

    Authors: Michael Teng, Frank Wood

    Abstract: We introduce a new, high-throughput, synchronous, distributed, data-parallel, stochastic-gradient-descent learning algorithm. This algorithm uses amortized inference in a compute-cluster-specific, deep, generative, dynamical model to perform joint posterior predictive inference of the mini-batch gradient computation times of all worker-nodes in a parallel computing cluster. We show that a synchron…

    Submitted 12 March, 2018; originally announced March 2018.

  36. arXiv:1802.04537

    stat.ML cs.LG

    Tighter Variational Bounds are Not Necessarily Better

    Authors: Tom Rainforth, Adam R. Kosiorek, Tuan Anh Le, Chris J. Maddison, Maximilian Igl, Frank Wood, Yee Whye Teh

    Abstract: We provide theoretical and empirical evidence that using tighter evidence lower bounds (ELBOs) can be detrimental to the process of learning an inference network by reducing the signal-to-noise ratio of the gradient estimator. Our results call into question common implicit assumptions that tighter ELBOs are better variational objectives for simultaneous model learning and inference amortization sc…

    Submitted 5 March, 2019; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: To appear at ICML 2018

  37. arXiv:1712.00287

    stat.ML cs.LG

    Faithful Inversion of Generative Models for Effective Amortized Inference

    Authors: Stefan Webb, Adam Golinski, Robert Zinkov, N. Siddharth, Tom Rainforth, Yee Whye Teh, Frank Wood

    Abstract: Inference amortization methods share information across multiple posterior-inference problems, allowing each to be carried out more efficiently. Generally, they require the inversion of the dependency structure in the generative model, as the modeller must learn a mapping from observations to distributions approximating the posterior. Previous approaches have involved inverting the dependency stru…

    Submitted 29 November, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: To appear at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada

  38. arXiv:1710.11397

    cs.CV q-bio.QM stat.ML

    Updating the VESICLE-CNN Synapse Detector

    Authors: Andrew Warrington, Frank Wood

    Abstract: We present an updated version of the VESICLE-CNN algorithm presented by Roncal et al. (2014). The original implementation makes use of a patch-based approach. This methodology is known to be slow due to repeated computations. We update this implementation to be fully convolutional through the use of dilated convolutions, recovering the expanded field of view achieved through the use of strided max…

    Submitted 31 October, 2017; originally announced October 2017.

    Comments: Submitted as a two-sided extended abstract to the NIPS 2017 workshop BigNeuro 2017: Analyzing brain data from nano to macroscale

  39. arXiv:1709.06181

    stat.CO stat.ME stat.ML

    On Nesting Monte Carlo Estimators

    Authors: Tom Rainforth, Robert Cornish, Hongseok Yang, Andrew Warrington, Frank Wood

    Abstract: Many problems in machine learning and statistics involve nested expectations and thus do not permit conventional Monte Carlo (MC) estimation. For such problems, one must nest estimators, such that terms in an outer estimator themselves involve calculation of a separate, nested, estimation. We investigate the statistical implications of nesting MC estimators, including cases of multiple levels of n…

    Submitted 23 May, 2018; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: To appear at International Conference on Machine Learning 2018

  40. arXiv:1707.04314

    stat.ML cs.AI cs.PL stat.CO

    Bayesian Optimization for Probabilistic Programs

    Authors: Tom Rainforth, Tuan Anh Le, Jan-Willem van de Meent, Michael A. Osborne, Frank Wood

    Abstract: We present the first general purpose framework for marginal maximum a posteriori estimation of probabilistic program variables. By using a series of code transformations, the evidence of any probabilistic program, and therefore of any graphical model, can be optimized with respect to an arbitrary subset of its sampled variables. To carry out this optimization, we develop the first Bayesian optimiz…

    Submitted 13 July, 2017; originally announced July 2017.

  41. arXiv:1706.00400

    stat.ML cs.AI cs.LG

    Learning Disentangled Representations with Semi-Supervised Deep Generative Models

    Authors: N. Siddharth, Brooks Paige, Jan-Willem van de Meent, Alban Desmaison, Noah D. Goodman, Pushmeet Kohli, Frank Wood, Philip H. S. Torr

    Abstract: Variational autoencoders (VAEs) learn representations of data by jointly training a probabilistic encoder and decoder network. Typically these models encode all features of the data into a single variable. Here we are interested in learning disentangled representations that encode distinct aspects of the data into separate variables. We propose to learn such representations using model architectur…

    Submitted 13 November, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: Accepted for publication at NIPS 2017

  42. arXiv:1705.10306  [pdf, other]

    stat.ML

    Auto-Encoding Sequential Monte Carlo

    Authors: Tuan Anh Le, Maximilian Igl, Tom Rainforth, Tom Jin, Frank Wood

    Abstract: We build on auto-encoding sequential Monte Carlo (AESMC): a method for model and proposal learning based on maximizing the lower bound to the log marginal likelihood in a broad family of structured probabilistic models. Our approach relies on the efficiency of sequential Monte Carlo (SMC) for performing inference in structured probabilistic models and the flexibility of deep neural networks to mod…

    Submitted 5 April, 2018; v1 submitted 29 May, 2017; originally announced May 2017.

  43. arXiv:1703.04782  [pdf, other]

    cs.LG stat.ML

    Online Learning Rate Adaptation with Hypergradient Descent

    Authors: Atilim Gunes Baydin, Robert Cornish, David Martinez Rubio, Mark Schmidt, Frank Wood

    Abstract: We introduce a general method for improving the convergence rate of gradient-based optimizers that is easy to implement and works well in practice. We demonstrate the effectiveness of the method in a range of optimization problems by applying it to stochastic gradient descent, stochastic gradient descent with Nesterov momentum, and Adam, showing that it significantly reduces the need for the manua…

    Submitted 25 February, 2018; v1 submitted 14 March, 2017; originally announced March 2017.

    Comments: 11 pages, 4 figures

    MSC Class: 68T05

    ACM Class: G.1.6; I.2.6

    Journal ref: In Sixth International Conference on Learning Representations (ICLR), Vancouver, Canada, April 30 -- May 3, 2018. https://openreview.net/forum?id=BkrsAzWAb

  44. arXiv:1703.00868  [pdf, other]

    cs.LG cs.CV stat.ML

    Using Synthetic Data to Train Neural Networks is Model-Based Reasoning

    Authors: Tuan Anh Le, Atilim Gunes Baydin, Robert Zinkov, Frank Wood

    Abstract: We draw a formal connection between using synthetic training data to optimize neural network parameters and approximate, Bayesian, model-based reasoning. In particular, training a neural network using synthetic data can be viewed as learning a proposal distribution generator for approximate inference in the synthetic-data generative model. We demonstrate this connection in a recognition task where…

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: 8 pages, 4 figures

    MSC Class: 68T05; 68T10

    ACM Class: I.2.6; I.7.5

  45. arXiv:1612.00951  [pdf, other]

    stat.CO stat.ME stat.ML

    On the Pitfalls of Nested Monte Carlo

    Authors: Tom Rainforth, Robert Cornish, Hongseok Yang, Frank Wood

    Abstract: There is an increasing interest in estimating expectations outside of the classical inference framework, such as for models expressed as probabilistic programs. Many of these contexts call for some form of nested inference to be applied. In this paper, we analyse the behaviour of nested Monte Carlo (NMC) schemes, for which classical convergence proofs are insufficient. We give conditions under whi…

    Submitted 3 December, 2016; originally announced December 2016.

    Comments: Appearing in NIPS Workshop on Advances in Approximate Bayesian Inference 2016

  46. arXiv:1611.07492  [pdf, other]

    stat.ML cs.CV cs.LG

    Inducing Interpretable Representations with Variational Autoencoders

    Authors: N. Siddharth, Brooks Paige, Alban Desmaison, Jan-Willem van de Meent, Frank Wood, Noah D. Goodman, Pushmeet Kohli, Philip H. S. Torr

    Abstract: We develop a framework for incorporating structured graphical models in the encoders of variational autoencoders (VAEs) that allows us to induce interpretable representations through approximate variational inference. This allows us to both perform reasoning (e.g. classification) under the structural constraints of a given graphical model, and use deep generative models to deal with messy,…

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  47. arXiv:1611.06863  [pdf, other]

    stat.ML cs.LG

    Probabilistic structure discovery in time series data

    Authors: David Janz, Brooks Paige, Tom Rainforth, Jan-Willem van de Meent, Frank Wood

    Abstract: Existing methods for structure discovery in time series data construct interpretable, compositional kernels for Gaussian process regression models. While the learned Gaussian process model provides posterior mean and variance estimates, typically the structure is learned via a greedy optimization procedure. This restricts the space of possible solutions and leads to over-confident uncertainty esti…

    Submitted 21 November, 2016; originally announced November 2016.

  48. arXiv:1610.09900  [pdf, other]

    cs.AI cs.LG stat.ML

    Inference Compilation and Universal Probabilistic Programming

    Authors: Tuan Anh Le, Atilim Gunes Baydin, Frank Wood

    Abstract: We introduce a method for using deep neural networks to amortize the cost of inference in models from the family induced by universal probabilistic programming languages, establishing a framework that combines the strengths of probabilistic programming and deep learning methods. We call what we do "compilation of inference" because our method transforms a denotational specification of an inference…

    Submitted 2 March, 2017; v1 submitted 31 October, 2016; originally announced October 2016.

    Comments: 11 pages, 6 figures

    MSC Class: 68T37; 68T05

    ACM Class: G.3; I.2.6

    Journal ref: In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS 2017), 54:1338--1348. Proceedings of Machine Learning Research. Fort Lauderdale, FL, USA: PMLR

  49. arXiv:1602.06701  [pdf, other]

    stat.ML

    Inference Networks for Sequential Monte Carlo in Graphical Models

    Authors: Brooks Paige, Frank Wood

    Abstract: We introduce a new approach for amortizing inference in directed graphical models by learning heuristic approximations to stochastic inverses, designed specifically for use as proposal distributions in sequential Monte Carlo methods. We describe a procedure for constructing and learning a structured neural network which represents an inverse factorization of the graphical model, resulting in a con…

    Submitted 7 March, 2018; v1 submitted 22 February, 2016; originally announced February 2016.

    Comments: 10 pages. Updated from version at ICML 2016; includes code at http://github.com/tbrx/compiled-inference

    Journal ref: Paige, B., & Wood, F. (2016). Inference Networks for Sequential Monte Carlo in Graphical Models. In Proceedings of the 33rd International Conference on Machine Learning, JMLR W&CP 48: 3040-3049

  50. arXiv:1602.05128  [pdf, other]

    stat.CO stat.ML

    Interacting Particle Markov Chain Monte Carlo

    Authors: Tom Rainforth, Christian A. Naesseth, Fredrik Lindsten, Brooks Paige, Jan-Willem van de Meent, Arnaud Doucet, Frank Wood

    Abstract: We introduce interacting particle Markov chain Monte Carlo (iPMCMC), a PMCMC method based on an interacting pool of standard and conditional sequential Monte Carlo samplers. Like related methods, iPMCMC is a Markov chain Monte Carlo sampler on an extended space. We present empirical results that show significant improvements in mixing rates relative to both non-interacting PMCMC samplers, and a si…

    Submitted 12 April, 2017; v1 submitted 16 February, 2016; originally announced February 2016.

    Journal ref: JMLR W&CP 48: 2616-2625, 2016