Skip to main content

Showing 1–50 of 104 results for author: Wood, F

.
  1. arXiv:2505.12707  [pdf, other

    cs.LG cs.AI cs.MA

    PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI

    Authors: Yingchen He, Christian D. Weilbach, Martyna E. Wojciechowska, Yuxuan Zhang, Frank Wood

    Abstract: Advances in deep generative modelling have made it increasingly plausible to train human-level embodied agents. Yet progress has been limited by the absence of large-scale, real-time, multi-modal, and socially interactive datasets that reflect the sensory-motor complexity of natural environments. To address this, we present PLAICraft, a novel data collection platform and dataset capturing multipla… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 9 pages, 8 figures

  2. arXiv:2503.11537  [pdf, other

    physics.comp-ph physics.chem-ph

    Basic stability tests of machine learning potentials for molecular simulations in computational drug discovery

    Authors: Kavindri Ranasinghe, Adam L. Baskerville, Geoffrey P. F. Wood, Gerhard Koenig

    Abstract: Neural network potentials trained on quantum-mechanical data can calculate molecular interactions with relatively high speed and accuracy. However, neural network potentials might exhibit instabilities, nonphysical behavior, or lack accuracy. To assess the reliability of neural network potentials, a series of tests is conducted during model training, in the gas phase, and in the condensed phase. T… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 30 pages, 5 figures

    MSC Class: 82D03

  3. arXiv:2502.20371  [pdf, other

    cs.LG

    Constrained Generative Modeling with Manually Bridged Diffusion Models

    Authors: Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg, Frank Wood

    Abstract: In this paper we describe a novel framework for diffusion-based generative modeling on constrained spaces. In particular, we introduce manual bridges, a framework that expands the kinds of constraints that can be practically used to form so-called diffusion bridges. We develop a mechanism for combining multiple such constraints so that the resulting multiply-constrained model remains a manual brid… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: AAAI 2025

  4. arXiv:2502.09587  [pdf, other

    cs.LG cs.RO

    Rolling Ahead Diffusion for Traffic Scene Simulation

    Authors: Yunpeng Liu, Matthew Niedoba, William Harvey, Adam Scibior, Berend Zwartsenberg, Frank Wood

    Abstract: Realistic driving simulation requires that NPCs not only mimic natural driving behaviors but also react to the behavior of other simulated agents. Recent developments in diffusion-based scenario generation focus on creating diverse and realistic traffic scenarios by jointly modelling the motion of all the agents in the scene. However, these traffic scenarios do not react when the motion of agents… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: Accepted to Workshop on Machine Learning for Autonomous Driving at AAAI 2025

  5. arXiv:2501.12408  [pdf, other

    cs.AI cs.LG cs.RO eess.SY stat.ML

    Control-ITRA: Controlling the Behavior of a Driving Model

    Authors: Vasileios Lioutas, Adam Scibior, Matthew Niedoba, Berend Zwartsenberg, Frank Wood

    Abstract: Simulating realistic driving behavior is crucial for developing and testing autonomous systems in complex traffic environments. Equally important is the ability to control the behavior of simulated agents to tailor scenarios to specific research needs and safety considerations. This paper extends the general-purpose multi-agent driving behavior model ITRA (Scibior et al., 2021), by introducing a m… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: 16 pages, 2 figures

  6. arXiv:2411.19339  [pdf, other

    cs.LG cs.AI cs.CV

    Towards a Mechanistic Explanation of Diffusion Model Generalization

    Authors: Matthew Niedoba, Berend Zwartsenberg, Kevin Murphy, Frank Wood

    Abstract: We propose a simple, training-free mechanism which explains the generalization behaviour of diffusion models. By comparing pre-trained diffusion models to their theoretically optimal empirical counterparts, we identify a shared local inductive bias across a variety of network architectures. From this observation, we hypothesize that network denoisers generalize through localized denoising operatio… ▽ More

    Submitted 14 February, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: 24 pages, 23 figures

  7. arXiv:2410.16818  [pdf, other

    physics.comp-ph physics.bio-ph physics.chem-ph

    An evaluation of machine learning/molecular mechanics end-state corrections with mechanical embedding to calculate relative protein-ligand binding free energies

    Authors: Johannes Karwounopoulos, Mateusz Bieniek, Zhiyi Wu, Adam L. Baskerville, Gerhard Koenig, Benjamin P. Cossins, Geoffrey P. F. Wood

    Abstract: The development of machine-learning (ML) potentials offers significant accuracy improvements compared to molecular mechanics (MM) because of the inclusion of quantum-mechanical effects in molecular interactions. However, ML simulations are several times more computationally demanding than MM simulations, so there is a trade-off between speed and accuracy. One possible compromise are hybrid machine… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: 47 pages, 13 figures, 10 tables

    MSC Class: 82D03

    Journal ref: J. Chem. Theory Comput. 2025, 21(2), 967-977

  8. arXiv:2407.05494  [pdf, other

    cs.LG cs.NE

    Prospective Messaging: Learning in Networks with Communication Delays

    Authors: Ryan Fayyazi, Christian Weilbach, Frank Wood

    Abstract: Inter-neuron communication delays are ubiquitous in physically realized neural networks such as biological neural circuits and neuromorphic hardware. These delays have significant and often disruptive consequences on network dynamics during training and inference. It is therefore essential that communication delays be accounted for, both in computational models of biological neural networks and in… ▽ More

    Submitted 8 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

  9. arXiv:2406.04814  [pdf, other

    cs.CV cs.LG

    Lifelong Learning of Video Diffusion Models From a Single Video Stream

    Authors: Jason Yoo, Yingchen He, Saeid Naderiparizi, Dylan Green, Gido M. van de Ven, Geoff Pleiss, Frank Wood

    Abstract: This work demonstrates that training autoregressive video diffusion models from a single, continuous video stream is not only possible but remarkably can also be competitive with standard offline training approaches given the same number of gradient steps. Our demonstration further reveals that this main result can be achieved using experience replay that only retains a subset of the preceding vid… ▽ More

    Submitted 28 November, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  10. arXiv:2405.04491  [pdf, other

    cs.AI cs.LG cs.MA cs.RO

    TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters

    Authors: Jonathan Wilder Lavington, Ke Zhang, Vasileios Lioutas, Matthew Niedoba, Yunpeng Liu, Dylan Green, Saeid Naderiparizi, Xiaoxuan Liang, Setareh Dabiri, Adam Ścibior, Berend Zwartsenberg, Frank Wood

    Abstract: The training, testing, and deployment, of autonomous vehicles requires realistic and efficient simulators. Moreover, because of the high variability between different problems presented in different autonomous systems, these simulators need to be easy to use, and easy to modify. To address these problems we introduce TorchDriveSim and its benchmark extension TorchDriveEnv. TorchDriveEnv is a light… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  11. arXiv:2405.00251  [pdf, other

    cs.CV cs.LG

    Semantically Consistent Video Inpainting with Conditional Diffusion Models

    Authors: Dylan Green, William Harvey, Saeid Naderiparizi, Matthew Niedoba, Yunpeng Liu, Xiaoxuan Liang, Jonathan Lavington, Ke Zhang, Vasileios Lioutas, Setareh Dabiri, Adam Scibior, Berend Zwartsenberg, Frank Wood

    Abstract: Current state-of-the-art methods for video inpainting typically rely on optical flow or attention-based approaches to inpaint masked regions by propagating visual information across frames. While such approaches have led to significant progress on standard benchmarks, they struggle with tasks that require the synthesis of novel content that is not present in other frames. In this paper, we reframe… ▽ More

    Submitted 8 October, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

  12. arXiv:2404.09636  [pdf, other

    cs.LG cs.AI stat.ML

    All-in-one simulation-based inference

    Authors: Manuel Gloeckler, Michael Deistler, Christian Weilbach, Frank Wood, Jakob H. Macke

    Abstract: Amortized Bayesian inference trains neural networks to solve stochastic inference problems using model simulations, thereby making it possible to rapidly perform Bayesian inference for any newly observed data. However, current simulation-based amortized inference methods are simulation-hungry and inflexible: They require the specification of a fixed parametric prior, simulator, and inference tasks… ▽ More

    Submitted 15 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: To be published in the proceedings of the 41st International Conference on Machine Learning (ICML 2024), Vienna, Austria. PMLR 235, 2024

  13. arXiv:2403.00025  [pdf, ps, other

    cs.LG cs.AI

    On the Challenges and Opportunities in Generative AI

    Authors: Laura Manduchi, Kushagra Pandey, Clara Meister, Robert Bamler, Ryan Cotterell, Sina Däubener, Sophie Fellenz, Asja Fischer, Thomas Gärtner, Matthias Kirchler, Marius Kloft, Yingzhen Li, Christoph Lippert, Gerard de Melo, Eric Nalisnick, Björn Ommer, Rajesh Ranganath, Maja Rudolph, Karen Ullrich, Guy Van den Broeck, Julia E Vogt, Yixin Wang, Florian Wenzel, Frank Wood, Stephan Mandt , et al. (1 additional authors not shown)

    Abstract: The field of deep generative modeling has grown rapidly in the last few years. With the availability of massive amounts of training data coupled with advances in scalable unsupervised learning paradigms, recent large-scale generative models show tremendous promise in synthesizing high-resolution images and text, as well as structured data such as videos and molecules. However, we argue that curren… ▽ More

    Submitted 20 March, 2025; v1 submitted 28 February, 2024; originally announced March 2024.

  14. arXiv:2402.09542  [pdf, other

    cs.LG

    Layerwise Proximal Replay: A Proximal Point Method for Online Continual Learning

    Authors: Jason Yoo, Yunpeng Liu, Frank Wood, Geoff Pleiss

    Abstract: In online continual learning, a neural network incrementally learns from a non-i.i.d. data stream. Nearly all online continual learning methods employ experience replay to simultaneously prevent catastrophic forgetting and underfitting on past data. Our work demonstrates a limitation of this approach: neural networks trained with experience replay tend to have unstable optimization trajectories, i… ▽ More

    Submitted 18 July, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  15. arXiv:2402.08018  [pdf, other

    cs.LG cs.CV stat.ML

    Nearest Neighbour Score Estimators for Diffusion Generative Models

    Authors: Matthew Niedoba, Dylan Green, Saeid Naderiparizi, Vasileios Lioutas, Jonathan Wilder Lavington, Xiaoxuan Liang, Yunpeng Liu, Ke Zhang, Setareh Dabiri, Adam Ścibior, Berend Zwartsenberg, Frank Wood

    Abstract: Score function estimation is the cornerstone of both training and sampling from diffusion generative models. Despite this fact, the most commonly used estimators are either biased neural network approximations or high variance Monte Carlo estimators based on the conditional score. We introduce a novel nearest neighbour score function estimator which utilizes multiple samples from the training set… ▽ More

    Submitted 16 July, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 25 pages, 9 figures. To be published in ICML 2024

  16. arXiv:2309.12508  [pdf, other

    cs.LG cs.RO

    A Diffusion-Model of Joint Interactive Navigation

    Authors: Matthew Niedoba, Jonathan Wilder Lavington, Yunpeng Liu, Vasileios Lioutas, Justice Sefas, Xiaoxuan Liang, Dylan Green, Setareh Dabiri, Berend Zwartsenberg, Adam Scibior, Frank Wood

    Abstract: Simulation of autonomous vehicle systems requires that simulated traffic participants exhibit diverse and realistic behaviors. The use of prerecorded real-world traffic scenarios in simulation ensures realism but the rarity of safety critical events makes large scale collection of driving scenarios expensive. In this paper, we present DJINN - a diffusion based method of generating traffic scenario… ▽ More

    Submitted 24 October, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 10 pages, 4 figures. Accepted to NeurIPS 2023

  17. arXiv:2307.16463  [pdf, other

    cs.LG stat.ML

    Don't be so negative! Score-based Generative Modeling with Oracle-assisted Guidance

    Authors: Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg, Frank Wood

    Abstract: The maximum likelihood principle advocates parameter estimation via optimization of the data likelihood function. Models estimated in this way can exhibit a variety of generalization characteristics dictated by, e.g. architecture, parameterization, and optimization bias. This work addresses model learning in a setting where there further exists side-information in the form of an oracle that can la… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

  18. arXiv:2305.14621  [pdf, other

    cs.CV

    Realistically distributing object placements in synthetic training data improves the performance of vision-based object detection models

    Authors: Setareh Dabiri, Vasileios Lioutas, Berend Zwartsenberg, Yunpeng Liu, Matthew Niedoba, Xiaoxuan Liang, Dylan Green, Justice Sefas, Jonathan Wilder Lavington, Frank Wood, Adam Scibior

    Abstract: When training object detection models on synthetic data, it is important to make the distribution of synthetic data as close as possible to the distribution of real data. We investigate specifically the impact of object placement distribution, keeping all other aspects of synthetic data fixed. Our experiment, training a 3D vehicle detection model in CARLA and testing on KITTI, demonstrates a subst… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  19. arXiv:2305.11856  [pdf, other

    cs.CV cs.RO

    Video Killed the HD-Map: Predicting Multi-Agent Behavior Directly From Aerial Images

    Authors: Yunpeng Liu, Vasileios Lioutas, Jonathan Wilder Lavington, Matthew Niedoba, Justice Sefas, Setareh Dabiri, Dylan Green, Xiaoxuan Liang, Berend Zwartsenberg, Adam Ścibior, Frank Wood

    Abstract: The development of algorithms that learn multi-agent behavioral models using human demonstrations has led to increasingly realistic simulations in the field of autonomous driving. In general, such models learn to jointly predict trajectories for all controlled agents by exploiting road context information such as drivable lanes obtained from manually annotated high-definition (HD) maps. Recent stu… ▽ More

    Submitted 19 September, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    ACM Class: I.2.9; I.4.9

    Journal ref: 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)

  20. arXiv:2303.16187  [pdf, other

    cs.CV cs.LG

    Visual Chain-of-Thought Diffusion Models

    Authors: William Harvey, Frank Wood

    Abstract: Recent progress with conditional image diffusion models has been stunning, and this holds true whether we are speaking about models conditioned on a text description, a scene layout, or a sketch. Unconditional image diffusion models are also improving but lag behind, as do diffusion models which are conditioned on lower-dimensional features like class labels. We propose to close the gap between co… ▽ More

    Submitted 20 June, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  21. arXiv:2210.12236  [pdf, other

    stat.ML cs.LG

    Uncertain Evidence in Probabilistic Models and Stochastic Simulators

    Authors: Andreas Munk, Alexander Mead, Frank Wood

    Abstract: We consider the problem of performing Bayesian inference in probabilistic models where observations are accompanied by uncertainty, referred to as "uncertain evidence." We explore how to interpret uncertain evidence, and by extension the importance of proper interpretation as it pertains to inference about latent variables. We consider a recently-proposed method "distributional evidence" as well a… ▽ More

    Submitted 26 January, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  22. arXiv:2210.11633  [pdf, other

    cs.LG cs.NE cs.PL

    Graphically Structured Diffusion Models

    Authors: Christian Weilbach, William Harvey, Frank Wood

    Abstract: We introduce a framework for automatically defining and learning deep generative models with problem-specific structure. We tackle problem domains that are more traditionally solved by algorithms such as sorting, constraint satisfaction for Sudoku, and matrix factorization. Concretely, we train diffusion models with an architecture tailored to the problem specification. This problem specification… ▽ More

    Submitted 16 June, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    ACM Class: G.3

  23. arXiv:2208.04987  [pdf, other

    cs.AI cs.HC cs.RO eess.SY

    Vehicle Type Specific Waypoint Generation

    Authors: Yunpeng Liu, Jonathan Wilder Lavington, Adam Scibior, Frank Wood

    Abstract: We develop a generic mechanism for generating vehicle-type specific sequences of waypoints from a probabilistic foundation model of driving behavior. Many foundation behavior models are trained on data that does not include vehicle information, which limits their utility in downstream applications such as planning. Our novel methodology conditionally specializes such a behavior predictive model to… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Journal ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  24. arXiv:2206.09021  [pdf, other

    stat.ML cs.LG

    Conditional Permutation Invariant Flows

    Authors: Berend Zwartsenberg, Adam Ścibior, Matthew Niedoba, Vasileios Lioutas, Yunpeng Liu, Justice Sefas, Setareh Dabiri, Jonathan Wilder Lavington, Trevor Campbell, Frank Wood

    Abstract: We present a novel, conditional generative probabilistic model of set-valued data with a tractable log density. This model is a continuous normalizing flow governed by permutation equivariant dynamics. These dynamics are driven by a learnable per-set-element term and pairwise interactions, both parametrized by deep neural networks. We illustrate the utility of this model via applications including… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 20 pages, 10 figures

    ACM Class: I.2.0

  25. arXiv:2205.15460  [pdf, other

    stat.ML cs.LG

    Critic Sequential Monte Carlo

    Authors: Vasileios Lioutas, Jonathan Wilder Lavington, Justice Sefas, Matthew Niedoba, Yunpeng Liu, Berend Zwartsenberg, Setareh Dabiri, Frank Wood, Adam Scibior

    Abstract: We introduce CriticSMC, a new algorithm for planning as inference built from a composition of sequential Monte Carlo with learned Soft-Q function heuristic factors. These heuristic factors, obtained from parametric approximations of the marginal likelihood ahead, more effectively guide SMC towards the desired target distribution, which is particularly helpful for planning in environments with hard… ▽ More

    Submitted 21 January, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICLR 2023

  26. arXiv:2205.11495  [pdf, other

    cs.CV cs.LG

    Flexible Diffusion Modeling of Long Videos

    Authors: William Harvey, Saeid Naderiparizi, Vaden Masrani, Christian Weilbach, Frank Wood

    Abstract: We present a framework for video modeling based on denoising diffusion probabilistic models that produces long-duration video completions in a variety of realistic environments. We introduce a generative model that can at test-time sample any arbitrary subset of video frames conditioned on any other subset and present an architecture adapted for this purpose. Doing so allows us to efficiently comp… ▽ More

    Submitted 15 December, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  27. arXiv:2205.09930  [pdf, other

    cs.LG cs.AI

    BayesPCN: A Continually Learnable Predictive Coding Associative Memory

    Authors: Jason Yoo, Frank Wood

    Abstract: Associative memory plays an important role in human intelligence and its mechanisms have been linked to attention in machine learning. While the machine learning community's interest in associative memories has recently been rekindled, most work has focused on memory recall ($read$) over memory learning ($write$). In this paper, we present BayesPCN, a hierarchical associative memory capable of per… ▽ More

    Submitted 11 November, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  28. arXiv:2202.08587  [pdf, other

    cs.LG stat.ML

    Gradients without Backpropagation

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr

    Abstract: Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic differentiation algorithms that also includes the forward mode. We present a method to compute gradients based solely on the directional derivative that one can comp… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 10 pages, 6 figures

    MSC Class: 68T07 ACM Class: I.2.6; I.2.5

  29. arXiv:2202.02693  [pdf, other

    cs.LG cs.AI

    Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning

    Authors: Michael Teng, Michiel van de Panne, Frank Wood

    Abstract: Distributional reinforcement learning (RL) aims to learn a value-network that predicts the full distribution of the returns for a given state, often modeled via a quantile-based critic. This approach has been successfully integrated into common RL methods for continuous control, giving rise to algorithms such as Distributional Soft Actor-Critic (DSAC). In this paper, we introduce multi-sample targ… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: Submitted to ICML 2022

  30. arXiv:2201.05151  [pdf, other

    cs.CV

    Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning

    Authors: Peyman Bateni, Jarred Barber, Raghav Goyal, Vaden Masrani, Jan-Willem van de Meent, Leonid Sigal, Frank Wood

    Abstract: Modern deep learning requires large-scale extensively labelled datasets for training. Few-shot learning aims to alleviate this issue by learning effectively from few labelled examples. In previously proposed few-shot visual classifiers, it is assumed that the feature manifold, where classifier decisions are made, has uncorrelated feature dimensions and uniform feature variance. In this work, we fo… ▽ More

    Submitted 12 December, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  31. arXiv:2107.00745  [pdf, other

    cs.LG cs.AI stat.ML

    q-Paths: Generalizing the Geometric Annealing Path using Power Means

    Authors: Vaden Masrani, Rob Brekelmans, Thang Bui, Frank Nielsen, Aram Galstyan, Greg Ver Steeg, Frank Wood

    Abstract: Many common machine learning methods involve the geometric annealing path, a sequence of intermediate densities between two distributions of interest constructed using the geometric average. While alternatives such as the moment-averaging path have demonstrated performance gains in some settings, their practical applicability remains limited by exponential family endpoint assumptions and a lack of… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.07823

  32. arXiv:2106.10314  [pdf, other

    stat.ML cs.LG

    Differentiable Particle Filtering without Modifying the Forward Pass

    Authors: Adam Ścibior, Frank Wood

    Abstract: Particle filters are not compatible with automatic differentiation due to the presence of discrete resampling steps. While known estimators for the score function, based on Fisher's identity, can be computed using particle filters, up to this point they required manual implementation. In this paper we show that such estimators can be computed using automatic differentiation, after introducing a si… ▽ More

    Submitted 19 October, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 24 pages, 3 figures

  33. arXiv:2104.11212  [pdf, other

    stat.ML cs.LG

    Imagining The Road Ahead: Multi-Agent Trajectory Prediction via Differentiable Simulation

    Authors: Adam Scibior, Vasileios Lioutas, Daniele Reda, Peyman Bateni, Frank Wood

    Abstract: We develop a deep generative model built on a fully differentiable simulator for multi-agent trajectory prediction. Agents are modeled with conditional recurrent variational neural networks (CVRNNs), which take as input an ego-centric birdview image representing the current state of the world and output an action, consisting of steering and acceleration, which is used to derive the subsequent agen… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 10 pages, 8 figures

  34. arXiv:2102.12037  [pdf, other

    cs.CV cs.AI

    Conditional Image Generation by Conditioning Variational Auto-Encoders

    Authors: William Harvey, Saeid Naderiparizi, Frank Wood

    Abstract: We present a conditional variational auto-encoder (VAE) which, to avoid the substantial cost of training from scratch, uses an architecture and training objective capable of leveraging a foundation model in the form of a pretrained unconditional VAE. To train the conditional VAE, we only need to train an artifact to perform amortized inference over the unconditional VAE's latent variables given a… ▽ More

    Submitted 28 May, 2022; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: 37 pages, 20 figures

  35. arXiv:2012.15566  [pdf, other

    cs.LG stat.ML

    Robust Asymmetric Learning in POMDPs

    Authors: Andrew Warrington, J. Wilder Lavington, Adam Ścibior, Mark Schmidt, Frank Wood

    Abstract: Policies for partially observed Markov decision processes can be efficiently learned by imitating policies for the corresponding fully observed Markov decision processes. Unfortunately, existing approaches for this kind of imitation learning have a serious flaw: the expert does not know what the trainee cannot see, and so may encourage actions that are sub-optimal, even unsafe, under partial infor… ▽ More

    Submitted 1 July, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: ICML 2021

  36. arXiv:2012.07823  [pdf, other

    cs.LG

    Annealed Importance Sampling with q-Paths

    Authors: Rob Brekelmans, Vaden Masrani, Thang Bui, Frank Wood, Aram Galstyan, Greg Ver Steeg, Frank Nielsen

    Abstract: Annealed importance sampling (AIS) is the gold standard for estimating partition functions or marginal likelihoods, corresponding to importance sampling over a path of distributions between a tractable base and an unnormalized target. While AIS yields an unbiased estimator for any path, existing literature has been primarily limited to the geometric mixture or moment-averaged paths associated with… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: NeurIPS Workshop on Deep Learning through Information Geometry (Best Paper Award)

    Journal ref: Published at UAI 2021 https://arxiv.boxedpaper.com/abs/2107.00745

  37. arXiv:2012.05390  [pdf, other

    cs.LG cs.AI

    Ensemble Squared: A Meta AutoML System

    Authors: Jason Yoo, Tony Joseph, Dylan Yung, S. Ali Nasseri, Frank Wood

    Abstract: There are currently many barriers that prevent non-experts from exploiting machine learning solutions ranging from the lack of intuition on statistical learning techniques to the trickiness of hyperparameter tuning. Such barriers have led to an explosion of interest in automated machine learning (AutoML), whereby an off-the-shelf system can take care of many of the steps for end-users without the… ▽ More

    Submitted 19 June, 2021; v1 submitted 9 December, 2020; originally announced December 2020.

  38. arXiv:2010.15750  [pdf, other

    cs.LG

    Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective

    Authors: Vu Nguyen, Vaden Masrani, Rob Brekelmans, Michael A. Osborne, Frank Wood

    Abstract: Achieving the full promise of the Thermodynamic Variational Objective (TVO), a recently proposed variational lower bound on the log evidence involving a one-dimensional Riemann integral approximation, requires choosing a "schedule" of sorted discretization points. This paper introduces a bespoke Gaussian process bandit optimization method for automatically choosing these points. Our approach not o… ▽ More

    Submitted 20 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020

  39. arXiv:2010.03753  [pdf, other

    cs.LG stat.ML

    Uncertainty in Neural Processes

    Authors: Saeid Naderiparizi, Kenny Chiu, Benjamin Bloem-Reddy, Frank Wood

    Abstract: We explore the effects of architecture and training objective choice on amortized posterior predictive inference in probabilistic conditional generative models. We aim this work to be a counterpoint to a recent trend in the literature that stresses achieving good samples when the amount of conditioning data is large. We instead focus our attention on the case where the amount of conditioning data… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

  40. arXiv:2010.01274  [pdf, other

    cs.LG stat.ML

    Assisting the Adversary to Improve GAN Training

    Authors: Andreas Munk, William Harvey, Frank Wood

    Abstract: Some of the most popular methods for improving the stability and performance of GANs involve constraining or regularizing the discriminator. In this paper we consider a largely overlooked regularization technique which we refer to as the Adversary's Assistant (AdvAs). We motivate this using a different perspective to that of prior work. Specifically, we consider a common mismatch between theoretic… ▽ More

    Submitted 8 December, 2020; v1 submitted 3 October, 2020; originally announced October 2020.

  41. arXiv:2007.00642  [pdf, other

    cs.LG stat.ML

    All in the Exponential Family: Bregman Duality in Thermodynamic Variational Inference

    Authors: Rob Brekelmans, Vaden Masrani, Frank Wood, Greg Ver Steeg, Aram Galstyan

    Abstract: The recently proposed Thermodynamic Variational Objective (TVO) leverages thermodynamic integration to provide a family of variational inference objectives, which both tighten and generalize the ubiquitous Evidence Lower Bound (ELBO). However, the tightness of TVO bounds was not previously known, an expensive grid search was used to choose a "schedule" of intermediate distributions, and model lear… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: ICML 2020

  42. arXiv:2007.00155  [pdf, other

    cs.LG stat.ML

    Semi-supervised Sequential Generative Models

    Authors: Michael Teng, Tuan Anh Le, Adam Scibior, Frank Wood

    Abstract: We introduce a novel objective for training deep generative time-series models with discrete latent variables for which supervision is only sparsely available. This instance of semi-supervised learning is challenging for existing methods, because the exponential number of possible discrete latent configurations results in high variance gradient estimators. We first overcome this problem by extendi… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

    Comments: Accepted to Uncertainty in Artificial Intelligence 2020

  43. arXiv:2006.12245  [pdf, other

    cs.CV cs.LG stat.ML

    Enhancing Few-Shot Image Classification with Unlabelled Examples

    Authors: Peyman Bateni, Jarred Barber, Jan-Willem van de Meent, Frank Wood

    Abstract: We develop a transductive meta-learning method that uses unlabelled instances to improve few-shot image classification performance. Our approach combines a regularized Mahalanobis-distance-based soft k-means clustering procedure with a modified state of the art neural adaptive feature extractor to achieve improved test-time classification accuracy using unlabelled data. We evaluate our method on t… ▽ More

    Submitted 21 October, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

  44. arXiv:2003.13221  [pdf, other

    q-bio.PE cs.LG stat.ML

    Planning as Inference in Epidemiological Models

    Authors: Frank Wood, Andrew Warrington, Saeid Naderiparizi, Christian Weilbach, Vaden Masrani, William Harvey, Adam Scibior, Boyan Beronov, John Grefenstette, Duncan Campbell, Ali Nasseri

    Abstract: In this work we demonstrate how to automate parts of the infectious disease-control policy-making process via performing inference in existing epidemiological models. The kind of inference tasks undertaken include computing the posterior distribution over controllable, via direct policy-making choices, simulation model parameters that give rise to acceptable disease progression outcomes. Among oth… ▽ More

    Submitted 15 September, 2021; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Revisions

    Journal ref: Front Artif Intell. 2021; 4: 550603

  45. arXiv:2003.12908  [pdf, other

    cs.LG stat.ML

    Coping With Simulators That Don't Always Return

    Authors: Andrew Warrington, Saeid Naderiparizi, Frank Wood

    Abstract: Deterministic models are approximations of reality that are easy to interpret and often easier to build than stochastic alternatives. Unfortunately, as nature is capricious, observational data can never be fully explained by deterministic models in practice. Observation and process noise need to be added to adapt deterministic models to behave stochastically, such that they are capable of explaini… ▽ More

    Submitted 28 March, 2020; originally announced March 2020.

    Comments: AISTATS 2020 camera ready, version 1.0

  46. arXiv:1912.03432  [pdf, other

    cs.CV

    Improved Few-Shot Visual Classification

    Authors: Peyman Bateni, Raghav Goyal, Vaden Masrani, Frank Wood, Leonid Sigal

    Abstract: Few-shot learning is a fundamental task in computer vision that carries the promise of alleviating the need for exhaustively labeled data. Most few-shot learning approaches to date have focused on progressively more complex neural feature extractors and classifier adaptation strategies, as well as the refinement of the task definition itself. In this paper, we explore the hypothesis that a simple… ▽ More

    Submitted 11 June, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  47. arXiv:1910.11961  [pdf, other

    cs.LG stat.ML

    Attention for Inference Compilation

    Authors: William Harvey, Andreas Munk, Atılım Güneş Baydin, Alexander Bergholm, Frank Wood

    Abstract: We present a new approach to automatic amortized inference in universal probabilistic programs which improves performance compared to current methods. Our approach is a variation of inference compilation (IC) which leverages deep neural networks to approximate a posterior distribution over latent variables in a probabilistic program. A challenge with existing IC network architectures is that they… ▽ More

    Submitted 25 October, 2019; originally announced October 2019.

  48. arXiv:1910.11950  [pdf, other

    cs.LG stat.ML

    Probabilistic Surrogate Networks for Simulators with Unbounded Randomness

    Authors: Andreas Munk, Berend Zwartsenberg, Adam Ścibior, Atılım Güneş Baydin, Andrew Stewart, Goran Fernlund, Anoush Poursartip, Frank Wood

    Abstract: We present a framework for automatically structuring and training fast, approximate, deep neural surrogates of stochastic simulators. Unlike traditional approaches to surrogate modeling, our surrogates retain the interpretable structure and control flow of the reference simulator. Our surrogates target stochastic simulators where the number of random variables itself can be stochastic and potentia… ▽ More

    Submitted 20 January, 2023; v1 submitted 25 October, 2019; originally announced October 2019.

  49. arXiv:1910.09056  [pdf, other

    cs.LG cs.AI stat.ML

    Amortized Rejection Sampling in Universal Probabilistic Programming

    Authors: Saeid Naderiparizi, Adam Ścibior, Andreas Munk, Mehrdad Ghadiri, Atılım Güneş Baydin, Bradley Gram-Hansen, Christian Schroeder de Witt, Robert Zinkov, Philip H. S. Torr, Tom Rainforth, Yee Whye Teh, Frank Wood

    Abstract: Naive approaches to amortized inference in probabilistic programs with unbounded loops can produce estimators with infinite variance. This is particularly true of importance sampling inference in programs that explicitly include rejection sampling as part of the user-programmed generative procedure. In this paper we develop a new and efficient amortized importance sampling estimator. We prove fini… ▽ More

    Submitted 28 March, 2022; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: AISTATS 2022 camera ready

  50. arXiv:1909.09721   

    cs.RO cs.LG cs.MA

    Safer End-to-End Autonomous Driving via Conditional Imitation Learning and Command Augmentation

    Authors: Renhao Wang, Adam Scibior, Frank Wood

    Abstract: Imitation learning is a promising approach to end-to-end training of autonomous vehicle controllers. Typically the driving process with such approaches is entirely automatic and black-box, although in practice it is desirable to control the vehicle through high-level commands, such as telling it which way to go at an intersection. In existing work this has been accomplished by the application of a… ▽ More

    Submitted 20 November, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: Architecture fails to sufficiently disentangle representations and obey varied commands