Showing 1–20 of 20 results for author: Lawson, D

  1. arXiv:2503.02859

    stat.ML cs.LG stat.ME

    Unsupervised Attributed Dynamic Network Embedding with Stability Guarantees

    Authors: Emma Ceccherini, Ian Gallagher, Andrew Jones, Daniel Lawson

    Abstract: Stability for dynamic network embeddings ensures that nodes behaving the same at different times receive the same embedding, allowing comparison of nodes in the network across time. We present attributed unfolded adjacency spectral embedding (AUASE), a stable unsupervised representation learning framework for dynamic networks in which nodes are attributed with time-varying covariate information. T…

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 27 pages, 5 figures

  2. arXiv:2410.20895

    stat.CO stat.AP stat.ML

    Valid Bootstraps for Network Embeddings with Applications to Network Visualisation

    Authors: Emerald Dilworth, Ed Davis, Daniel J. Lawson

    Abstract: Quantifying uncertainty in networks is an important step in modelling relationships and interactions between entities. We consider the challenge of bootstrapping an inhomogeneous random graph when only a single observation of the network is made and the underlying data generating function is unknown. We address this problem by considering embeddings of the observed and bootstrapped network that ar…

    Submitted 14 May, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  3. arXiv:2410.08226

    physics.geo-ph cs.LG stat.AP stat.ML

    EarthquakeNPP: Benchmark Datasets for Earthquake Forecasting with Neural Point Processes

    Authors: Samuel Stockman, Daniel Lawson, Maximilian Werner

    Abstract: Classical point process models, such as the epidemic-type aftershock sequence (ETAS) model, have been widely used for forecasting the event times and locations of earthquakes for decades. Recent advances have led to Neural Point Processes (NPPs), which promise greater flexibility and improvements over classical models. However, the currently-used benchmark dataset for NPPs does not represent an up…

    Submitted 27 September, 2024; originally announced October 2024.

  4. arXiv:2405.19230

    stat.ML cs.LG

    Valid Conformal Prediction for Dynamic GNNs

    Authors: Ed Davis, Ian Gallagher, Daniel John Lawson, Patrick Rubin-Delanchy

    Abstract: Dynamic graphs provide a flexible data abstraction for modelling many sorts of real-world systems, such as transport, trade, and social networks. Graph neural networks (GNNs) are powerful tools allowing for different kinds of prediction and inference on these systems, but getting a handle on uncertainty, especially in dynamic settings, is a challenging problem. In this work we propose to use a dyn…

    Submitted 26 March, 2025; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 25 pages, 6 figures

    MSC Class: 62H30

  5. arXiv:2404.16590

    stat.AP

    SB-ETAS: using simulation based inference for scalable, likelihood-free inference for the ETAS model of earthquake occurrences

    Authors: Samuel Stockman, Daniel J. Lawson, Maximilian J. Werner

    Abstract: Performing Bayesian inference for the Epidemic-Type Aftershock Sequence (ETAS) model of earthquakes typically requires MCMC sampling using the likelihood function or estimating the latent branching structure. These tasks have computational complexity $O(n^2)$ with the number of earthquakes and therefore do not scale well with new enhanced catalogs, which can now contain an order of $10^6$ events.…

    Submitted 28 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  6. arXiv:2311.09251

    cs.SI cs.LG stat.ML

    A Simple and Powerful Framework for Stable Dynamic Network Embedding

    Authors: Ed Davis, Ian Gallagher, Daniel John Lawson, Patrick Rubin-Delanchy

    Abstract: In this paper, we address the problem of dynamic network embedding, that is, representing the nodes of a dynamic network as evolving vectors within a low-dimensional space. While the field of static network embedding is wide and established, the field of dynamic network embedding is comparatively in its infancy. We propose that a wide class of established static network embedding methods can be us…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 33 pages, 9 figures

    MSC Class: 62H15 (Primary) 62H30; 62M10; 62G99 (Secondary)

  7. arXiv:2308.14864

    cs.LG cs.AI stat.ML

    NAS-X: Neural Adaptive Smoothing via Twisting

    Authors: Dieterich Lawson, Michael Li, Scott Linderman

    Abstract: Sequential latent variable models (SLVMs) are essential tools in statistics and machine learning, with applications ranging from healthcare to neuroscience. As their flexibility increases, analytic inference and model learning can become challenging, necessitating approximate methods. Here we introduce neural adaptive smoothing via twisting (NAS-X), a method that extends reweighted wake-sleep (RWS…

    Submitted 30 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Updating for clarity and adding new baselines

  8. arXiv:2301.09948

    physics.geo-ph stat.AP stat.ML

    Forecasting the 2016-2017 Central Apennines Earthquake Sequence with a Neural Point Process

    Authors: Samuel Stockman, Daniel J. Lawson, Maximilian J. Werner

    Abstract: Point processes have been dominant in modeling the evolution of seismicity for decades, with the Epidemic Type Aftershock Sequence (ETAS) model being most popular. Recent advances in machine learning have constructed highly flexible point process models using neural networks to improve upon existing parametric models. We investigate whether these flexible point process models can be applied to sho…

    Submitted 2 October, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  9. arXiv:2206.05952

    cs.LG cs.AI stat.ML

    SIXO: Smoothing Inference with Twisted Objectives

    Authors: Dieterich Lawson, Allan Raventós, Andrew Warrington, Scott Linderman

    Abstract: Sequential Monte Carlo (SMC) is an inference algorithm for state space models that approximates the posterior by sampling from a sequence of target distributions. The target distributions are often chosen to be the filtering distributions, but these ignore information from future observations, leading to practical and theoretical limitations in inference and model learning. We introduce SIXO, a me…

    Submitted 20 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: v2: Updates for clarity throughout. Results unchanged

  10. arXiv:2110.04629

    cs.LG cs.AI stat.ML

    The Neural Testbed: Evaluating Joint Predictions

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy

    Abstract: Predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed: an open-source benchmark for controlled and principled evaluation of agents that generate such predictions. Crucially, the testbed assesses agents not only on the quality of their marginal predictions per input, but also on their joint predictions across many inputs. We evaluate a…

    Submitted 1 November, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

  11. arXiv:2006.00077

    stat.ME stat.ML

    CLARITY -- Comparing heterogeneous data using dissimiLARITY

    Authors: Daniel J. Lawson, Vinesh Solanki, Igor Yanovich, Johannes Dellert, Damian Ruck, Phillip Endicott

    Abstract: Integrating datasets from different disciplines is hard because the data are often qualitatively different in meaning, scale, and reliability. When two datasets describe the same entities, many scientific questions can be phrased around whether the (dis)similarities between entities are conserved across such different data. Our method, CLARITY, quantifies consistency across datasets, identifies wh…

    Submitted 2 December, 2021; v1 submitted 29 May, 2020; originally announced June 2020.

    Comments: R package available from https://github.com/danjlawson/CLARITY . 30 pages, 8 Figures

  12. arXiv:1910.14265

    cs.LG stat.ML

    Energy-Inspired Models: Learning with Sampler-Induced Distributions

    Authors: Dieterich Lawson, George Tucker, Bo Dai, Rajesh Ranganath

    Abstract: Energy-based models (EBMs) are powerful probabilistic models, but suffer from intractable sampling and density evaluation due to the partition function. As a result, inference in EBMs relies on approximate sampling algorithms, leading to a mismatch between the model and inference. Motivated by this, we consider the sampler-induced distribution as the model of interest and maximize the likelihood o…

    Submitted 9 January, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: Presented at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  13. arXiv:1810.04152

    cs.LG stat.ML

    Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives

    Authors: George Tucker, Dieterich Lawson, Shixiang Gu, Chris J. Maddison

    Abstract: Deep latent variable models have become a popular model choice due to the scalable learning algorithms introduced by (Kingma & Welling, 2013; Rezende et al., 2014). These approaches maximize a variational lower bound on the intractable log likelihood of the observed data. Burda et al. (2015) introduced a multi-sample variational bound, IWAE, that is at least as tight as the standard variational lo…

    Submitted 19 November, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

  14. arXiv:1706.06428

    cs.CL cs.LG stat.ML

    An online sequence-to-sequence model for noisy speech recognition

    Authors: Chung-Cheng Chiu, Dieterich Lawson, Yuping Luo, George Tucker, Kevin Swersky, Ilya Sutskever, Navdeep Jaitly

    Abstract: Generative models have long been the dominant approach for speech recognition. The success of these models however relies on the use of sophisticated recipes and complicated machinery that is not easily accessible to non-practitioners. Recent innovations in Deep Learning have given rise to an alternative - discriminative models called Sequence-to-Sequence models, that can almost match the accuracy…

    Submitted 16 June, 2017; originally announced June 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1608.01281

  15. arXiv:1705.09279

    cs.LG cs.AI cs.NE stat.ML

    Filtering Variational Objectives

    Authors: Chris J. Maddison, Dieterich Lawson, George Tucker, Nicolas Heess, Mohammad Norouzi, Andriy Mnih, Arnaud Doucet, Yee Whye Teh

    Abstract: When used as a surrogate objective for maximum likelihood estimation in latent variable models, the evidence lower bound (ELBO) produces state-of-the-art results. Inspired by this, we consider the extension of the ELBO to a family of lower bounds defined by a particle filter's estimator of the marginal likelihood, the filtering variational objectives (FIVOs). FIVOs take the same arguments as the E…

    Submitted 12 November, 2017; v1 submitted 25 May, 2017; originally announced May 2017.

  16. arXiv:1705.05524

    cs.AI cs.LG stat.ML

    Learning Hard Alignments with Variational Inference

    Authors: Dieterich Lawson, Chung-Cheng Chiu, George Tucker, Colin Raffel, Kevin Swersky, Navdeep Jaitly

    Abstract: There has recently been significant interest in hard attention models for tasks such as object recognition, visual captioning and speech recognition. Hard attention can offer benefits over soft attention such as decreased computational cost, but training hard attention models can be difficult because of the discrete latent variables they introduce. Previous work used REINFORCE and Q-learning to ap…

    Submitted 1 November, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

  17. arXiv:1703.07370

    cs.LG stat.ML

    REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

    Authors: George Tucker, Andriy Mnih, Chris J. Maddison, Dieterich Lawson, Jascha Sohl-Dickstein

    Abstract: Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al. 2016, Maddison et al. 2016) has taken a different approach, introducing a continuous relaxation of discrete variables to produce low-variance, but biased, gradient…

    Submitted 6 November, 2017; v1 submitted 21 March, 2017; originally announced March 2017.

    Comments: NIPS 2017

  18. arXiv:1702.07780

    stat.ML cs.LG

    Changing Model Behavior at Test-Time Using Reinforcement Learning

    Authors: Augustus Odena, Dieterich Lawson, Christopher Olah

    Abstract: Machine learning models are often used at test-time subject to constraints and trade-offs not present at training-time. For example, a computer vision model operating on an embedded device may need to perform real-time inference, or a translation model operating on a cell phone may wish to bound its average compute time in order to be power-efficient. In this work we describe a mixture-of-experts…

    Submitted 24 February, 2017; originally announced February 2017.

    Comments: Submitted to ICLR 2017 Workshop Track

  19. arXiv:1403.4054

    stat.CO

    A general decision framework for structuring computation using Data Directional Scaling to process massive similarity matrices

    Authors: Daniel John Lawson, Niall M Adams

    Abstract: As datasets grow it becomes infeasible to process them completely with a desired model. For giant datasets, we frame the order in which computation is performed as a decision problem. The order is designed so that partial computations are of value and early stopping yields useful results. Our approach comprises two related tools: a decision framework to choose the order to perform computations, an…

    Submitted 17 March, 2014; originally announced March 2014.

    Comments: 30 pages, 5 figures

  20. arXiv:1307.2921

    physics.soc-ph stat.AP

    Apparent strength conceals instability in a model for the collapse of historical states

    Authors: Daniel John Lawson, Neeraj Oak

    Abstract: An explanation for the political processes leading to the sudden collapse of empires and states would be useful for understanding both historical and contemporary political events. We seek a general description of state collapse spanning eras and cultures, from small kingdoms to continental empires, drawing on a suitably diverse range of historical sources. Our aim is to provide an accessible verb…

    Submitted 10 July, 2013; originally announced July 2013.