Showing 1–11 of 11 results for author: Bloem-Reddy, B

  1. arXiv:2502.13757  [pdf, other]

    stat.ML cs.LG

    Identifying Metric Structures of Deep Latent Variable Models

    Authors: Stas Syrota, Yevgen Zainchkovskyy, Johnny Xi, Benjamin Bloem-Reddy, Søren Hauberg

    Abstract: Deep latent variable models learn condensed representations of data that, hopefully, reflect the inner workings of the studied phenomena. Unfortunately, these latent representations are not statistically identifiable, meaning they cannot be uniquely determined. Domain experts, therefore, need to tread carefully when interpreting these. Current solutions limit the lack of identifiability through ad…

    Submitted 30 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Journal ref: Forty-second International Conference on Machine Learning. ICML 2025. Vancouver, Canada. July 13-19, 2025

  2. arXiv:2502.05122  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Distinguishing Cause from Effect with Causal Velocity Models

    Authors: Johnny Xi, Hugh Dance, Peter Orbanz, Benjamin Bloem-Reddy

    Abstract: Bivariate structural causal models (SCM) are often used to infer causal direction by examining their goodness-of-fit under restricted model classes. In this paper, we describe a parametrization of bivariate SCMs in terms of a causal velocity by viewing the cause variable as time in a dynamical system. The velocity implicitly defines counterfactual curves via the solution of initial value problems…

    Submitted 9 June, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: ICML 2025

  3. arXiv:2308.15613  [pdf, other]

    stat.CO cs.LG stat.ML

    Mixed Variational Flows for Discrete Variables

    Authors: Gian Carlo Diluvi, Benjamin Bloem-Reddy, Trevor Campbell

    Abstract: Variational flows allow practitioners to learn complex continuous distributions, but approximating discrete distributions remains a challenge. Current methodologies typically embed the discrete target in a continuous space - usually via continuous relaxation or dequantization - and then apply a continuous flow. These approaches involve a surrogate target that may not capture the original discrete…

    Submitted 26 February, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  4. arXiv:2206.00801  [pdf, other]

    stat.ML cs.LG

    Indeterminacy in Generative Models: Characterization and Strong Identifiability

    Authors: Quanhan Xi, Benjamin Bloem-Reddy

    Abstract: Most modern probabilistic generative models, such as the variational autoencoder (VAE), have certain indeterminacies that are unresolvable even with an infinite amount of data. Different tasks tolerate different indeterminacies; however, recent applications have indicated the need for strongly identifiable models, in which an observation corresponds to a unique latent code. Progress has been made t…

    Submitted 2 March, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: AISTATS 2023 (title corrected from v4)

  5. arXiv:2106.10800  [pdf, other]

    cs.LG cs.IT stat.ML

    Lossy Compression for Lossless Prediction

    Authors: Yann Dubois, Benjamin Bloem-Reddy, Karen Ullrich, Chris J. Maddison

    Abstract: Most data is automatically collected and only ever "seen" by algorithms. Yet, data compressors preserve perceptual fidelity rather than just the information needed by algorithms performing downstream tasks. In this paper, we characterize the bit-rate required to ensure high performance on all predictive tasks that are invariant under a set of transformations, such as data augmentations. Based on o…

    Submitted 28 January, 2022; v1 submitted 20 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021

  6. arXiv:2010.03753  [pdf, other]

    cs.LG stat.ML

    Uncertainty in Neural Processes

    Authors: Saeid Naderiparizi, Kenny Chiu, Benjamin Bloem-Reddy, Frank Wood

    Abstract: We explore the effects of architecture and training objective choice on amortized posterior predictive inference in probabilistic conditional generative models. We aim this work to be a counterpoint to a recent trend in the literature that stresses achieving good samples when the amount of conditioning data is large. We instead focus our attention on the case where the amount of conditioning data…

    Submitted 8 October, 2020; originally announced October 2020.

  7. arXiv:2005.00178  [pdf, other]

    cs.LG stat.ML

    On the Benefits of Invariance in Neural Networks

    Authors: Clare Lyle, Mark van der Wilk, Marta Kwiatkowska, Yarin Gal, Benjamin Bloem-Reddy

    Abstract: Many real world data analysis problems exhibit invariant structure, and models that take advantage of this structure have shown impressive empirical performance, particularly in deep learning. While the literature contains a variety of methods to incorporate invariance into models, theoretical understanding is poor and there is no way to assess when one method should be preferred over another. In…

    Submitted 30 April, 2020; originally announced May 2020.

  8. arXiv:1901.06082  [pdf, other]

    stat.ML cs.LG

    Probabilistic symmetries and invariant neural networks

    Authors: Benjamin Bloem-Reddy, Yee Whye Teh

    Abstract: Treating neural network inputs and outputs as random variables, we characterize the structure of neural networks that can be used to model data that are invariant or equivariant under the action of a compact group. Much recent research has been devoted to encoding invariance under symmetry transformations into neural network architectures, in an effort to improve the performance of deep neural net…

    Submitted 16 September, 2020; v1 submitted 17 January, 2019; originally announced January 2019.

    Comments: Revised structure for clarity; fixed minor mistakes; incorporated reviewer feedback for publication

    Journal ref: Journal of Machine Learning Research, 21(90):1-61, 2020

  9. arXiv:1807.04932  [pdf, other]

    stat.ML cs.LG

    Sequential sampling of Gaussian process latent variable models

    Authors: Martin Tegner, Benjamin Bloem-Reddy, Stephen Roberts

    Abstract: We consider the problem of inferring a latent function in a probabilistic model of data. When dependencies of the latent function are specified by a Gaussian process and the data likelihood is complex, efficient computation often involves Markov chain Monte Carlo sampling with limited applicability to large data sets. We extend some of these techniques to scale efficiently when the problem exhibits…

    Submitted 20 July, 2018; v1 submitted 13 July, 2018; originally announced July 2018.

    Comments: In 2018 ICML Workshop on Tractable Probabilistic Models (TPM 2018)

  10. arXiv:1807.03113  [pdf, other]

    stat.ML cs.LG cs.SI stat.ME

    Sampling and Inference for Beta Neutral-to-the-Left Models of Sparse Networks

    Authors: Benjamin Bloem-Reddy, Adam Foster, Emile Mathieu, Yee Whye Teh

    Abstract: Empirical evidence suggests that heavy-tailed degree distributions occurring in many real networks are well-approximated by power laws with exponents $η$ that may take values either less than or greater than two. Models based on various forms of exchangeability are able to capture power laws with $η< 2$, and admit tractable inference algorithms; we draw on previous results to show that $η> 2$ can…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: Accepted for publication in the proceedings of Conference on Uncertainty in Artificial Intelligence (UAI) 2018

  11. arXiv:1710.02159  [pdf, other]

    math.PR cs.SI math.ST physics.soc-ph

    Preferential Attachment and Vertex Arrival Times

    Authors: Benjamin Bloem-Reddy, Peter Orbanz

    Abstract: We study preferential attachment mechanisms in random graphs that are parameterized by (i) a constant bias affecting the degree-biased distribution on the vertex set and (ii) the distribution of times at which new vertices are created by the model. The class of random graphs so defined admits a representation theorem reminiscent of residual allocation, or "stick-breaking" schemes. We characterize…

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: 34 pages, 1 figure