Skip to main content

Showing 1–27 of 27 results for author: Hoogeboom, E

.
  1. arXiv:2411.02068  [pdf, other

    cs.LG cs.CV

    Model Integrity when Unlearning with T2I Diffusion Models

    Authors: Andrea Schioppa, Emiel Hoogeboom, Jonathan Heek

    Abstract: The rapid advancement of text-to-image Diffusion Models has led to their widespread public accessibility. However these models, trained on large internet datasets, can sometimes generate undesirable outputs. To mitigate this, approximate Machine Unlearning algorithms have been proposed to modify model weights to reduce the generation of specific types of images, characterized by samples from a ``f… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  2. arXiv:2410.19324  [pdf, other

    cs.CV cs.LG stat.ML

    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion

    Authors: Emiel Hoogeboom, Thomas Mensink, Jonathan Heek, Kay Lamerigts, Ruiqi Gao, Tim Salimans

    Abstract: Latent diffusion models have become the popular choice for scaling up diffusion models for high resolution image synthesis. Compared to pixel-space models that are trained end-to-end, latent models are perceived to be more efficient and to produce higher image quality at high resolution. Here we challenge these notions, and show that pixel-space models can be very competitive to latent models both… ▽ More

    Submitted 22 March, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: Accepted to CVPR 2025

  3. arXiv:2408.07009  [pdf, other

    cs.CV

    Imagen 3

    Authors: Imagen-Team-Google, :, Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Lluis Castrejon, Kelvin Chan, Yichang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Christos Kaplanis , et al. (237 additional authors not shown)

    Abstract: We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.

    Submitted 21 December, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  4. arXiv:2406.04103  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Multistep Distillation of Diffusion Models via Moment Matching

    Authors: Tim Salimans, Thomas Mensink, Jonathan Heek, Emiel Hoogeboom

    Abstract: We present a new method for making diffusion models faster to sample. The method distills many-step diffusion models into few-step models by matching conditional expectations of the clean data given noisy data along the sampling trajectory. Our approach extends recently proposed one-step methods to the multi-step case, and provides a new perspective by interpreting these approaches in terms of mom… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2405.14857  [pdf, other

    cs.CV cs.AI cs.LG

    Conditional Diffusion on Web-Scale Image Pairs leads to Diverse Image Variations

    Authors: Manoj Kumar, Neil Houlsby, Emiel Hoogeboom

    Abstract: Generating image variations, where a model produces variations of an input image while preserving the semantic context has gained increasing attention. Current image variation techniques involve adapting a text-to-image model to reconstruct an input image conditioned on the same image. We first demonstrate that a diffusion model trained to reconstruct an input image from frozen embeddings, can rec… ▽ More

    Submitted 2 October, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2403.06807  [pdf, other

    cs.LG cs.CV stat.ML

    Multistep Consistency Models

    Authors: Jonathan Heek, Emiel Hoogeboom, Tim Salimans

    Abstract: Diffusion models are relatively easy to train but require many steps to generate samples. Consistency models are far more difficult to train, but generate samples in a single step. In this paper we propose Multistep Consistency Models: A unification between Consistency Models (Song et al., 2023) and TRACT (Berthelot et al., 2023) that can interpolate between a consistency model and a diffusion m… ▽ More

    Submitted 19 November, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  7. arXiv:2402.09470  [pdf, other

    cs.LG stat.ML

    Rolling Diffusion Models

    Authors: David Ruhe, Jonathan Heek, Tim Salimans, Emiel Hoogeboom

    Abstract: Diffusion models have recently been increasingly applied to temporal data such as video, fluid mechanics simulations, or climate data. These methods generally treat subsequent frames equally regarding the amount of noise in the diffusion process. This paper explores Rolling Diffusion: a new approach that uses a sliding window denoising process. It ensures that the diffusion process progressively c… ▽ More

    Submitted 9 September, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  8. arXiv:2306.08068  [pdf, other

    cs.CV cs.AI cs.LG

    DORSal: Diffusion for Object-centric Representations of Scenes et al

    Authors: Allan Jabri, Sjoerd van Steenkiste, Emiel Hoogeboom, Mehdi S. M. Sajjadi, Thomas Kipf

    Abstract: Recent progress in 3D scene understanding enables scalable learning of representations across large datasets of diverse scenes. As a consequence, generalization to unseen scenes and objects, rendering novel views from just a single or a handful of input images, and controllable scene generation that supports editing, is now possible. However, training jointly on a large number of scenes typically… ▽ More

    Submitted 2 May, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2024. Project page: https://www.sjoerdvansteenkiste.com/dorsal

  9. arXiv:2305.18231  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    High-Fidelity Image Compression with Score-based Generative Models

    Authors: Emiel Hoogeboom, Eirikur Agustsson, Fabian Mentzer, Luca Versari, George Toderici, Lucas Theis

    Abstract: Despite the tremendous success of diffusion generative models in text-to-image generation, replicating this success in the domain of image compression has proven difficult. In this paper, we demonstrate that diffusion can significantly improve perceptual quality at a given bit-rate, outperforming state-of-the-art approaches PO-ELIC and HiFiC as measured by FID score. This is achieved using a simpl… ▽ More

    Submitted 7 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  10. arXiv:2301.11093  [pdf, other

    cs.CV cs.LG stat.ML

    Simple diffusion: End-to-end diffusion for high resolution images

    Authors: Emiel Hoogeboom, Jonathan Heek, Tim Salimans

    Abstract: Currently, applying diffusion models in pixel space of high resolution images is difficult. Instead, existing approaches focus on diffusion in lower dimensional spaces (latent diffusion), or have multiple super-resolution levels of generation referred to as cascades. The downside is that these approaches add additional complexity to the diffusion framework. This paper aims to improve denoising d… ▽ More

    Submitted 12 December, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  11. arXiv:2209.05557  [pdf, other

    cs.LG cs.CV stat.ML

    Blurring Diffusion Models

    Authors: Emiel Hoogeboom, Tim Salimans

    Abstract: Recently, Rissanen et al., (2022) have presented a new type of diffusion process for generative modeling based on heat dissipation, or blurring, as an alternative to isotropic Gaussian diffusion. Here, we show that blurring can equivalently be defined through a Gaussian diffusion process with non-isotropic noise. In making this connection, we bridge the gap between inverse heat dissipation and den… ▽ More

    Submitted 1 May, 2024; v1 submitted 12 September, 2022; originally announced September 2022.

  12. arXiv:2203.17003  [pdf, other

    cs.LG q-bio.QM stat.ML

    Equivariant Diffusion for Molecule Generation in 3D

    Authors: Emiel Hoogeboom, Victor Garcia Satorras, Clément Vignac, Max Welling

    Abstract: This work introduces a diffusion model for molecule generation in 3D that is equivariant to Euclidean transformations. Our E(3) Equivariant Diffusion Model (EDM) learns to denoise a diffusion process with an equivariant network that jointly operates on both continuous (atom coordinates) and categorical features (atom types). In addition, we provide a probabilistic analysis which admits likelihood… ▽ More

    Submitted 16 June, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted at International Conference on Machine Learning (ICML) 2022

  13. arXiv:2110.02037  [pdf, other

    cs.LG stat.ML

    Autoregressive Diffusion Models

    Authors: Emiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans

    Abstract: We introduce Autoregressive Diffusion Models (ARDMs), a model class encompassing and generalizing order-agnostic autoregressive models (Uria et al., 2014) and absorbing discrete diffusion (Austin et al., 2021), which we show are special cases of ARDMs under mild assumptions. ARDMs are simple to implement and easy to train. Unlike standard ARMs, they do not require causal masking of model represent… ▽ More

    Submitted 1 February, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at International Conference on Learning Representations (ICLR) 2022

  14. arXiv:2107.11625  [pdf, other

    cs.LG

    Discrete Denoising Flows

    Authors: Alexandra Lindt, Emiel Hoogeboom

    Abstract: Discrete flow-based models are a recently proposed class of generative models that learn invertible transformations for discrete random variables. Since they do not require data dequantization and maximize an exact likelihood objective, they can be used in a straight-forward manner for lossless compression. In this paper, we introduce a new discrete flow-based model for categorical random variable… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

    Comments: Accepted to the Third workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models (ICML 2021)

  15. arXiv:2105.09016  [pdf, other

    cs.LG physics.chem-ph stat.ML

    E(n) Equivariant Normalizing Flows

    Authors: Victor Garcia Satorras, Emiel Hoogeboom, Fabian B. Fuchs, Ingmar Posner, Max Welling

    Abstract: This paper introduces a generative model equivariant to Euclidean symmetries: E(n) Equivariant Normalizing Flows (E-NFs). To construct E-NFs, we take the discriminative E(n) graph neural networks and integrate them as a differential equation to obtain an invertible equivariant function: a continuous-time normalizing flow. We demonstrate that E-NFs considerably outperform baselines and existing met… ▽ More

    Submitted 14 January, 2022; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2021)

  16. arXiv:2102.09844  [pdf, other

    cs.LG stat.ML

    E(n) Equivariant Graph Neural Networks

    Authors: Victor Garcia Satorras, Emiel Hoogeboom, Max Welling

    Abstract: This paper introduces a new model to learn graph neural networks equivariant to rotations, translations, reflections and permutations called E(n)-Equivariant Graph Neural Networks (EGNNs). In contrast with existing methods, our work does not require computationally expensive higher-order representations in intermediate layers while it still achieves competitive or better performance. In addition,… ▽ More

    Submitted 16 February, 2022; v1 submitted 19 February, 2021; originally announced February 2021.

  17. arXiv:2102.05379  [pdf, other

    stat.ML cs.CL cs.LG

    Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions

    Authors: Emiel Hoogeboom, Didrik Nielsen, Priyank Jaini, Patrick Forré, Max Welling

    Abstract: Generative flows and diffusion models have been predominantly trained on ordinal data, for example natural images. This paper introduces two extensions of flows and diffusion for categorical data such as language or image segmentation: Argmax Flows and Multinomial Diffusion. Argmax Flows are defined by a composition of a continuous distribution (such as a normalizing flow), and an argmax function.… ▽ More

    Submitted 22 October, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2021)

  18. arXiv:2012.13311  [pdf, other

    cs.LG stat.ML

    Variational Determinant Estimation with Spherical Normalizing Flows

    Authors: Simon Passenheim, Emiel Hoogeboom

    Abstract: This paper introduces the Variational Determinant Estimator (VDE), a variational extension of the recently proposed determinant estimator discovered by arXiv:2005.06553v2. Our estimator significantly reduces the variance even for low sample sizes by combining (importance-weighted) variational inference and a family of normalizing flows which allow density estimation on hyperspheres. In the ideal c… ▽ More

    Submitted 8 January, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: Accepted at 3rd Symposium on Advances in Approximate Bayesian Inference (AABI) 2021

  19. arXiv:2011.07248  [pdf, other

    cs.LG cs.NE stat.ML

    Self Normalizing Flows

    Authors: T. Anderson Keller, Jorn W. T. Peters, Priyank Jaini, Emiel Hoogeboom, Patrick Forré, Max Welling

    Abstract: Efficient gradient computation of the Jacobian determinant term is a core problem in many machine learning settings, and especially so in the normalizing flow framework. Most proposed flow models therefore either restrict to a function class with easy evaluation of the Jacobian determinant, or an efficient estimator thereof. However, these restrictions limit the performance of such density models,… ▽ More

    Submitted 9 June, 2021; v1 submitted 14 November, 2020; originally announced November 2020.

  20. arXiv:2007.02731  [pdf, other

    cs.LG stat.ML

    SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows

    Authors: Didrik Nielsen, Priyank Jaini, Emiel Hoogeboom, Ole Winther, Max Welling

    Abstract: Normalizing flows and variational autoencoders are powerful generative models that can represent complicated density functions. However, they both impose constraints on the models: Normalizing flows use bijective transformations to model densities whereas VAEs learn stochastic transformations that are non-invertible and thus typically do not provide tractable estimates of the marginal likelihood.… ▽ More

    Submitted 30 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  21. arXiv:2006.01910  [pdf, other

    cs.LG cs.CV stat.ML

    The Convolution Exponential and Generalized Sylvester Flows

    Authors: Emiel Hoogeboom, Victor Garcia Satorras, Jakub M. Tomczak, Max Welling

    Abstract: This paper introduces a new method to build linear flows, by taking the exponential of a linear transformation. This linear transformation does not need to be invertible itself, and the exponential has the following desirable properties: it is guaranteed to be invertible, its inverse is straightforward to compute and the log Jacobian determinant is equal to the trace of the linear transformation.… ▽ More

    Submitted 26 October, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2020

  22. arXiv:2002.09928  [pdf, other

    cs.LG stat.ML

    Predictive Sampling with Forecasting Autoregressive Models

    Authors: Auke Wiggers, Emiel Hoogeboom

    Abstract: Autoregressive models (ARMs) currently hold state-of-the-art performance in likelihood-based modeling of image and audio data. Generally, neural network based ARMs are designed to allow fast inference, but sampling from these models is impractically slow. In this paper, we introduce the predictive sampling algorithm: a procedure that exploits the fast inference property of ARMs in order to speed u… ▽ More

    Submitted 8 July, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: Accepted at the 37th International Conference on Machine Learning (ICML 2020). 14 pages, 13 figures

  23. arXiv:2001.11235  [pdf, other

    cs.LG stat.ML

    Learning Discrete Distributions by Dequantization

    Authors: Emiel Hoogeboom, Taco S. Cohen, Jakub M. Tomczak

    Abstract: Media is generally stored digitally and is therefore discrete. Many successful deep distribution models in deep learning learn a density, i.e., the distribution of a continuous random variable. Naïve optimization on discrete data leads to arbitrarily high likelihoods, and instead, it has become standard practice to add noise to datapoints. In this paper, we present a general framework for dequanti… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

  24. arXiv:1912.00042  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Likelihoods with Conditional Normalizing Flows

    Authors: Christina Winkler, Daniel Worrall, Emiel Hoogeboom, Max Welling

    Abstract: Normalizing Flows (NFs) are able to model complicated distributions p(y) with strong inter-dimensional correlations and high multimodality by transforming a simple base density p(z) through an invertible neural network under the change of variables formula. Such behavior is desirable in multivariate structured prediction tasks, where handcrafted per-pixel loss-based methods inadequately capture st… ▽ More

    Submitted 12 November, 2023; v1 submitted 29 November, 2019; originally announced December 2019.

    Comments: 18 pages, 8 Tables, 9 Figures, Preprint

  25. arXiv:1905.07376  [pdf, other

    cs.LG cs.CV stat.ML

    Integer Discrete Flows and Lossless Compression

    Authors: Emiel Hoogeboom, Jorn W. T. Peters, Rianne van den Berg, Max Welling

    Abstract: Lossless compression methods shorten the expected representation size of data without loss of information, using a statistical model. Flow-based models are attractive in this setting because they admit exact likelihood optimization, which is equivalent to minimizing the expected number of bits per message. However, conventional flows assume continuous data, which may lead to reconstruction errors… ▽ More

    Submitted 6 December, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

    Comments: Accepted as a conference paper at Neural Information Processing Systems (NeurIPS) 2019

  26. arXiv:1901.11137  [pdf, other

    cs.LG stat.ML

    Emerging Convolutions for Generative Normalizing Flows

    Authors: Emiel Hoogeboom, Rianne van den Berg, Max Welling

    Abstract: Generative flows are attractive because they admit exact likelihood optimization and efficient image synthesis. Recently, Kingma & Dhariwal (2018) demonstrated with Glow that generative flows are capable of generating high quality images. We generalize the 1 x 1 convolutions proposed in Glow to invertible d x d convolutions, which are more flexible since they operate on both channel and spatial ax… ▽ More

    Submitted 20 May, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: Accepted at International Conference on Machine Learning (ICML) 2019

  27. arXiv:1803.02108  [pdf, other

    cs.LG stat.ML

    HexaConv

    Authors: Emiel Hoogeboom, Jorn W. T. Peters, Taco S. Cohen, Max Welling

    Abstract: The effectiveness of Convolutional Neural Networks stems in large part from their ability to exploit the translation invariance that is inherent in many learning problems. Recently, it was shown that CNNs can exploit other invariances, such as rotation invariance, by using group convolutions instead of planar convolutions. However, for reasons of performance and ease of implementation, it has been… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.