Skip to main content

Showing 1–14 of 14 results for author: Ballé, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.06098  [pdf, ps, other

    cs.IT eess.SP

    Discretized Approximate Ancestral Sampling

    Authors: Alfredo De la Fuente, Saurabh Singh, Jona Ballé

    Abstract: The Fourier Basis Density Model (FBM) was recently introduced as a flexible probability model for band-limited distributions, i.e. ones which are smooth in the sense of having a characteristic function with limited support around the origin. Its density and cumulative distribution functions can be efficiently evaluated and trained with stochastic optimization methods, which makes the model suitabl… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 11 pages, 7 figures. Accepted for presentation at the Learn to Compress & Compress to Learn Workshop at ISIT 2025

  2. arXiv:2412.00505  [pdf, other

    cs.CV eess.IV

    Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion

    Authors: Jona Ballé, Luca Versari, Emilien Dupont, Hyunjik Kim, Matthias Bauer

    Abstract: Inspired by the success of generative image models, recent work on learned image compression increasingly focuses on better probabilistic models of the natural image distribution, leading to excellent image quality. This, however, comes at the expense of a computational complexity that is several orders of magnitude higher than today's commercial codecs, and thus prohibitive for most practical app… ▽ More

    Submitted 23 March, 2025; v1 submitted 30 November, 2024; originally announced December 2024.

    Comments: 16 pages, 12 figures. Accepted for presentation at CVPR 2025

  3. Neural Distributed Compressor Discovers Binning

    Authors: Ezgi Ozyilkan, Johannes Ballé, Elza Erkip

    Abstract: We consider lossy compression of an information source when the decoder has lossless access to a correlated one. This setup, also known as the Wyner-Ziv problem, is a special case of distributed source coding. To this day, practical approaches for the Wyner-Ziv problem have neither been fully developed nor heavily investigated. We propose a data-driven method based on machine learning that leverag… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: draft of a journal version of our previous ISIT 2023 paper (available at: arXiv:2305.04380). arXiv admin note: substantial text overlap with arXiv:2305.04380

  4. arXiv:2310.03629  [pdf, other

    cs.IT cs.CV eess.IV

    Wasserstein Distortion: Unifying Fidelity and Realism

    Authors: Yang Qiu, Aaron B. Wagner, Johannes Ballé, Lucas Theis

    Abstract: We introduce a distortion measure for images, Wasserstein distortion, that simultaneously generalizes pixel-level fidelity on the one hand and realism or perceptual quality on the other. We show how Wasserstein distortion reduces to a pure fidelity constraint or a pure realism constraint under different parameter choices and discuss its metric properties. Pairs of images that are close under Wasse… ▽ More

    Submitted 28 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  5. arXiv:2305.04380  [pdf, other

    cs.IT eess.SP

    Learned Wyner-Ziv Compressors Recover Binning

    Authors: Ezgi Ozyilkan, Johannes Ballé, Elza Erkip

    Abstract: We consider lossy compression of an information source when the decoder has lossless access to a correlated one. This setup, also known as the Wyner-Ziv problem, is a special case of distributed source coding. To this day, real-world applications of this problem have neither been fully developed nor heavily investigated. We propose a data-driven method based on machine learning that leverages the… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: to be appearing in ISIT 2023

  6. arXiv:2107.12038  [pdf, other

    eess.IV cs.CV

    Neural Video Compression using GANs for Detail Synthesis and Propagation

    Authors: Fabian Mentzer, Eirikur Agustsson, Johannes Ballé, David Minnen, Nick Johnston, George Toderici

    Abstract: We present the first neural video compression method based on generative adversarial networks (GANs). Our approach significantly outperforms previous neural and non-neural video compression methods in a user study, setting a new state-of-the-art in visual quality for neural methods. We show that the GAN loss is crucial to obtain this high visual quality. Two components make the GAN loss effective:… ▽ More

    Submitted 12 July, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: First two authors contributed equally. ECCV Camera ready version

  7. arXiv:2106.04427  [pdf, other

    cs.CV eess.IV q-bio.NC

    On the relation between statistical learning and perceptual distances

    Authors: Alexander Hepburn, Valero Laparra, Raul Santos-Rodriguez, Johannes Ballé, Jesús Malo

    Abstract: It has been demonstrated many times that the behavior of the human visual system is connected to the statistics of natural images. Since machine learning relies on the statistics of training data as well, the above connection has interesting implications when using perceptual distances (which mimic the behavior of the human visual system) as a loss function. In this paper, we aim to unravel the no… ▽ More

    Submitted 16 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

  8. arXiv:2104.12456  [pdf, other

    cs.CV eess.IV

    3D Scene Compression through Entropy Penalized Neural Representation Functions

    Authors: Thomas Bird, Johannes Ballé, Saurabh Singh, Philip A. Chou

    Abstract: Some forms of novel visual media enable the viewer to explore a 3D scene from arbitrary viewpoints, by interpolating between a discrete set of original views. Compared to 2D imagery, these types of applications require much larger amounts of storage space, which we seek to reduce. Existing approaches for compressing 3D scenes are based on a separation of compression and rendering: each of the orig… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: accepted (in an abridged format) as a contribution to the Learning-based Image Coding special session of the Picture Coding Symposium 2021

  9. arXiv:2011.05065  [pdf, other

    cs.IT eess.IV

    Neural Networks Optimally Compress the Sawbridge

    Authors: Aaron B. Wagner, Johannes Ballé

    Abstract: Neural-network-based compressors have proven to be remarkably effective at compressing sources, such as images, that are nominally high-dimensional but presumed to be concentrated on a low-dimensional manifold. We consider a continuous-time random process that models an extreme version of such a source, wherein the realizations fall along a one-dimensional "curve" in function space that has infini… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

  10. arXiv:2007.11797  [pdf, other

    cs.CV eess.IV

    End-to-end Learning of Compressible Features

    Authors: Saurabh Singh, Sami Abu-El-Haija, Nick Johnston, Johannes Ballé, Abhinav Shrivastava, George Toderici

    Abstract: Pre-trained convolutional neural networks (CNNs) are powerful off-the-shelf feature generators and have been shown to perform very well on a variety of tasks. Unfortunately, the generated features are high dimensional and expensive to store: potentially hundreds of thousands of floats per example when processing videos. Traditional entropy based lossless compression methods are of little help as t… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: Accepted at ICIP 2020

  11. arXiv:2007.03034  [pdf, other

    cs.IT eess.IV

    Nonlinear Transform Coding

    Authors: Johannes Ballé, Philip A. Chou, David Minnen, Saurabh Singh, Nick Johnston, Eirikur Agustsson, Sung Jin Hwang, George Toderici

    Abstract: We review a class of methods that can be collected under the name nonlinear transform coding (NTC), which over the past few years have become competitive with the best linear transform codecs for images, and have superseded them in terms of rate--distortion performance under established perceptual quality metrics such as MS-SSIM. We assess the empirical rate--distortion performance of NTC with the… ▽ More

    Submitted 23 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 17 pages, 14 figures. Accepted for publication in IEEE Journal of Selected Topics in Signal Processing

  12. arXiv:1912.08771  [pdf, other

    eess.IV cs.LG stat.ML

    Computationally Efficient Neural Image Compression

    Authors: Nick Johnston, Elad Eban, Ariel Gordon, Johannes Ballé

    Abstract: Image compression using neural networks have reached or exceeded non-neural methods (such as JPEG, WebP, BPG). While these networks are state of the art in ratedistortion performance, computational feasibility of these models remains a challenge. We apply automatic network optimization techniques to reduce the computational complexity of a popular architecture used in neural image compression, ana… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: In submission to a conference

  13. arXiv:1802.01436  [pdf, other

    eess.IV cs.IT

    Variational image compression with a scale hyperprior

    Authors: Johannes Ballé, David Minnen, Saurabh Singh, Sung Jin Hwang, Nick Johnston

    Abstract: We describe an end-to-end trainable model for image compression based on variational autoencoders. The model incorporates a hyperprior to effectively capture spatial dependencies in the latent representation. This hyperprior relates to side information, a concept universal to virtually all modern image codecs, but largely unexplored in image compression using artificial neural networks (ANNs). Unl… ▽ More

    Submitted 1 May, 2018; v1 submitted 31 January, 2018; originally announced February 2018.

    Comments: accepted as a conference contribution to International Conference on Learning Representations 2018

  14. arXiv:1802.00847  [pdf, ps, other

    eess.IV

    Efficient Nonlinear Transforms for Lossy Image Compression

    Authors: Johannes Ballé

    Abstract: We assess the performance of two techniques in the context of nonlinear transform coding with artificial neural networks, Sadam and GDN. Both techniques have been successfully used in state-of-the-art image compression methods, but their performance has not been individually assessed to this point. Together, the techniques stabilize the training procedure of nonlinear image transforms and increase… ▽ More

    Submitted 30 July, 2018; v1 submitted 31 January, 2018; originally announced February 2018.

    Comments: accepted as a conference contribution to Picture Coding Symposium 2018