Skip to main content

Showing 1–17 of 17 results for author: Berthelot, D

.
  1. arXiv:2506.06276  [pdf, ps, other

    cs.CV cs.AI cs.LG

    STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

    Authors: Jiatao Gu, Tianrong Chen, David Berthelot, Huangjie Zheng, Yuyang Wang, Ruixiang Zhang, Laurent Dinh, Miguel Angel Bautista, Josh Susskind, Shuangfei Zhai

    Abstract: We present STARFlow, a scalable generative model based on normalizing flows that achieves strong performance in high-resolution image synthesis. The core of STARFlow is Transformer Autoregressive Flow (TARFlow), which combines the expressive power of normalizing flows with the structured modeling capabilities of Autoregressive Transformers. We first establish the theoretical universality of TARFlo… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: TLDR: We show for the first time that normalizing flows can be scaled for high-resolution and text-conditioned image synthesis

  2. arXiv:2502.04549  [pdf, other

    cs.LG

    Mechanisms of Projective Composition of Diffusion Models

    Authors: Arwen Bradley, Preetum Nakkiran, David Berthelot, James Thornton, Joshua M. Susskind

    Abstract: We study the theoretical foundations of composition in diffusion models, with a particular focus on out-of-distribution extrapolation and length-generalization. Prior work has shown that composing distributions via linear score combination can achieve promising results, including length-generalization in some cases (Du et al., 2023; Liu et al., 2022). However, our theoretical understanding of how… ▽ More

    Submitted 14 May, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: 10 pages, 8 figures. The first two authors contributed equally

  3. arXiv:2412.06329  [pdf, ps, other

    cs.CV cs.LG

    Normalizing Flows are Capable Generative Models

    Authors: Shuangfei Zhai, Ruixiang Zhang, Preetum Nakkiran, David Berthelot, Jiatao Gu, Huangjie Zheng, Tianrong Chen, Miguel Angel Bautista, Navdeep Jaitly, Josh Susskind

    Abstract: Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative modeling tasks, but have received relatively little attention in recent years. In this work, we demonstrate that NFs are more powerful than previously believed. We present TarFlow: a simple and scalable architecture that enables highly perfor… ▽ More

    Submitted 6 June, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: ICML 2025

  4. arXiv:2303.04248  [pdf, other

    cs.LG cs.CV

    TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation

    Authors: David Berthelot, Arnaud Autef, Jierui Lin, Dian Ang Yap, Shuangfei Zhai, Siyuan Hu, Daniel Zheng, Walter Talbott, Eric Gu

    Abstract: Denoising Diffusion models have demonstrated their proficiency for generative sampling. However, generating good samples often requires many iterations. Consequently, techniques such as binary time-distillation (BTD) have been proposed to reduce the number of network calls for a fixed architecture. In this paper, we introduce TRAnsitive Closure Time-distillation (TRACT), a new method that extends… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  5. arXiv:2106.04732  [pdf, other

    cs.LG cs.AI cs.CV

    AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation

    Authors: David Berthelot, Rebecca Roelofs, Kihyuk Sohn, Nicholas Carlini, Alex Kurakin

    Abstract: We extend semi-supervised learning to the problem of domain adaptation to learn significantly higher-accuracy models that train on one data distribution and test on a different one. With the goal of generality, we introduce AdaMatch, a method that unifies the tasks of unsupervised domain adaptation (UDA), semi-supervised learning (SSL), and semi-supervised domain adaptation (SSDA). In an extensive… ▽ More

    Submitted 15 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022

  6. arXiv:2011.14004  [pdf, other

    cs.CV cs.LG eess.IV

    Assessing Post-Disaster Damage from Satellite Imagery using Semi-Supervised Learning Techniques

    Authors: Jihyeon Lee, Joseph Z. Xu, Kihyuk Sohn, Wenhan Lu, David Berthelot, Izzeddin Gur, Pranav Khaitan, Ke-Wei, Huang, Kyriacos Koupparis, Bernhard Kowatsch

    Abstract: To respond to disasters such as earthquakes, wildfires, and armed conflicts, humanitarian organizations require accurate and timely data in the form of damage assessments, which indicate what buildings and population centers have been most affected. Recent research combines machine learning with remote sensing to automatically extract such information from satellite imagery, reducing manual labor… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop

    ACM Class: I.2.10; I.2.1; I.5.4

  7. arXiv:2003.02365  [pdf, other

    cs.CV eess.IV stat.ML

    Creating High Resolution Images with a Latent Adversarial Generator

    Authors: David Berthelot, Peyman Milanfar, Ian Goodfellow

    Abstract: Generating realistic images is difficult, and many formulations for this task have been proposed recently. If we restrict the task to that of generating a particular class of images, however, the task becomes more tractable. That is to say, instead of generating an arbitrary image as a sample from the manifold of natural images, we propose to sample images from a particular "subspace" of natural i… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

  8. arXiv:2002.03480  [pdf, other

    cs.LG cs.CV stat.ML

    Semi-Supervised Class Discovery

    Authors: Jeremy Nixon, Jeremiah Liu, David Berthelot

    Abstract: One promising approach to dealing with datapoints that are outside of the initial training distribution (OOD) is to create new classes that capture similarities in the datapoints previously rejected as uncategorizable. Systems that generate labels can be deployed against an arbitrary amount of data, discovering classification schemes that through training create a higher quality representation of… ▽ More

    Submitted 21 February, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

  9. arXiv:2001.07685  [pdf

    cs.LG cs.CV stat.ML

    FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

    Authors: Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel

    Abstract: Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance. In this paper, we demonstrate the power of a simple combination of two common SSL methods: consistency regularization and pseudo-labeling. Our algorithm, FixMatch, first generates pseudo-labels using the model's predictions on weakly-augmented unlabeled images. For a given imag… ▽ More

    Submitted 25 November, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: Published at NeurIPS 2020 as a conference paper

  10. arXiv:1912.00594  [pdf, other

    cs.LG stat.ML

    Combining MixMatch and Active Learning for Better Accuracy with Fewer Labels

    Authors: Shuang Song, David Berthelot, Afshin Rostamizadeh

    Abstract: We propose using active learning based techniques to further improve the state-of-the-art semi-supervised learning MixMatch algorithm. We provide a thorough empirical evaluation of several active-learning and baseline methods, which successfully demonstrate a significant improvement on the benchmark CIFAR-10, CIFAR-100, and SVHN datasets (as much as 1.5% in absolute accuracy). We also provide an e… ▽ More

    Submitted 2 December, 2019; v1 submitted 2 December, 2019; originally announced December 2019.

  11. arXiv:1911.09785  [pdf, other

    cs.LG cs.CV stat.ML

    ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring

    Authors: David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel

    Abstract: We improve the recently-proposed "MixMatch" semi-supervised learning algorithm by introducing two new techniques: distribution alignment and augmentation anchoring. Distribution alignment encourages the marginal distribution of predictions on unlabeled data to be close to the marginal distribution of ground-truth labels. Augmentation anchoring feeds multiple strongly augmented versions of an input… ▽ More

    Submitted 13 February, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

  12. arXiv:1909.01838  [pdf, other

    cs.LG cs.CR stat.ML

    High Accuracy and High Fidelity Extraction of Neural Networks

    Authors: Matthew Jagielski, Nicholas Carlini, David Berthelot, Alex Kurakin, Nicolas Papernot

    Abstract: In a model extraction attack, an adversary steals a copy of a remotely deployed machine learning model, given oracle prediction access. We taxonomize model extraction attacks around two objectives: *accuracy*, i.e., performing well on the underlying learning task, and *fidelity*, i.e., matching the predictions of the remote victim classifier on any input. To extract a high-accuracy model, we dev… ▽ More

    Submitted 3 March, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: USENIX Security 2020, 18 pages, 6 figures

  13. arXiv:1905.02249  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    MixMatch: A Holistic Approach to Semi-Supervised Learning

    Authors: David Berthelot, Nicholas Carlini, Ian Goodfellow, Nicolas Papernot, Avital Oliver, Colin Raffel

    Abstract: Semi-supervised learning has proven to be a powerful paradigm for leveraging unlabeled data to mitigate the reliance on large labeled datasets. In this work, we unify the current dominant approaches for semi-supervised learning to produce a new algorithm, MixMatch, that works by guessing low-entropy labels for data-augmented unlabeled examples and mixing labeled and unlabeled data using MixUp. We… ▽ More

    Submitted 23 October, 2019; v1 submitted 6 May, 2019; originally announced May 2019.

  14. arXiv:1807.07543  [pdf, other

    cs.LG stat.ML

    Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer

    Authors: David Berthelot, Colin Raffel, Aurko Roy, Ian Goodfellow

    Abstract: Autoencoders provide a powerful framework for learning compressed representations by encoding all of the information needed to reconstruct a data point in a latent code. In some cases, autoencoders can "interpolate": By decoding the convex combination of the latent codes for two datapoints, the autoencoder can produce an output which semantically mixes characteristics from the datapoints. In this… ▽ More

    Submitted 23 July, 2018; v1 submitted 19 July, 2018; originally announced July 2018.

  15. arXiv:1703.10717  [pdf, other

    cs.LG stat.ML

    BEGAN: Boundary Equilibrium Generative Adversarial Networks

    Authors: David Berthelot, Thomas Schumm, Luke Metz

    Abstract: We propose a new equilibrium enforcing method paired with a loss derived from the Wasserstein distance for training auto-encoder based Generative Adversarial Networks. This method balances the generator and discriminator during training. Additionally, it provides a new approximate convergence measure, fast and stable training and high visual quality. We also derive a way of controlling the trade-o… ▽ More

    Submitted 31 May, 2017; v1 submitted 30 March, 2017; originally announced March 2017.

  16. arXiv:1610.00768  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

    Authors: Nicolas Papernot, Fartash Faghri, Nicholas Carlini, Ian Goodfellow, Reuben Feinman, Alexey Kurakin, Cihang Xie, Yash Sharma, Tom Brown, Aurko Roy, Alexander Matyasko, Vahid Behzadan, Karen Hambardzumyan, Zhishuai Zhang, Yi-Lin Juang, Zhi Li, Ryan Sheatsley, Abhibhav Garg, Jonathan Uesato, Willi Gierke, Yinpeng Dong, David Berthelot, Paul Hendricks, Jonas Rauber, Rujun Long , et al. (1 additional authors not shown)

    Abstract: CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to provide standardized benchmarks of models' performance in the adversarial setting. Benchmarks constructed without a standardized implementation of adversarial exam… ▽ More

    Submitted 27 June, 2018; v1 submitted 3 October, 2016; originally announced October 2016.

    Comments: Technical report for https://github.com/tensorflow/cleverhans

  17. arXiv:1608.03542  [pdf, other

    cs.CL

    WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia

    Authors: Daniel Hewlett, Alexandre Lacoste, Llion Jones, Illia Polosukhin, Andrew Fandrianto, Jay Han, Matthew Kelcey, David Berthelot

    Abstract: We present WikiReading, a large-scale natural language understanding task and publicly-available dataset with 18 million instances. The task is to predict textual values from the structured knowledge base Wikidata by reading the text of the corresponding Wikipedia articles. The task contains a rich variety of challenging classification and extraction sub-tasks, making it well-suited for end-to-end… ▽ More

    Submitted 15 March, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Journal ref: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016, pp. 1535-1545