Skip to main content

Showing 1–15 of 15 results for author: Guth, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05310  [pdf, ps, other

    cs.LG

    Learning normalized image densities via dual score matching

    Authors: Florentin Guth, Zahra Kadkhodaie, Eero P Simoncelli

    Abstract: Learning probability models from data is at the heart of many machine learning endeavors, but is notoriously difficult due to the curse of dimensionality. We introduce a new framework for learning \emph{normalized} energy (log probability) models that is inspired from diffusion generative models, which rely on networks optimized to estimate the score. We modify a score network architecture to comp… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2410.03505  [pdf, other

    cs.CV cs.LG

    Classification-Denoising Networks

    Authors: Louis Thiry, Florentin Guth

    Abstract: Image classification and denoising suffer from complementary issues of lack of robustness or partially ignoring conditioning information. We argue that they can be alleviated by unifying both tasks through a model of the joint probability of (noisy) images and class labels. Classification is performed with a forward pass followed by conditioning. Using the Tweedie-Miyasawa formula, we evaluate the… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 18 pages, 5 figures

  3. arXiv:2409.19460  [pdf, other

    cs.LG cs.CV

    On the universality of neural encodings in CNNs

    Authors: Florentin Guth, Brice Ménard

    Abstract: We explore the universality of neural encodings in convolutional neural networks trained on image classification tasks. We develop a procedure to directly compare the learned weights rather than their representations. It is based on a factorization of spatial and channel dimensions and measures the similarity of aligned weight covariances. We show that, for a range of layers of VGG-type networks,… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: Appeared at the ICLR 2024 Workshop on Representational Alignment (Re-Align), 13 pages, 5 figures

  4. arXiv:2310.02557  [pdf, other

    cs.CV cs.LG

    Generalization in diffusion models arises from geometry-adaptive harmonic representations

    Authors: Zahra Kadkhodaie, Florentin Guth, Eero P. Simoncelli, Stéphane Mallat

    Abstract: Deep neural networks (DNNs) trained for image denoising are able to generate high-quality samples with score-based reverse diffusion algorithms. These impressive capabilities seem to imply an escape from the curse of dimensionality, but recent reports of memorization of the training set raise the question of whether these networks are learning the "true" continuous density of the data. Here, we sh… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted for oral presentation at ICLR, Vienna, May 2024

    Journal ref: Int'l Conf on Learning Representations (ICLR), vol.12, Vienna, May 2024. Outstanding Paper award

  5. arXiv:2306.00181  [pdf, other

    stat.ML cs.CV cs.LG eess.SP

    Conditionally Strongly Log-Concave Generative Models

    Authors: Florentin Guth, Etienne Lempereur, Joan Bruna, Stéphane Mallat

    Abstract: There is a growing gap between the impressive results of deep image generative models and classical algorithms that offer theoretical guarantees. The former suffer from mode collapse or memorization issues, limiting their application to scientific data. The latter require restrictive assumptions such as log-concavity to escape the curse of dimensionality. We partially bridge this gap by introducin… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 28 pages, 12 figures, accepted at ICML 2023

  6. arXiv:2305.18512  [pdf, other

    cs.LG cs.CV eess.SP

    A Rainbow in Deep Network Black Boxes

    Authors: Florentin Guth, Brice Ménard, Gaspar Rochette, Stéphane Mallat

    Abstract: A central question in deep learning is to understand the functions learned by deep networks. What is their approximation class? Do the learned weights and representations depend on initialization? Previous empirical work has evidenced that kernels defined by network activations are similar across initializations. For shallow networks, this has been theoretically studied with random feature models,… ▽ More

    Submitted 24 October, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 59 pages, 10 figures. To appear at JMLR

  7. arXiv:2303.02984  [pdf, other

    cs.CV cs.LG

    Learning multi-scale local conditional probability models of images

    Authors: Zahra Kadkhodaie, Florentin Guth, Stéphane Mallat, Eero P Simoncelli

    Abstract: Deep neural networks can learn powerful prior probability models for images, as evidenced by the high-quality generations obtained with recent score-based diffusion methods. But the means by which these networks capture complex global statistical structure, apparently without suffering from the curse of dimensionality, remain a mystery. To study this, we incorporate diffusion methods into a multi-… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 16 pages, 8 figures

    Journal ref: ICLR 2023

  8. arXiv:2208.05003  [pdf, other

    cs.LG cs.CV stat.ML

    Wavelet Score-Based Generative Modeling

    Authors: Florentin Guth, Simon Coste, Valentin De Bortoli, Stephane Mallat

    Abstract: Score-based generative models (SGMs) synthesize new data samples from Gaussian white noise by running a time-reversed Stochastic Differential Equation (SDE) whose drift coefficient depends on some probabilistic score. The discretization of such SDEs typically requires a large number of time steps and hence a high computational cost. This is because of ill-conditioning properties of the score that… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  9. arXiv:2111.13309  [pdf, other

    cs.CV cs.AI

    Data Augmented 3D Semantic Scene Completion with 2D Segmentation Priors

    Authors: Aloisio Dourado, Frederico Guth, Teofilo de Campos

    Abstract: Semantic scene completion (SSC) is a challenging Computer Vision task with many practical applications, from robotics to assistive computing. Its goal is to infer the 3D geometry in a field of view of a scene and the semantic labels of voxels, including occluded regions. In this work, we present SPAwN, a novel lightweight multimodal 3D deep CNN that seamlessly fuses structural data from the depth… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: 10 pages, 5 figures

    ACM Class: I.5.0

  10. arXiv:2110.05283  [pdf, other

    cs.LG eess.SP stat.ML

    Phase Collapse in Neural Networks

    Authors: Florentin Guth, John Zarka, Stéphane Mallat

    Abstract: Deep convolutional classifiers linearly separate image classes and improve accuracy as depth increases. They progressively reduce the spatial dimension whereas the number of channels grows with depth. Spatial variability is therefore transformed into variability along channels. A fundamental challenge is to understand the role of non-linearities together with convolutional filters in this transfor… ▽ More

    Submitted 21 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 17 pages, 2 figures

    Journal ref: International Conference on Learning Representations, 2022

  11. arXiv:2012.10424  [pdf, other

    cs.LG cs.CV

    Separation and Concentration in Deep Networks

    Authors: John Zarka, Florentin Guth, Stéphane Mallat

    Abstract: Numerical experiments demonstrate that deep neural network classifiers progressively separate class distributions around their mean, achieving linear separability on the training set, and increasing the Fisher discriminant ratio. We explain this mechanism with two types of operators. We prove that a rectifier without biases applied to sign-invariant tight frames can separate class means and increa… ▽ More

    Submitted 15 March, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

  12. arXiv:1912.08812  [pdf, other

    cs.DL cs.CV cs.LG

    Research Frontiers in Transfer Learning -- a systematic and bibliometric review

    Authors: Frederico Guth, Teofilo Emidio de-Campos

    Abstract: Humans can learn from very few samples, demonstrating an outstanding generalization ability that learning algorithms are still far from reaching. Currently, the most successful models demand enormous amounts of well-labeled data, which are expensive and difficult to obtain, becoming one of the biggest obstacles to the use of machine learning in practice. This scenario shows the massive potential f… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 19 pages, 9 figures

    MSC Class: 68T05 ACM Class: I.5

  13. arXiv:1903.06969  [pdf, other

    cs.CV

    Domain adaptation for holistic skin detection

    Authors: Aloisio Dourado, Frederico Guth, Teofilo Emidio de Campos, Li Weigang

    Abstract: Human skin detection in images is a widely studied topic of Computer Vision for which it is commonly accepted that analysis of pixel color or local patches may suffice. This is because skin regions appear to be relatively uniform and many argue that there is a small chromatic variation among different samples. However, we found that there are strong biases in the datasets commonly used to train or… ▽ More

    Submitted 28 March, 2020; v1 submitted 16 March, 2019; originally announced March 2019.

    Comments: 11 pages, 10 figures, 6 tables

    ACM Class: I.4.6; I.2.10; I.5.1

  14. arXiv:1811.11314  [pdf, other

    cs.CV

    Skin lesion segmentation using U-Net and good training strategies

    Authors: Fred Guth, Teofilo E. deCampos

    Abstract: In this paper we approach the problem of skin lesion segmentation using a convolutional neural network based on the U-Net architecture. We present a set of training strategies that had a significant impact on the performance of this model. We evaluated this method on the ISIC Challenge 2018 - Skin Lesion Analysis Towards Melanoma Detection, obtaining threshold Jaccard index of 77.5%.

    Submitted 27 November, 2018; originally announced November 2018.

    MSC Class: 68T45 ACM Class: I.2.10

  15. arXiv:1807.07822  [pdf, other

    cs.SE cs.PL

    Specification Mining for Smart Contracts with Automatic Abstraction Tuning

    Authors: Florentin Guth, Valentin Wüstholz, Maria Christakis, Peter Müller

    Abstract: Smart contracts are programs that manage digital assets according to a certain protocol, expressing for instance the rules of an auction. Understanding the possible behaviors of a smart contract is difficult, which complicates development, auditing, and the post-mortem analysis of attacks. This paper presents the first specification mining technique for smart contracts. Our technique extracts th… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.