Skip to main content

Showing 1–13 of 13 results for author: Saremi, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.00557  [pdf, other

    stat.ML cs.LG

    Sampling Binary Data by Denoising through Score Functions

    Authors: Francis Bach, Saeed Saremi

    Abstract: Gaussian smoothing combined with a probabilistic framework for denoising via the empirical Bayes formalism, i.e., the Tweedie-Miyasawa formula (TMF), are the two key ingredients in the success of score-based generative models in Euclidean spaces. Smoothing holds the key for easing the problem of learning and sampling in high dimensions, denoising is needed for recovering the original signal, and T… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  2. arXiv:2305.19473  [pdf, other

    stat.ML cs.LG stat.CO

    Chain of Log-Concave Markov Chains

    Authors: Saeed Saremi, Ji Won Park, Francis Bach

    Abstract: We introduce a theoretical framework for sampling from unnormalized densities based on a smoothing scheme that uses an isotropic Gaussian kernel with a single fixed noise scale. We prove one can decompose sampling from a density (minimal assumptions made on the density) into a sequence of sampling from log-concave conditional densities via accumulation of noisy measurements with equal noise levels… ▽ More

    Submitted 28 September, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  3. arXiv:2303.11669  [pdf, other

    stat.ML cs.LG

    Universal Smoothed Score Functions for Generative Modeling

    Authors: Saeed Saremi, Rupesh Kumar Srivastava, Francis Bach

    Abstract: We consider the problem of generative modeling based on smoothing an unknown density of interest in $\mathbb{R}^d$ using factorial kernels with $M$ independent Gaussian channels with equal noise levels introduced by Saremi and Srivastava (2022). First, we fully characterize the time complexity of learning the resulting smoothed density in $\mathbb{R}^{Md}$, called M-density, by deriving a universa… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Technical Report

  4. arXiv:2112.09822  [pdf, other

    stat.ML cs.LG

    Multimeasurement Generative Models

    Authors: Saeed Saremi, Rupesh Kumar Srivastava

    Abstract: We formally map the problem of sampling from an unknown distribution with a density in $\mathbb{R}^d$ to the problem of learning and sampling a smoother density in $\mathbb{R}^{Md}$ obtained by convolution with a fixed factorial kernel: the new density is referred to as M-density and the kernel as multimeasurement noise model (MNM). The M-density in $\mathbb{R}^{Md}$ is smoother than the original… ▽ More

    Submitted 16 June, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Our code is publicly available at https://github.com/nnaisense/mems

    Journal ref: International Conference on Learning Representations, 2022

  5. arXiv:2007.15130  [pdf, other

    stat.ML cs.LG

    Unnormalized Variational Bayes

    Authors: Saeed Saremi

    Abstract: We unify empirical Bayes and variational Bayes for approximating unnormalized densities. This framework, named unnormalized variational Bayes (UVB), is based on formulating a latent variable model for the random variable $Y=X+N(0,σ^2 I_d)$ and using the evidence lower bound (ELBO), computed by a variational autoencoder, as a parametrization of the energy function of $Y$ which is then used to estim… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Submitted to Journal of Machine Learning Research

  6. arXiv:2005.09047  [pdf, other

    stat.ML cs.LG

    Learning and Inference in Imaginary Noise Models

    Authors: Saeed Saremi

    Abstract: Inspired by recent developments in learning smoothed densities with empirical Bayes, we study variational autoencoders with a decoder that is tailored for the random variable $Y=X+N(0,σ^2 I_d)$. A notion of smoothed variational inference emerges where the smoothing is implicitly enforced by the noise model of the decoder; "implicit", since during training the encoder only sees clean samples. This… ▽ More

    Submitted 5 June, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

  7. arXiv:2005.04504  [pdf, other

    stat.ML cs.LG math.OC

    Provable Robust Classification via Learned Smoothed Densities

    Authors: Saeed Saremi, Rupesh Srivastava

    Abstract: Smoothing classifiers and probability density functions with Gaussian kernels appear unrelated, but in this work, they are unified for the problem of robust classification. The key building block is approximating the $\textit{energy function}$ of the random variable $Y=X+N(0,σ^2 I_d)$ with a neural network which we use to formulate the problem of robust classification in terms of $\widehat{x}(Y)$,… ▽ More

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: 24 pages, 6 figures

  8. arXiv:1912.03845  [pdf, other

    cs.LG stat.ML

    No Representation without Transformation

    Authors: Giorgio Giannone, Saeed Saremi, Jonathan Masci, Christian Osendorfer

    Abstract: We extend the framework of variational autoencoders to represent transformations explicitly in the latent space. In the family of hierarchical graphical models that emerges, the latent space is populated by higher order objects that are inferred jointly with the latent representations they act on. To explicitly demonstrate the effect of these higher order objects, we show that the inferred latent… ▽ More

    Submitted 23 April, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Preprint. Accepted at BDL and PGR workshops at NeurIPS 2019

  9. arXiv:1910.12744  [pdf, ps, other

    stat.ML cs.LG

    On approximating $\nabla f$ with neural networks

    Authors: Saeed Saremi

    Abstract: Consider a feedforward neural network $ψ: \mathbb{R}^d\rightarrow \mathbb{R}^d$ such that $ψ\approx \nabla f$, where $f:\mathbb{R}^d \rightarrow \mathbb{R}$ is a smooth function, therefore $ψ$ must satisfy $\partial_j ψ_i = \partial_i ψ_j$ pointwise. We prove a theorem that a $ψ$ network with more than one hidden layer can only represent one feature in its first hidden layer; this is a dramatic de… ▽ More

    Submitted 6 November, 2019; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: 10 pages

  10. arXiv:1903.02334  [pdf, other

    stat.ML cs.LG

    Neural Empirical Bayes

    Authors: Saeed Saremi, Aapo Hyvarinen

    Abstract: We unify $\textit{kernel density estimation}$ and $\textit{empirical Bayes}$ and address a set of problems in unsupervised learning with a geometric interpretation of those methods, rooted in the $\textit{concentration of measure}$ phenomenon. Kernel density is viewed symbolically as $X\rightharpoonup Y$ where the random variable $X$ is smoothed to $Y= X+N(0,σ^2 I_d)$, and empirical Bayes is the m… ▽ More

    Submitted 21 April, 2020; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: 23 pages, 10 figures

    Journal ref: Journal of Machine Learning Research 20(181), 1-23, 2019

  11. arXiv:1805.08306  [pdf, other

    stat.ML cs.LG

    Deep Energy Estimator Networks

    Authors: Saeed Saremi, Arash Mehrjou, Bernhard Schölkopf, Aapo Hyvärinen

    Abstract: Density estimation is a fundamental problem in statistical learning. This problem is especially challenging for complex high-dimensional data due to the curse of dimensionality. A promising solution to this problem is given here in an inference-free hierarchical framework that is built on score matching. We revisit the Bayesian interpretation of the score function and the Parzen score matching, an… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

  12. arXiv:1705.07505  [pdf, other

    stat.ML cs.LG

    Annealed Generative Adversarial Networks

    Authors: Arash Mehrjou, Bernhard Schölkopf, Saeed Saremi

    Abstract: We introduce a novel framework for adversarial training where the target distribution is annealed between the uniform distribution and the data distribution. We posited a conjecture that learning under continuous annealing in the nonparametric regime is stable irrespective of the divergence measures in the objective function and proposed an algorithm, dubbed ß-GAN, in corollary. In this framework,… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

    Comments: 9 pages, 6 figures

  13. arXiv:1510.07740  [pdf, other

    stat.ML cond-mat.stat-mech cs.CV cs.LG

    The Wilson Machine for Image Modeling

    Authors: Saeed Saremi, Terrence J. Sejnowski

    Abstract: Learning the distribution of natural images is one of the hardest and most important problems in machine learning. The problem remains open, because the enormous complexity of the structures in natural images spans all length scales. We break down the complexity of the problem and show that the hierarchy of structures in natural images fuels a new class of learning algorithms based on the theory o… ▽ More

    Submitted 11 November, 2015; v1 submitted 26 October, 2015; originally announced October 2015.