Skip to main content

Showing 1–27 of 27 results for author: Saremi, S

.
  1. arXiv:2502.00557  [pdf, other

    stat.ML cs.LG

    Sampling Binary Data by Denoising through Score Functions

    Authors: Francis Bach, Saeed Saremi

    Abstract: Gaussian smoothing combined with a probabilistic framework for denoising via the empirical Bayes formalism, i.e., the Tweedie-Miyasawa formula (TMF), are the two key ingredients in the success of score-based generative models in Euclidean spaces. Smoothing holds the key for easing the problem of learning and sampling in high dimensions, denoising is needed for recovering the original signal, and T… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  2. arXiv:2501.08508  [pdf, other

    cs.LG q-bio.BM

    Score-based 3D molecule generation with neural fields

    Authors: Matthieu Kirchmeyer, Pedro O. Pinheiro, Saeed Saremi

    Abstract: We introduce a new representation for 3D molecules based on their continuous atomic density fields. Using this representation, we propose a new model based on walk-jump sampling for unconditional 3D molecule generation in the continuous space using neural fields. Our model, FuncMol, encodes molecular fields into latent codes using a conditional neural field, samples noisy codes from a Gaussian-smo… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: NeurIPS 2024

  3. arXiv:2410.14621  [pdf, other

    physics.bio-ph cs.LG q-bio.BM

    JAMUN: Transferable Molecular Conformational Ensemble Generation with Walk-Jump Sampling

    Authors: Ameya Daigavane, Bodhi P. Vani, Saeed Saremi, Joseph Kleinhenz, Joshua Rackers

    Abstract: Conformational ensembles of protein structures are immensely important both to understanding protein function, and for drug discovery in novel modalities such as cryptic pockets. Current techniques for sampling ensembles are computationally inefficient, or do not transfer to systems outside their training data. We present walk-Jump Accelerated Molecular ensembles with Universal Noise (JAMUN), a st… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  4. arXiv:2407.03428  [pdf, other

    cs.LG q-bio.BM

    NEBULA: Neural Empirical Bayes Under Latent Representations for Efficient and Controllable Design of Molecular Libraries

    Authors: Ewa M. Nowara, Pedro O. Pinheiro, Sai Pooja Mahajan, Omar Mahmood, Andrew Martin Watkins, Saeed Saremi, Michael Maser

    Abstract: We present NEBULA, the first latent 3D generative model for scalable generation of large molecular libraries around a seed compound of interest. Such libraries are crucial for scientific discovery, but it remains challenging to generate large numbers of high quality samples efficiently. 3D-voxel-based methods have recently shown great promise for generating high quality samples de novo from random… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  5. arXiv:2405.03961  [pdf, other

    cs.LG q-bio.BM

    Structure-based drug design by denoising voxel grids

    Authors: Pedro O. Pinheiro, Arian Jamasb, Omar Mahmood, Vishnu Sresht, Saeed Saremi

    Abstract: We present VoxBind, a new score-based generative model for 3D molecules conditioned on protein structures. Our approach represents molecules as 3D atomic density grids and leverages a 3D voxel-denoising network for learning and generation. We extend the neural empirical Bayes formalism (Saremi & Hyvarinen, 2019) to the conditional setting and generate structure-conditioned molecules with a two-ste… ▽ More

    Submitted 2 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  6. arXiv:2306.12360  [pdf, other

    q-bio.BM cs.LG

    Protein Discovery with Discrete Walk-Jump Sampling

    Authors: Nathan C. Frey, Daniel Berenberg, Karina Zadorozhny, Joseph Kleinhenz, Julien Lafrance-Vanasse, Isidro Hotzel, Yan Wu, Stephen Ra, Richard Bonneau, Kyunghyun Cho, Andreas Loukas, Vladimir Gligorijevic, Saeed Saremi

    Abstract: We resolve difficulties in training and sampling from a discrete generative model by learning a smoothed energy function, sampling from the smoothed data manifold with Langevin Markov chain Monte Carlo (MCMC), and projecting back to the true data manifold with one-step denoising. Our Discrete Walk-Jump Sampling formalism combines the contrastive divergence training of an energy-based model and imp… ▽ More

    Submitted 15 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 oral presentation, top 1.2% of submissions; {ICLR 2023 Physics for Machine Learning, NeurIPS 2023 GenBio, MLCB 2023} Spotlight

    Journal ref: The Twelfth International Conference on Learning Representations, 2024

  7. arXiv:2306.07473  [pdf, other

    cs.LG q-bio.QM

    3D molecule generation by denoising voxel grids

    Authors: Pedro O. Pinheiro, Joshua Rackers, Joseph Kleinhenz, Michael Maser, Omar Mahmood, Andrew Martin Watkins, Stephen Ra, Vishnu Sresht, Saeed Saremi

    Abstract: We propose a new score-based approach to generate 3D molecules represented as atomic densities on regular grids. First, we train a denoising neural network that learns to map from a smooth distribution of noisy molecules to the distribution of real molecules. Then, we follow the neural empirical Bayes framework (Saremi and Hyvarinen, 19) and generate molecules in two steps: (i) sample noisy densit… ▽ More

    Submitted 8 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  8. arXiv:2305.19473  [pdf, other

    stat.ML cs.LG stat.CO

    Chain of Log-Concave Markov Chains

    Authors: Saeed Saremi, Ji Won Park, Francis Bach

    Abstract: We introduce a theoretical framework for sampling from unnormalized densities based on a smoothing scheme that uses an isotropic Gaussian kernel with a single fixed noise scale. We prove one can decompose sampling from a density (minimal assumptions made on the density) into a sequence of sampling from log-concave conditional densities via accumulation of noisy measurements with equal noise levels… ▽ More

    Submitted 28 September, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  9. arXiv:2303.11669  [pdf, other

    stat.ML cs.LG

    Universal Smoothed Score Functions for Generative Modeling

    Authors: Saeed Saremi, Rupesh Kumar Srivastava, Francis Bach

    Abstract: We consider the problem of generative modeling based on smoothing an unknown density of interest in $\mathbb{R}^d$ using factorial kernels with $M$ independent Gaussian channels with equal noise levels introduced by Saremi and Srivastava (2022). First, we fully characterize the time complexity of learning the resulting smoothed density in $\mathbb{R}^{Md}$, called M-density, by deriving a universa… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Technical Report

  10. arXiv:2210.04096  [pdf, other

    cs.LG q-bio.QM

    PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

    Authors: Ji Won Park, Samuel Stanton, Saeed Saremi, Andrew Watkins, Henri Dwyer, Vladimir Gligorijevic, Richard Bonneau, Stephen Ra, Kyunghyun Cho

    Abstract: Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarch… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Workshop

  11. arXiv:2112.09822  [pdf, other

    stat.ML cs.LG

    Multimeasurement Generative Models

    Authors: Saeed Saremi, Rupesh Kumar Srivastava

    Abstract: We formally map the problem of sampling from an unknown distribution with a density in $\mathbb{R}^d$ to the problem of learning and sampling a smoother density in $\mathbb{R}^{Md}$ obtained by convolution with a fixed factorial kernel: the new density is referred to as M-density and the kernel as multimeasurement noise model (MNM). The M-density in $\mathbb{R}^{Md}$ is smoother than the original… ▽ More

    Submitted 16 June, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Our code is publicly available at https://github.com/nnaisense/mems

    Journal ref: International Conference on Learning Representations, 2022

  12. arXiv:2101.11890  [pdf, other

    cs.LG cs.AI q-bio.QM

    Automatic design of novel potential 3CL$^{\text{pro}}$ and PL$^{\text{pro}}$ inhibitors

    Authors: Timothy Atkinson, Saeed Saremi, Faustino Gomez, Jonathan Masci

    Abstract: With the goal of designing novel inhibitors for SARS-CoV-1 and SARS-CoV-2, we propose the general molecule optimization framework, Molecular Neural Assay Search (MONAS), consisting of three components: a property predictor which identifies molecules with specific desirable properties, an energy model which approximates the statistical similarity of a given molecule to known training molecules, and… ▽ More

    Submitted 29 January, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

  13. arXiv:2007.15130  [pdf, other

    stat.ML cs.LG

    Unnormalized Variational Bayes

    Authors: Saeed Saremi

    Abstract: We unify empirical Bayes and variational Bayes for approximating unnormalized densities. This framework, named unnormalized variational Bayes (UVB), is based on formulating a latent variable model for the random variable $Y=X+N(0,σ^2 I_d)$ and using the evidence lower bound (ELBO), computed by a variational autoencoder, as a parametrization of the energy function of $Y$ which is then used to estim… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Submitted to Journal of Machine Learning Research

  14. arXiv:2005.09047  [pdf, other

    stat.ML cs.LG

    Learning and Inference in Imaginary Noise Models

    Authors: Saeed Saremi

    Abstract: Inspired by recent developments in learning smoothed densities with empirical Bayes, we study variational autoencoders with a decoder that is tailored for the random variable $Y=X+N(0,σ^2 I_d)$. A notion of smoothed variational inference emerges where the smoothing is implicitly enforced by the noise model of the decoder; "implicit", since during training the encoder only sees clean samples. This… ▽ More

    Submitted 5 June, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

  15. arXiv:2005.04504  [pdf, other

    stat.ML cs.LG math.OC

    Provable Robust Classification via Learned Smoothed Densities

    Authors: Saeed Saremi, Rupesh Srivastava

    Abstract: Smoothing classifiers and probability density functions with Gaussian kernels appear unrelated, but in this work, they are unified for the problem of robust classification. The key building block is approximating the $\textit{energy function}$ of the random variable $Y=X+N(0,σ^2 I_d)$ with a neural network which we use to formulate the problem of robust classification in terms of $\widehat{x}(Y)$,… ▽ More

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: 24 pages, 6 figures

  16. arXiv:1912.03845  [pdf, other

    cs.LG stat.ML

    No Representation without Transformation

    Authors: Giorgio Giannone, Saeed Saremi, Jonathan Masci, Christian Osendorfer

    Abstract: We extend the framework of variational autoencoders to represent transformations explicitly in the latent space. In the family of hierarchical graphical models that emerges, the latent space is populated by higher order objects that are inferred jointly with the latent representations they act on. To explicitly demonstrate the effect of these higher order objects, we show that the inferred latent… ▽ More

    Submitted 23 April, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Preprint. Accepted at BDL and PGR workshops at NeurIPS 2019

  17. arXiv:1912.03257  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Piezoresponse phase as variable in electromechanical characterization

    Authors: Sabine M. Neumayer, Sahar Saremi, Lane W. Martin, Liam Collins, Alexander Tselev, Stephen Jesse, Sergei V. Kalinin, Nina Balke

    Abstract: Piezoresponse force microscopy (PFM) is a powerful characterization technique to readily image and manipulate ferroelectrics domains. PFM gives insight into the strength of local piezoelectric coupling as well as polarization direction through PFM amplitude and phase, respectively. Converting measured arbitrary units to physical material parameters, however, remains a challenge. While much effort… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Comments: 16 pages, 6 figures

  18. arXiv:1910.12744  [pdf, ps, other

    stat.ML cs.LG

    On approximating $\nabla f$ with neural networks

    Authors: Saeed Saremi

    Abstract: Consider a feedforward neural network $ψ: \mathbb{R}^d\rightarrow \mathbb{R}^d$ such that $ψ\approx \nabla f$, where $f:\mathbb{R}^d \rightarrow \mathbb{R}$ is a smooth function, therefore $ψ$ must satisfy $\partial_j ψ_i = \partial_i ψ_j$ pointwise. We prove a theorem that a $ψ$ network with more than one hidden layer can only represent one feature in its first hidden layer; this is a dramatic de… ▽ More

    Submitted 6 November, 2019; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: 10 pages

  19. arXiv:1903.02334  [pdf, other

    stat.ML cs.LG

    Neural Empirical Bayes

    Authors: Saeed Saremi, Aapo Hyvarinen

    Abstract: We unify $\textit{kernel density estimation}$ and $\textit{empirical Bayes}$ and address a set of problems in unsupervised learning with a geometric interpretation of those methods, rooted in the $\textit{concentration of measure}$ phenomenon. Kernel density is viewed symbolically as $X\rightharpoonup Y$ where the random variable $X$ is smoothed to $Y= X+N(0,σ^2 I_d)$, and empirical Bayes is the m… ▽ More

    Submitted 21 April, 2020; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: 23 pages, 10 figures

    Journal ref: Journal of Machine Learning Research 20(181), 1-23, 2019

  20. arXiv:1805.08306  [pdf, other

    stat.ML cs.LG

    Deep Energy Estimator Networks

    Authors: Saeed Saremi, Arash Mehrjou, Bernhard Schölkopf, Aapo Hyvärinen

    Abstract: Density estimation is a fundamental problem in statistical learning. This problem is especially challenging for complex high-dimensional data due to the curse of dimensionality. A promising solution to this problem is given here in an inference-free hierarchical framework that is built on score matching. We revisit the Bayesian interpretation of the score function and the Parzen score matching, an… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

  21. arXiv:1705.07505  [pdf, other

    stat.ML cs.LG

    Annealed Generative Adversarial Networks

    Authors: Arash Mehrjou, Bernhard Schölkopf, Saeed Saremi

    Abstract: We introduce a novel framework for adversarial training where the target distribution is annealed between the uniform distribution and the data distribution. We posited a conjecture that learning under continuous annealing in the nonparametric regime is stable irrespective of the divergence measures in the objective function and proposed an algorithm, dubbed ß-GAN, in corollary. In this framework,… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

    Comments: 9 pages, 6 figures

  22. Pressurizing Field-Effect Transistors of Few-Layer MoS2 in a Diamond Anvil Cell

    Authors: Yabin Chen, Feng Ke, Penghong Ci, Changhyun Ko, Taegyun Park, Sahar Saremi, Huili Liu, Yeonbae Lee, Joonki Suh, Lane W. Martin, Joel W. Ager, Bin Chen, Junqiao Wu

    Abstract: Hydrostatic pressure applied using diamond anvil cells (DAC) has been widely explored to modulate physical properties of materials by tuning their lattice degree of freedom. Independently, electrical field is able to tune the electronic degree of freedom of functional materials via, for example, the field-effect transistor (FET) configuration. Combining these two orthogonal approaches would allow… ▽ More

    Submitted 2 October, 2016; originally announced October 2016.

    Comments: 15 pages, 5 figures

  23. arXiv:1510.07740  [pdf, other

    stat.ML cond-mat.stat-mech cs.CV cs.LG

    The Wilson Machine for Image Modeling

    Authors: Saeed Saremi, Terrence J. Sejnowski

    Abstract: Learning the distribution of natural images is one of the hardest and most important problems in machine learning. The problem remains open, because the enormous complexity of the structures in natural images spans all length scales. We break down the complexity of the problem and show that the hierarchy of structures in natural images fuels a new class of learning algorithms based on the theory o… ▽ More

    Submitted 11 November, 2015; v1 submitted 26 October, 2015; originally announced October 2015.

  24. The Physics of the B Factories

    Authors: A. J. Bevan, B. Golob, Th. Mannel, S. Prell, B. D. Yabsley, K. Abe, H. Aihara, F. Anulli, N. Arnaud, T. Aushev, M. Beneke, J. Beringer, F. Bianchi, I. I. Bigi, M. Bona, N. Brambilla, J. B rodzicka, P. Chang, M. J. Charles, C. H. Cheng, H. -Y. Cheng, R. Chistov, P. Colangelo, J. P. Coleman, A. Drutskoy , et al. (2009 additional authors not shown)

    Abstract: This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C. Please note that version 3 on the archive is the auxiliary… ▽ More

    Submitted 31 October, 2015; v1 submitted 24 June, 2014; originally announced June 2014.

    Comments: 928 pages, version 3 (arXiv:1406.6311v3) corresponds to the alpha, beta, gamma version of the book, the other versions use the phi1, phi2, phi3 notation

    Report number: SLAC-PUB-15968, KEK Preprint 2014-3

    Journal ref: Eur. Phys. J. C74 (2014) 3026

  25. arXiv:0903.4195  [pdf, ps, other

    cond-mat.str-el

    Kondo Vortices, Zero Modes, and Magnetic Ordering in a Kondo Lattice Model

    Authors: Saeed Saremi, Patrick A. Lee, T. Senthil

    Abstract: Motivated by the mysteries of the heavy fermion quantum critical point, we investigate the competition between Kondo screening and magnetic ordering in the honeycomb Kondo lattice at half filling. We examine the destruction of the Kondo phase by proliferating vortex configurations in the Kondo hybridization order parameter. We find that there are zero modes associated with Kondo vortices. Conden… ▽ More

    Submitted 25 March, 2009; originally announced March 2009.

    Comments: 4 pages, 2 figures, 4 tables

  26. RKKY in half-filled bipartite lattices: graphene as an example

    Authors: Saeed Saremi

    Abstract: We first present a simple proof that for any bipartite lattice at half filling the RKKY interaction is antiferromagnetic between impurities on opposite (i.e., A and B) sublattices and is ferromagnetic between impurities on the same sublattices. This result is valid on all length scales. We then focus on the honeycomb lattice and examine the theorem in the long distance limit by performing the lo… ▽ More

    Submitted 26 November, 2007; v1 submitted 2 May, 2007; originally announced May 2007.

    Comments: v3. The published version. 6 pages, 1 figure

    Journal ref: Phys. Rev. B 76, 184430 (2007) (6 pages)

  27. Quantum critical point in the Kondo-Heisenberg model on the honeycomb lattice

    Authors: Saeed Saremi, Patrick A. Lee

    Abstract: We study the Kondo--Heisenberg model on the honeycomb lattice at half-filling. Due to the vanishing of the density of states at the fermi level, the Kondo insulator disappears at a finite Kondo coupling even in the absence of the Heisenberg exchange. We adopt a large-N formulation of this model and use the renormalization group machinery to study systematically the second order phase transition… ▽ More

    Submitted 1 May, 2007; v1 submitted 10 October, 2006; originally announced October 2006.

    Comments: The published version. New title. Very minor changes in the abstract and introduction compared to the original version

    Journal ref: Phys. Rev. B 75, 165110 (2007)