Skip to main content

Showing 1–9 of 9 results for author: Lou, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.20030  [pdf, other

    cs.LG math.DG stat.ML

    Scaling Riemannian Diffusion Models

    Authors: Aaron Lou, Minkai Xu, Stefano Ermon

    Abstract: Riemannian diffusion models draw inspiration from standard Euclidean space diffusion models to learn distributions on general manifolds. Unfortunately, the additional geometric complexity renders the diffusion transition term inexpressible in closed form, so prior methods resort to imprecise approximations of the score matching training objective that degrade performance and preclude applications… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  2. arXiv:2310.16834  [pdf, other

    stat.ML cs.CL cs.LG

    Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution

    Authors: Aaron Lou, Chenlin Meng, Stefano Ermon

    Abstract: Despite their groundbreaking performance for many generative modeling tasks, diffusion models have fallen short on discrete data domains such as natural language. Crucially, standard diffusion models rely on the well-established theory of score matching, but efforts to generalize this to discrete structures have not yielded the same empirical gains. In this work, we bridge this gap by proposing sc… ▽ More

    Submitted 6 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: ICML 2024 Oral. Code at https://github.com/louaaron/Score-Entropy-Discrete-Diffusion

  3. arXiv:2310.10013  [pdf, other

    stat.ML cs.LG

    Riemannian Residual Neural Networks

    Authors: Isay Katsman, Eric Ming Chen, Sidhanth Holalkere, Anna Asch, Aaron Lou, Ser-Nam Lim, Christopher De Sa

    Abstract: Recent methods in geometric deep learning have introduced various neural networks to operate over data that lie on Riemannian manifolds. Such networks are often necessary to learn well over graphs with a hierarchical structure or to learn over manifold-valued data encountered in the natural sciences. These networks are often inspired by and directly generalize standard Euclidean neural networks. H… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Published at NeurIPS 2023

  4. arXiv:2304.04740  [pdf, other

    stat.ML cs.LG

    Reflected Diffusion Models

    Authors: Aaron Lou, Stefano Ermon

    Abstract: Score-based diffusion models learn to reverse a stochastic differential equation that maps data to noise. However, for complex tasks, numerical error can compound and result in highly unnatural samples. Previous work mitigates this drift with thresholding, which projects to the natural data domain (such as pixel space for images) after each diffusion step, but this leads to a mismatch between the… ▽ More

    Submitted 8 June, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: ICML 2023 Camera Ready. Code available at https://github.com/louaaron/Reflected-Diffusion

  5. arXiv:2111.13171  [pdf, other

    cs.LG cs.AI cs.CV math.GN stat.ML

    Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

    Authors: Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut Şimşekli

    Abstract: Disobeying the classical wisdom of statistical learning theory, modern deep neural networks generalize well even though they typically contain millions of parameters. Recently, it has been shown that the trajectories of iterative optimization algorithms can possess fractal structures, and their generalization error can be formally linked to the complexity of such fractals. This complexity is measu… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: Appears at NeurIPS 2021

  6. arXiv:2107.08596  [pdf, other

    stat.ML cs.LG math.DG

    Equivariant Manifold Flows

    Authors: Isay Katsman, Aaron Lou, Derek Lim, Qingxuan Jiang, Ser-Nam Lim, Christopher De Sa

    Abstract: Tractably modelling distributions over manifolds has long been an important goal in the natural sciences. Recent work has focused on developing general machine learning models to learn such distributions. However, for many applications these distributions must respect manifold symmetries -- a trait which most previous models disregard. In this paper, we lay the theoretical foundations for learning… ▽ More

    Submitted 27 January, 2022; v1 submitted 18 July, 2021; originally announced July 2021.

    Comments: Published at NeurIPS 2021

  7. arXiv:2006.10254  [pdf, other

    stat.ML cs.LG math.DG

    Neural Manifold Ordinary Differential Equations

    Authors: Aaron Lou, Derek Lim, Isay Katsman, Leo Huang, Qingxuan Jiang, Ser-Nam Lim, Christopher De Sa

    Abstract: To better conform to data geometry, recent deep generative modelling techniques adapt Euclidean constructions to non-Euclidean spaces. In this paper, we study normalizing flows on manifolds. Previous work has developed flow models for specific cases; however, these advancements hand craft layers on a manifold-by-manifold basis, restricting generality and inducing cumbersome design constraints. We… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Submitted to NeurIPS 2020

  8. arXiv:2003.00335  [pdf, other

    stat.ML cs.LG

    Differentiating through the Fréchet Mean

    Authors: Aaron Lou, Isay Katsman, Qingxuan Jiang, Serge Belongie, Ser-Nam Lim, Christopher De Sa

    Abstract: Recent advances in deep representation learning on Riemannian manifolds extend classical deep learning operations to better capture the geometry of the manifold. One possible extension is the Fréchet mean, the generalization of the Euclidean mean; however, it has been difficult to apply because it lacks a closed form with an easily computable derivative. In this paper, we show how to differentiate… ▽ More

    Submitted 5 July, 2021; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: ICML 2020 camera-ready; updated Algorithm 1 typo

  9. arXiv:1812.01198  [pdf, other

    stat.ML cs.LG

    Adversarial Example Decomposition

    Authors: Horace He, Aaron Lou, Qingxuan Jiang, Isay Katsman, Serge Belongie, Ser-Nam Lim

    Abstract: Research has shown that widely used deep neural networks are vulnerable to carefully crafted adversarial perturbations. Moreover, these adversarial perturbations often transfer across models. We hypothesize that adversarial weakness is composed of three sources of bias: architecture, dataset, and random initialization. We show that one can decompose adversarial examples into an architecture-depend… ▽ More

    Submitted 21 June, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: ICML 2019 Workshop, Security and Privacy of Machine Learning, camera-ready version