Skip to main content

Showing 1–10 of 10 results for author: Ross, B L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.10060  [pdf, other

    cs.LG cs.AI stat.ML

    Textual Bayes: Quantifying Uncertainty in LLM-Based Systems

    Authors: Brendan Leigh Ross, Noël Vouitsis, Atiyeh Ashari Ghomi, Rasa Hosseinzadeh, Ji Xin, Zhaoyan Liu, Yi Sui, Shiyi Hou, Kin Kwan Leung, Gabriel Loaiza-Ganem, Jesse C. Cresswell

    Abstract: Although large language models (LLMs) are becoming increasingly capable of solving challenging real-world tasks, accurately quantifying their uncertainty remains a critical open problem, which limits their applicability in high-stakes domains. This challenge is further compounded by the closed-source, black-box nature of many state-of-the-art LLMs. Moreover, LLM-based systems can be highly sensiti… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2411.00113  [pdf, other

    stat.ML cs.LG

    A Geometric Framework for Understanding Memorization in Generative Models

    Authors: Brendan Leigh Ross, Hamidreza Kamkari, Tongzi Wu, Rasa Hosseinzadeh, Zhaoyan Liu, George Stein, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: As deep generative models have progressed, recent work has shown them to be capable of memorizing and reproducing training datapoints when deployed. These findings call into question the usability of generative models, especially in light of the legal and privacy risks brought about by memorization. To better understand this phenomenon, we propose the manifold memorization hypothesis (MMH), a geom… ▽ More

    Submitted 12 March, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

    Comments: Accepted to ICLR 2025 (Spotlight)

  3. arXiv:2406.03537  [pdf, other

    cs.LG cs.AI stat.ML

    A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: High-dimensional data commonly lies on low-dimensional submanifolds, and estimating the local intrinsic dimension (LID) of a datum -- i.e. the dimension of the submanifold it belongs to -- is a longstanding problem. LID can be understood as the number of local factors of variation: the more factors of variation a datum has, the more complex it tends to be. Estimating this quantity has proven usefu… ▽ More

    Submitted 24 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2024 (spotlight)

  4. arXiv:2404.02954  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Rasa Hosseinzadeh, Anthony L. Caterini, Jesse C. Cresswell

    Abstract: In recent years there has been increased interest in understanding the interplay between deep generative models (DGMs) and the manifold hypothesis. Research in this area focuses on understanding the reasons why commonly-used DGMs succeed or fail at learning distributions supported on unknown low-dimensional manifolds, as well as developing new models explicitly designed to account for manifold-sup… ▽ More

    Submitted 25 September, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: TMLR 2024 (survey certification, expert certification)

  5. arXiv:2403.18910  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Geometric Explanation of the Likelihood OOD Detection Paradox

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini, Rahul G. Krishnan, Gabriel Loaiza-Ganem

    Abstract: Likelihood-based deep generative models (DGMs) commonly exhibit a puzzling behaviour: when trained on a relatively complex dataset, they assign higher likelihood values to out-of-distribution (OOD) data from simpler sources. Adding to the mystery, OOD samples are never generated by these DGMs despite having higher likelihoods. This two-pronged paradox has yet to be conclusively explained, making l… ▽ More

    Submitted 11 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  6. arXiv:2306.04675  [pdf, other

    cs.LG cs.CV stat.ML

    Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

    Authors: George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

    Abstract: We systematically study a wide variety of generative models spanning semantically-diverse image datasets to understand and improve the feature extractors and metrics used to evaluate them. Using best practices in psychophysics, we measure human perception of image realism for generated samples by conducting the largest experiment evaluating generative models to date, and find that no existing metr… ▽ More

    Submitted 30 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 53 pages, 29 figures, 12 tables. Code at https://github.com/layer6ai-labs/dgm-eval, reviews at https://openreview.net/forum?id=08zf7kTOoh

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems (2023)

  7. arXiv:2207.02862  [pdf, other

    stat.ML cs.AI cs.LG

    Verifying the Union of Manifolds Hypothesis for Image Data

    Authors: Bradley C. A. Brown, Anthony L. Caterini, Brendan Leigh Ross, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: Deep learning has had tremendous success at learning low-dimensional representations of high-dimensional data. This success would be impossible if there was no hidden low-dimensional structure in data of interest; this existence is posited by the manifold hypothesis, which states that the data lies on an unknown manifold of low intrinsic dimension. In this paper, we argue that this hypothesis does… ▽ More

    Submitted 2 March, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: ICLR 2023

  8. arXiv:2206.11267  [pdf, other

    stat.ML cs.LG

    Neural Implicit Manifold Learning for Topology-Aware Density Estimation

    Authors: Brendan Leigh Ross, Gabriel Loaiza-Ganem, Anthony L. Caterini, Jesse C. Cresswell

    Abstract: Natural data observed in $\mathbb{R}^n$ is often constrained to an $m$-dimensional manifold $\mathcal{M}$, where $m < n$. This work focuses on the task of building theoretically principled generative models for such data. Current generative models learn $\mathcal{M}$ by mapping an $m$-dimensional latent variable through a neural network $f_θ: \mathbb{R}^m \to \mathbb{R}^n$. These procedures, which… ▽ More

    Submitted 21 December, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted to TMLR in 2023. Code: https://github.com/layer6ai-labs/implicit-manifolds

  9. arXiv:2204.07172  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini

    Abstract: Likelihood-based, or explicit, deep generative models use neural networks to construct flexible high-dimensional densities. This formulation directly contradicts the manifold hypothesis, which states that observed data lies on a low-dimensional manifold embedded in high-dimensional ambient space. In this paper we investigate the pathologies of maximum-likelihood training in the presence of this di… ▽ More

    Submitted 28 November, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in TMLR

  10. arXiv:2106.05275  [pdf, other

    stat.ML cs.LG

    Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

    Authors: Brendan Leigh Ross, Jesse C. Cresswell

    Abstract: Normalizing flows are generative models that provide tractable density estimation via an invertible transformation from a simple base distribution to a complex target distribution. However, this technique cannot directly model data supported on an unknown low-dimensional manifold, a common occurrence in real-world domains such as image data. Recent attempts to remedy this limitation have introduce… ▽ More

    Submitted 11 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 Camera-Ready. Code: https://github.com/layer6ai-labs/CEF