Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution

Vardanyan, Elen; Hunanyan, Sona; Galstyan, Tigran; Minasyan, Arshak; Dalalyan, Arnak

Mathematics > Statistics Theory

arXiv:2307.16422 (math)

[Submitted on 31 Jul 2023 (v1), last revised 6 Jun 2024 (this version, v2)]

Title:Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution

Authors:Elen Vardanyan, Sona Hunanyan, Tigran Galstyan, Arshak Minasyan, Arnak Dalalyan

View PDF HTML (experimental)

Abstract:This paper explores the problem of generative modeling, aiming to simulate diverse examples from an unknown distribution based on observed examples. While recent studies have focused on quantifying the statistical precision of popular algorithms, there is a lack of mathematical evaluation regarding the non-replication of observed examples and the creativity of the generative model. We present theoretical insights into this aspect, demonstrating that the Wasserstein GAN, constrained to left-invertible push-forward maps, generates distributions that avoid replication and significantly deviate from the empirical distribution. Importantly, we show that left-invertibility achieves this without compromising the statistical optimality of the resulting generator. Our most important contribution provides a finite-sample lower bound on the Wasserstein-1 distance between the generative distribution and the empirical one. We also establish a finite-sample upper bound on the distance between the generative distribution and the true data-generating one. Both bounds are explicit and show the impact of key parameters such as sample size, dimensions of the ambient and latent spaces, noise level, and smoothness measured by the Lipschitz constant.

Comments:	ICML 2024
Subjects:	Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2307.16422 [math.ST]
	(or arXiv:2307.16422v2 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.2307.16422

Submission history

From: Arnak Dalalyan S. [view email]
[v1] Mon, 31 Jul 2023 06:11:57 UTC (880 KB)
[v2] Thu, 6 Jun 2024 14:00:36 UTC (8,605 KB)

Mathematics > Statistics Theory

Title:Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators