Showing 1–24 of 24 results for author: Goldfeld, Z

  1. arXiv:2506.21507  [pdf, ps, other]

    math.ST stat.ML

    Robust Alignment via Partial Gromov-Wasserstein Distances

    Authors: Xiaoyun Gong, Sloan Nietert, Ziv Goldfeld

    Abstract: The Gromov-Wasserstein (GW) problem provides a powerful framework for aligning heterogeneous datasets by matching their internal structures in a way that minimizes distortion. However, GW alignment is sensitive to data contamination by outliers, which can greatly distort the resulting matching scheme. To address this issue, we study robust GW alignment, where upon observing contaminated versions o…

    Submitted 26 June, 2025; originally announced June 2025.

  2. arXiv:2411.07947  [pdf, ps, other]

    math.PR math.OC

    Approximation rates of entropic maps in semidiscrete optimal transport

    Authors: Ritwik Sadhu, Ziv Goldfeld, Kengo Kato

    Abstract: Entropic optimal transport offers a computationally tractable approximation to the classical problem. In this note, we study the approximation rate of the entropic optimal transport map (in approaching the Brenier map) when the regularization parameter $\varepsilon$ tends to zero in the semidiscrete setting, where the input measure is absolutely continuous while the output is finitely discrete. Pr…

    Submitted 21 November, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

    MSC Class: 49Q22; 60F05; 49N15

  3. arXiv:2410.18006  [pdf, other]

    math.ST math.PR

    Limit Laws for Gromov-Wasserstein Alignment with Applications to Testing Graph Isomorphisms

    Authors: Gabriel Rioux, Ziv Goldfeld, Kengo Kato

    Abstract: The Gromov-Wasserstein (GW) distance enables comparing metric measure spaces based solely on their internal structure, making it invariant to isomorphic transformations. This property is particularly useful for comparing datasets that naturally admit isomorphic representations, such as unlabelled graphs or objects embedded in space. However, apart from the recently derived empirical convergence ra…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 65 pages. 11 figures

  4. arXiv:2407.11800  [pdf, other]

    math.AP math.OC stat.ML

    Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry

    Authors: Zhengxin Zhang, Ziv Goldfeld, Kristjan Greenewald, Youssef Mroueh, Bharath K. Sriperumbudur

    Abstract: The Wasserstein space of probability measures is known for its intricate Riemannian structure, which underpins the Wasserstein geometry and enables gradient flow algorithms. However, the Wasserstein geometry may not be suitable for certain tasks or data modalities. Motivated by scenarios where the global structure of the data needs to be preserved, this work initiates the study of gradient flows a…

    Submitted 21 May, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 74 pages

  5. arXiv:2405.06734  [pdf, other]

    math.ST

    Neural Estimation Of Entropic Optimal Transport

    Authors: Tao Wang, Ziv Goldfeld

    Abstract: Optimal transport (OT) serves as a natural framework for comparing probability measures, with applications in statistics, machine learning, and applied mathematics. Alas, statistical estimation and exact computation of the OT distances suffer from the curse of dimensionality. To circumvent these issues, entropic regularization has emerged as a remedy that enables parametric estimation rates via pl…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.07397

  6. arXiv:2312.07397  [pdf, other]

    math.ST

    Neural Entropic Gromov-Wasserstein Alignment

    Authors: Tao Wang, Ziv Goldfeld

    Abstract: The Gromov-Wasserstein (GW) distance, rooted in optimal transport (OT) theory, provides a natural framework for aligning heterogeneous datasets. Alas, statistical estimation of the GW distance suffers from the curse of dimensionality and its exact computation is NP hard. To circumvent these issues, entropic regularization has emerged as a remedy that enables parametric estimation rates via plug-in…

    Submitted 12 December, 2023; originally announced December 2023.

  7. arXiv:2311.05573  [pdf, other]

    stat.ML cs.LG math.OC

    Outlier-Robust Wasserstein DRO

    Authors: Sloan Nietert, Ziv Goldfeld, Soroosh Shafiee

    Abstract: Distributionally robust optimization (DRO) is an effective approach for data-driven decision-making in the presence of uncertainty. Geometric uncertainty due to sampling or localized perturbations of data points is captured by Wasserstein DRO (WDRO), which seeks to learn a model that performs uniformly well over a Wasserstein ball centered around the observed data distribution. However, WDRO fails…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Appearing at NeurIPS 2023

  8. arXiv:2306.00182  [pdf, other]

    math.OC math.ST

    Entropic Gromov-Wasserstein Distances: Stability and Algorithms

    Authors: Gabriel Rioux, Ziv Goldfeld, Kengo Kato

    Abstract: The Gromov-Wasserstein (GW) distance quantifies discrepancy between metric measure spaces and provides a natural framework for aligning heterogeneous datasets. Alas, as exact computation of GW alignment is NP hard, entropic regularization provides an avenue towards a computationally tractable proxy. Leveraging a recently derived variational representation for the quadratic entropic GW (EGW) distan…

    Submitted 9 January, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: Version 3 of this arxiv report has been split into two parts. Version 4 of the arxiv report contains the algorithmic results of the original submission. The statistical results will appear as a separate arxiv submission

  9. arXiv:2303.10155  [pdf, other]

    math.ST math.PR

    Stability and statistical inference for semidiscrete optimal transport maps

    Authors: Ritwik Sadhu, Ziv Goldfeld, Kengo Kato

    Abstract: We study statistical inference for the optimal transport (OT) map (also known as the Brenier map) from a known absolutely continuous reference distribution onto an unknown finitely discrete target distribution. We derive limit distributions for the $L^p$-error with arbitrary $p \in [1,\infty)$ and for linear functionals of the empirical OT map, together with their moment convergence. The former ha…

    Submitted 20 May, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 43 pages

  10. arXiv:2302.01237  [pdf, other]

    stat.ML cs.LG math.ST

    Robust Estimation under the Wasserstein Distance

    Authors: Sloan Nietert, Rachel Cummings, Ziv Goldfeld

    Abstract: We study the problem of robust distribution estimation under the Wasserstein distance, a popular discrepancy measure between probability distributions rooted in optimal transport (OT) theory. Given $n$ samples from an unknown distribution $μ$, of which $\varepsilon n$ are adversarially corrupted, we seek an estimate for $μ$ with minimal Wasserstein error. To address this task, we draw upon two fra…

    Submitted 24 September, 2024; v1 submitted 2 February, 2023; originally announced February 2023.

  11. arXiv:2212.12848  [pdf, other]

    math.ST

    Gromov-Wasserstein Distances: Entropic Regularization, Duality, and Sample Complexity

    Authors: Zhengxin Zhang, Ziv Goldfeld, Youssef Mroueh, Bharath K. Sriperumbudur

    Abstract: The Gromov-Wasserstein (GW) distance, rooted in optimal transport (OT) theory, quantifies dissimilarity between metric measure spaces and provides a framework for aligning heterogeneous datasets. While computational aspects of the GW problem have been widely studied, a duality theory and fundamental statistical questions concerning empirical convergence rates remained obscure. This work closes the…

    Submitted 28 September, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: 47 pages

  12. arXiv:2211.11184  [pdf, ps, other]

    math.ST cs.IT

    Limit distribution theory for $f$-Divergences

    Authors: Sreejith Sreekumar, Ziv Goldfeld, Kengo Kato

    Abstract: $f$-divergences, which quantify discrepancy between probability distributions, are ubiquitous in information theory, machine learning, and statistics. While there are numerous methods for estimating $f…

    Submitted 12 October, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  13. arXiv:2207.08683  [pdf, other]

    math.ST math.PR

    Limit Theorems for Entropic Optimal Transport Maps and the Sinkhorn Divergence

    Authors: Ziv Goldfeld, Kengo Kato, Gabriel Rioux, Ritwik Sadhu

    Abstract: We study limit theorems for entropic optimal transport (EOT) maps, dual potentials, and the Sinkhorn divergence. The key technical tool we use is a first and second-order Hadamard differentiability analysis of EOT potentials with respect to the marginal distributions, which may be of independent interest. Given the differentiability results, the functional delta method is used to obtain central li…

    Submitted 14 June, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: 49 pages

  14. arXiv:2205.04283  [pdf, ps, other]

    math.ST math.PR

    Statistical inference with regularized optimal transport

    Authors: Ziv Goldfeld, Kengo Kato, Gabriel Rioux, Ritwik Sadhu

    Abstract: Optimal transport (OT) is a versatile framework for comparing probability measures, with many applications to statistics, machine learning, and applied mathematics. However, OT distances suffer from computational and statistical scalability issues to high dimensions, which motivated the study of regularized OT methods like slicing, smoothing, and entropic penalty. This work establishes a unified f…

    Submitted 7 June, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 71 pages

  15. arXiv:2203.00159  [pdf, ps, other]

    math.PR math.ST

    Limit distribution theory for smooth $p$-Wasserstein distances

    Authors: Ziv Goldfeld, Kengo Kato, Sloan Nietert, Gabriel Rioux

    Abstract: The Wasserstein distance is a metric on a space of probability measures that has seen a surge of applications in statistics, machine learning, and applied mathematics. However, statistical aspects of Wasserstein distances are bottlenecked by the curse of dimensionality, whereby the number of data points needed to accurately estimate them grows exponentially with dimension. Gaussian smoothing was r…

    Submitted 28 February, 2022; originally announced March 2022.

  16. arXiv:2110.03652  [pdf, ps, other]

    math.ST stat.ML

    Neural Estimation of Statistical Divergences

    Authors: Sreejith Sreekumar, Ziv Goldfeld

    Abstract: Statistical divergences (SDs), which quantify the dissimilarity between probability distributions, are a basic constituent of statistical inference and machine learning. A modern method for estimating those divergences relies on parametrizing an empirical variational form by a neural network (NN) and optimizing over parameter space. Such neural estimators are abundantly used in practice, but corre…

    Submitted 29 March, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

  17. arXiv:2107.13494  [pdf, ps, other]

    math.ST math.PR stat.ML

    Limit Distribution Theory for the Smooth 1-Wasserstein Distance with Applications

    Authors: Ritwik Sadhu, Ziv Goldfeld, Kengo Kato

    Abstract: The smooth 1-Wasserstein distance (SWD) $W_1^σ$ was recently proposed as a means to mitigate the curse of dimensionality in empirical approximation while preserving the Wasserstein structure. Indeed, SWD exhibits parametric convergence rates and inherits the metric and topological structure of the classic Wasserstein distance. Motivated by the above, this work conducts a thorough statistical study…

    Submitted 24 February, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

    MSC Class: 62E17; 60F05; 60F17; 62G10; 62F12; 62F40

  18. arXiv:2103.06923  [pdf, other]

    math.ST stat.ML

    Non-Asymptotic Performance Guarantees for Neural Estimation of $\mathsf{f}$-Divergences

    Authors: Sreejith Sreekumar, Zhengxin Zhang, Ziv Goldfeld

    Abstract: Statistical distances (SDs), which quantify the dissimilarity between probability distributions, are central to machine learning and statistics. A modern method for estimating such distances from data relies on parametrizing a variational form by a neural network (NN) and optimizing it. These estimators are abundantly used in practice, but corresponding performance guarantees are partial and call…

    Submitted 16 March, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

  19. arXiv:2101.04039  [pdf, other]

    math.ST stat.ML

    Smooth $p$-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications

    Authors: Sloan Nietert, Ziv Goldfeld, Kengo Kato

    Abstract: Discrepancy measures between probability distributions, often termed statistical distances, are ubiquitous in probability theory, statistics and machine learning. To combat the curse of dimensionality when estimating these distances from data, recent work has proposed smoothing out local irregularities in the measured distributions via convolution with a Gaussian kernel. Motivated by the scalabili…

    Submitted 17 December, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: updated to match ICML 2021 paper

  20. arXiv:2002.01013  [pdf, other]

    math.ST

    Limit Distribution for Smooth Total Variation and $χ^2$-Divergence in High Dimensions

    Authors: Ziv Goldfeld, Kengo Kato

    Abstract: Statistical divergences are ubiquitous in machine learning as tools for measuring discrepancy between probability distributions. As these applications inherently rely on approximating distributions from samples, we consider empirical approximation under two popular $f$-divergences: the total variation (TV) distance and the $χ^2$-divergence. To circumvent the sensitivity of these divergences to sup…

    Submitted 30 April, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

  21. arXiv:2002.01012  [pdf, ps, other]

    math.ST

    Asymptotic Guarantees for Generative Modeling Based on the Smooth Wasserstein Distance

    Authors: Ziv Goldfeld, Kristjan Greenewald, Kengo Kato

    Abstract: Minimum distance estimation (MDE) gained recent attention as a formulation of (implicit) generative modeling. It considers minimizing, over model parameters, a statistical distance between the empirical data distribution and the model. This formulation lends itself well to theoretical analysis, but typical results are hindered by the curse of dimensionality. To overcome this and devise a scalable…

    Submitted 19 October, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

  22. arXiv:2001.09206  [pdf, other]

    math.ST

    Gaussian-Smooth Optimal Transport: Metric Structure and Statistical Efficiency

    Authors: Ziv Goldfeld, Kristjan Greenewald

    Abstract: Optimal transport (OT), and in particular the Wasserstein distance, has seen a surge of interest and applications in machine learning. However, empirical approximation under Wasserstein distances suffers from a severe curse of dimensionality, rendering them impractical in high dimensions. As a result, entropically regularized OT has become a popular workaround. However, while it enjoys fast algori…

    Submitted 24 January, 2020; originally announced January 2020.

  23. arXiv:1905.13576  [pdf, other]

    math.ST cs.IT

    Convergence of Smoothed Empirical Measures with Applications to Entropy Estimation

    Authors: Ziv Goldfeld, Kristjan Greenewald, Yury Polyanskiy, Jonathan Weed

    Abstract: This paper studies convergence of empirical measures smoothed by a Gaussian kernel. Specifically, consider approximating $P\ast\mathcal{N}_σ$, for $\mathcal{N}_σ\triangleq\mathcal{N}(0,σ^2 \mathrm{I}_d)$, by $\hat{P}_n\ast\mathcal{N}_σ$, where $\hat{P}_n$ is the empirical measure, under different statistical distances. The convergence is examined in terms of the Wasserstein distance, total variati…

    Submitted 1 May, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.11589

  24. arXiv:1810.11589   

    math.ST

    Estimating Differential Entropy under Gaussian Convolutions

    Authors: Ziv Goldfeld, Kristjan Greenewald, Yury Polyanskiy

    Abstract: This paper studies the problem of estimating the differential entropy $h(S+Z)$, where $S$ and $Z$ are independent $d$-dimensional random variables with $Z\sim\mathcal{N}(0,σ^2 \mathrm{I}_d)$. The distribution of $S$ is unknown, but $n$ independently and identically distributed (i.i.d.) samples from it are available. The question is whether having access to samples of $S$ as opposed to samples of…

    Submitted 2 June, 2019; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: A significantly updated version with a different set of authors replaces this manuscript. New version available at arXiv:1905.13576