Monge-Kantorovich Fitting With Sobolev Budgets

Kobayashi, Forest; Hayase, Jonathan; Kim, Young-Heon

Computer Science > Machine Learning

arXiv:2409.16541 (cs)

[Submitted on 25 Sep 2024 (v1), last revised 29 Mar 2025 (this version, v2)]

Title:Monge-Kantorovich Fitting With Sobolev Budgets

Authors:Forest Kobayashi, Jonathan Hayase, Young-Heon Kim

View PDF

Abstract:Given $m < n$, we consider the problem of ``best'' approximating an $n\text{-d}$ probability measure $\rho$ via an $m\text{-d}$ measure $\nu$ such that $\mathrm{supp}\ \nu$ has bounded total ``complexity.'' When $\rho$ is concentrated near an $m\text{-d}$ set we may interpret this as a manifold learning problem with noisy data. However, we do not restrict our analysis to this case, as the more general formulation has broader applications.
We quantify $\nu$'s performance in approximating $\rho$ via the Monge-Kantorovich (also called Wasserstein) $p$-cost $\mathbb{W}_p^p(\rho, \nu)$, and constrain the complexity by requiring $\mathrm{supp}\ \nu$ to be coverable by an $f : \mathbb{R}^{m} \to \mathbb{R}^{n}$ whose $W^{k,q}$ Sobolev norm is bounded by $\ell \geq 0$. This allows us to reformulate the problem as minimizing a functional $\mathscr J_p(f)$ under the Sobolev ``budget'' $\ell$. This problem is closely related to (but distinct from) principal curves with length constraints when $m=1, k = 1$ and an unsupervised analogue of smoothing splines when $k > 1$. New challenges arise from the higher-order differentiability condition.
We study the ``gradient'' of $\mathscr J_p$, which is given by a certain vector field that we call the barycenter field, and use it to prove a nontrivial (almost) strict monotonicity result. We also provide a natural discretization scheme and establish its consistency. We use this scheme as a toy model for a generative learning task, and by analogy, propose novel interpretations for the role regularization plays in improving training.

Comments:	Expanded abstract and §6; added conclusion (§7); minor correction to implementation of constraint gradient in §5.3.2; removed unused references; misc typo corrections. 69 pages, 51 pages without figures
Subjects:	Machine Learning (cs.LG); Analysis of PDEs (math.AP)
MSC classes:	49Q10 (Primary), 49Q20, 49Q22, 65D10, 68T01 (Secondary)
Report number:	PIMS-20240923-PRN01
Cite as:	arXiv:2409.16541 [cs.LG]
	(or arXiv:2409.16541v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.16541

Submission history

From: Forest Kobayashi [view email]
[v1] Wed, 25 Sep 2024 01:30:16 UTC (25,903 KB)
[v2] Sat, 29 Mar 2025 21:57:44 UTC (8,124 KB)

Computer Science > Machine Learning

Title:Monge-Kantorovich Fitting With Sobolev Budgets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Monge-Kantorovich Fitting With Sobolev Budgets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators