Skip to main content

Showing 1–22 of 22 results for author: Katsoulakis, M A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.12097  [pdf, ps, other

    math.OC math.PR stat.ME stat.ML

    Proximal optimal transport divergences

    Authors: Ricardo Baptista, Panagiota Birmpa, Markos A. Katsoulakis, Luc Rey-Bellet, Benjamin J. Zhang

    Abstract: We introduce proximal optimal transport divergence, a novel discrepancy measure that interpolates between information divergences and optimal transport distances via an infimal convolution formulation. This divergence provides a principled foundation for optimal transport proximals and proximal optimization methods frequently used in generative modeling. We explore its mathematical properties, inc… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  2. arXiv:2410.01244  [pdf, other

    stat.ML cs.LG

    Equivariant score-based generative models provably learn distributions with symmetries efficiently

    Authors: Ziyu Chen, Markos A. Katsoulakis, Benjamin J. Zhang

    Abstract: Symmetry is ubiquitous in many real-world phenomena and tasks, such as physics, images, and molecular simulations. Empirical studies have demonstrated that incorporating symmetries into generative models can provide better generalization and sampling efficiency when the underlying data distribution has group symmetry. In this work, we provide the first theoretical analysis and guarantees of score-… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  3. arXiv:2407.11901  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Combining Wasserstein-1 and Wasserstein-2 proximals: robust manifold learning via well-posed generative flows

    Authors: Hyemin Gu, Markos A. Katsoulakis, Luc Rey-Bellet, Benjamin J. Zhang

    Abstract: We formulate well-posed continuous-time generative flows for learning distributions that are supported on low-dimensional manifolds through Wasserstein proximal regularizations of $f$-divergences. Wasserstein-1 proximal operators regularize $f$-divergences so that singular distributions can be compared. Meanwhile, Wasserstein-2 proximal operators regularize the paths of the generative flows by add… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  4. arXiv:2405.15754  [pdf, ps, other

    stat.ML cs.LG math.ST

    Score-based generative models are provably robust: an uncertainty quantification perspective

    Authors: Nikiforos Mimikos-Stamatopoulos, Benjamin J. Zhang, Markos A. Katsoulakis

    Abstract: Through an uncertainty quantification (UQ) perspective, we show that score-based generative models (SGMs) are provably robust to the multiple sources of error in practical implementation. Our primary tool is the Wasserstein uncertainty propagation (WUP) theorem, a model-form UQ bound that describes how the $L^2$ error from learning the score function propagates to a Wasserstein-1 ($\mathbf{d}_1$)… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.15625  [pdf, ps, other

    stat.ML cs.LG

    Nonlinear denoising score matching for enhanced learning of structured distributions

    Authors: Jeremiah Birrell, Markos A. Katsoulakis, Luc Rey-Bellet, Benjamin J. Zhang, Wei Zhu

    Abstract: We present a novel method for training score-based generative models which uses nonlinear noising dynamics to improve learning of structured distributions. Generalizing to a nonlinear drift allows for additional structure to be incorporated into the dynamics, thus making the training better adapted to the data, e.g., in the case of multimodality or (approximate) symmetries. Such structure can be o… ▽ More

    Submitted 8 July, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 16 pages, 8 figures

  6. arXiv:2405.13962  [pdf, other

    stat.ML cs.LG

    Robust Generative Learning with Lipschitz-Regularized $α$-Divergences Allows Minimal Assumptions on Target Distributions

    Authors: Ziyu Chen, Hyemin Gu, Markos A. Katsoulakis, Luc Rey-Bellet, Wei Zhu

    Abstract: This paper demonstrates the robustness of Lipschitz-regularized $α$-divergences as objective functionals in generative modeling, showing they enable stable learning across a wide range of target distributions with minimal assumptions. We establish that these divergences remain finite under a mild condition-that the source distribution has a finite first moment-regardless of the properties of the t… ▽ More

    Submitted 23 November, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 43 pages, 6 figures and 2 tables in the main text

  7. arXiv:2402.06162  [pdf, other

    stat.ML cs.LG

    Wasserstein proximal operators describe score-based generative models and resolve memorization

    Authors: Benjamin J. Zhang, Siting Liu, Wuchen Li, Markos A. Katsoulakis, Stanley J. Osher

    Abstract: We focus on the fundamental mathematical structure of score-based generative models (SGMs). We first formulate SGMs in terms of the Wasserstein proximal operator (WPO) and demonstrate that, via mean-field games (MFGs), the WPO formulation reveals mathematical structure that describes the inductive bias of diffusion and score-based models. In particular, MFGs yield optimality conditions in the form… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  8. arXiv:2305.13517  [pdf, other

    stat.ML cs.LG

    Statistical Guarantees of Group-Invariant GANs

    Authors: Ziyu Chen, Markos A. Katsoulakis, Luc Rey-Bellet, Wei Zhu

    Abstract: This work presents the first statistical performance guarantees for group-invariant generative models. Many real data, such as images and molecules, are invariant to certain group symmetries, which can be taken advantage of to learn more efficiently as we rigorously demonstrate in this work. Here we specifically study generative adversarial networks (GANs), and quantify the gains when incorporatin… ▽ More

    Submitted 10 March, 2025; v1 submitted 22 May, 2023; originally announced May 2023.

    MSC Class: 62E10; 62E17; 60-08

  9. arXiv:2304.13534  [pdf, other

    stat.ML cs.LG

    A mean-field games laboratory for generative modeling

    Authors: Benjamin J. Zhang, Markos A. Katsoulakis

    Abstract: We demonstrate the versatility of mean-field games (MFGs) as a mathematical framework for explaining, enhancing, and designing generative models. In generative flows, a Lagrangian formulation is used where each particle (generated sample) aims to minimize a loss function over its simulated path. The loss, however, is dependent on the paths of other particles, which leads to a competition among the… ▽ More

    Submitted 24 October, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 56 pages, 10 figures. Version 5 has a slightly modified version of the normalizing flow and improved introduction and conclusions

  10. arXiv:2302.01915  [pdf, other

    math.ST stat.ML

    Sample Complexity of Probability Divergences under Group Symmetry

    Authors: Ziyu Chen, Markos A. Katsoulakis, Luc Rey-Bellet, Wei Zhu

    Abstract: We rigorously quantify the improvement in the sample complexity of variational divergence estimations for group-invariant distributions. In the cases of the Wasserstein-1 metric and the Lipschitz-regularized $α$-divergences, the reduction of sample complexity is proportional to the group size if the group is finite. In addition to the published version at ICML 2023, our proof indeed has included t… ▽ More

    Submitted 22 November, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: In addition to our published version at ICML 2023, we include the case when the group is infinite such as compact Lie groups. Our approach is different from that in [Tahmasebi & Jegelka, ICML 2024] and our work also applies to asymmetric divergences, such as the Lipschitz-regularized $α$-divergences

  11. arXiv:2210.17230  [pdf, other

    stat.ML cs.LG

    Lipschitz-regularized gradient flows and generative particle algorithms for high-dimensional scarce data

    Authors: Hyemin Gu, Panagiota Birmpa, Yannis Pantazis, Luc Rey-Bellet, Markos A. Katsoulakis

    Abstract: We build a new class of generative algorithms capable of efficiently learning an arbitrary target distribution from possibly scarce, high-dimensional data and subsequently generate new samples. These generative algorithms are particle-based and are constructed as gradient flows of Lipschitz-regularized Kullback-Leibler or other $f$-divergences, where data from a source distribution can be stably t… ▽ More

    Submitted 27 August, 2024; v1 submitted 31 October, 2022; originally announced October 2022.

    MSC Class: 35Q84; 49Q22; 62B10; 65C35; 68T07; 94A17

  12. arXiv:2210.04974  [pdf, ps, other

    stat.ML cs.LG

    Function-space regularized Rényi divergences

    Authors: Jeremiah Birrell, Yannis Pantazis, Paul Dupuis, Markos A. Katsoulakis, Luc Rey-Bellet

    Abstract: We propose a new family of regularized Rényi divergences parametrized not only by the order $α$ but also by a variational function space. These new objects are defined by taking the infimal convolution of the standard Rényi divergence with the integral probability metric (IPM) associated with the chosen function space. We derive a novel dual variational representation that can be used to construct… ▽ More

    Submitted 14 February, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 24 pages, 4 figures

  13. arXiv:2202.01129  [pdf, other

    cs.LG math.PR stat.ML

    Structure-preserving GANs

    Authors: Jeremiah Birrell, Markos A. Katsoulakis, Luc Rey-Bellet, Wei Zhu

    Abstract: Generative adversarial networks (GANs), a class of distribution-learning methods based on a two-player game between a generator and a discriminator, can generally be formulated as a minmax problem based on the variational representation of a divergence between the unknown and the generated distributions. We introduce structure-preserving GANs as a data-efficient framework for learning distribution… ▽ More

    Submitted 17 June, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 39 pages, 16 figures

  14. arXiv:2107.08179  [pdf, other

    stat.ML cs.IT cs.LG math.PR

    Model Uncertainty and Correctability for Directed Graphical Models

    Authors: Panagiota Birmpa, Jinchao Feng, Markos A. Katsoulakis, Luc Rey-Bellet

    Abstract: Probabilistic graphical models are a fundamental tool in probabilistic modeling, machine learning and artificial intelligence. They allow us to integrate in a natural way expert knowledge, physical modeling, heterogeneous and correlated data and quantities of interest. For exactly this reason, multiple sources of model uncertainty are inherent within the modular structure of the graphical model. I… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

    MSC Class: 62H22; 62P30; 68T37; 80A30; 93B35; 94A17

  15. arXiv:2011.05953  [pdf, ps, other

    stat.ML cs.LG

    $(f,Γ)$-Divergences: Interpolating between $f$-Divergences and Integral Probability Metrics

    Authors: Jeremiah Birrell, Paul Dupuis, Markos A. Katsoulakis, Yannis Pantazis, Luc Rey-Bellet

    Abstract: We develop a rigorous and general framework for constructing information-theoretic divergences that subsume both $f$-divergences and integral probability metrics (IPMs), such as the $1$-Wasserstein distance. We prove under which assumptions these divergences, hereafter referred to as $(f,Γ)$-divergences, provide a notion of `distance' between probability measures and show that they can be expresse… ▽ More

    Submitted 15 September, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: 49 pages

  16. arXiv:2009.04570  [pdf, other

    cs.LG math.NA stat.ML

    Mutual Information for Explainable Deep Learning of Multiscale Systems

    Authors: Søren Taverniers, Eric J. Hall, Markos A. Katsoulakis, Daniel M. Tartakovsky

    Abstract: Timely completion of design cycles for complex systems ranging from consumer electronics to hypersonic vehicles relies on rapid simulation-based prototyping. The latter typically involves high-dimensional spaces of possibly correlated control variables (CVs) and quantities of interest (QoIs) with non-Gaussian and possibly multimodal distributions. We develop a model-agnostic, moment-independent gl… ▽ More

    Submitted 19 May, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: 27 pages, 8 figures. Added additional examples

    MSC Class: 93B35 (Primary) 68T07; 62R07 (Secondary)

  17. arXiv:2009.00038  [pdf, other

    stat.ML cs.IT cs.LG math.PR

    Uncertainty quantification for Markov Random Fields

    Authors: Panagiota Birmpa, Markos A. Katsoulakis

    Abstract: We present an information-based uncertainty quantification method for general Markov Random Fields. Markov Random Fields (MRF) are structured, probabilistic graphical models over undirected graphs, and provide a fundamental unifying modeling tool for statistical mechanics, probabilistic machine learning, and artificial intelligence. Typically MRFs are complex and high-dimensional with nodes and ed… ▽ More

    Submitted 17 July, 2021; v1 submitted 31 August, 2020; originally announced September 2020.

    MSC Class: 62H22; 82B20; 94A17

  18. arXiv:2007.03814  [pdf, ps, other

    stat.ML cs.IT cs.LG math.PR

    Variational Representations and Neural Network Estimation of Rényi Divergences

    Authors: Jeremiah Birrell, Paul Dupuis, Markos A. Katsoulakis, Luc Rey-Bellet, Jie Wang

    Abstract: We derive a new variational formula for the Rényi family of divergences, $R_α(Q\|P)$, between probability measures $Q$ and $P$. Our result generalizes the classical Donsker-Varadhan variational formula for the Kullback-Leibler divergence. We further show that this Rényi variational formula holds over a range of function spaces; this leads to a formula for the optimizer under very weak assumptions… ▽ More

    Submitted 20 July, 2021; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: 24 pages, 2 figures

    MSC Class: 94A17; 62B10; 62G05

  19. arXiv:2007.00340  [pdf, other

    stat.ME math.PR physics.data-an

    Data-driven Uncertainty Quantification for Systematic Coarse-grained Models

    Authors: Tangxin Jin, Anthony Chazirakis, Evangelia Kalligiannaki, Vagelis Harmandaris, Markos A. Katsoulakis

    Abstract: In this work, we present methodologies for the quantification of confidence in bottom-up coarse-grained models for molecular and macromolecular systems. Coarse-graining methods have been extensively used in the past decades in order to extend the length and time scales accessible by simulation methodologies. The quantification, though, of induced errors due to the limited availability of fine-grai… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  20. arXiv:2006.14807  [pdf, other

    physics.comp-ph math.NA stat.ML

    GINNs: Graph-Informed Neural Networks for Multiscale Physics

    Authors: Eric J. Hall, Søren Taverniers, Markos A. Katsoulakis, Daniel M. Tartakovsky

    Abstract: We introduce the concept of a Graph-Informed Neural Network (GINN), a hybrid approach combining deep learning with probabilistic graphical models (PGMs) that acts as a surrogate for physics-based representations of multiscale and multiphysics systems. GINNs address the twin challenges of removing intrinsic computational bottlenecks in physics-based models and generating large data sets for estimat… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: 20 pages, 8 figures

    MSC Class: 68T07 (Primary) 62H22; 35R60 (Secondary)

  21. arXiv:2006.08781  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Optimizing Variational Representations of Divergences and Accelerating their Statistical Estimation

    Authors: Jeremiah Birrell, Markos A. Katsoulakis, Yannis Pantazis

    Abstract: Variational representations of divergences and distances between high-dimensional probability distributions offer significant theoretical insights and practical advantages in numerous research areas. Recently, they have gained popularity in machine learning as a tractable and scalable approach for training probabilistic models and for statistically differentiating between data distributions. Their… ▽ More

    Submitted 23 March, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 48 pages, 6 figures

  22. Causality and Bayesian network PDEs for multiscale representations of porous media

    Authors: Kimoon Um, Eric Joseph Hall, Markos A. Katsoulakis, Daniel M. Tartakovsky

    Abstract: Microscopic (pore-scale) properties of porous media affect and often determine their macroscopic (continuum- or Darcy-scale) counterparts. Understanding the relationship between processes on these two scales is essential to both the derivation of macroscopic models of, e.g., transport phenomena in natural porous media, and the design of novel materials, e.g., for energy storage. Most microscopic p… ▽ More

    Submitted 6 January, 2019; originally announced January 2019.

    Comments: 23 pages, 11 figures and 5 tables

    Journal ref: Journal of Computational Physics 394 (2019) 658--678