Skip to main content

Showing 1–25 of 25 results for author: Sriperumbudur, B K

Searching in archive math. Search in all archives.
.
  1. arXiv:2506.17366  [pdf, ps, other

    stat.ML cs.LG math.NA math.PR math.ST

    Gaussian Processes and Reproducing Kernels: Connections and Equivalences

    Authors: Motonobu Kanagawa, Philipp Hennig, Dino Sejdinovic, Bharath K. Sriperumbudur

    Abstract: This monograph studies the relations between two approaches using positive definite kernels: probabilistic methods using Gaussian processes, and non-probabilistic methods using reproducing kernel Hilbert spaces (RKHS). They are widely studied and used in machine learning, statistics, and numerical analysis. Connections and equivalences between them are reviewed for fundamental topics such as regre… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 172 pages

  2. arXiv:2502.20755  [pdf, other

    math.ST cs.LG stat.ML

    Minimax Optimal Kernel Two-Sample Tests with Random Features

    Authors: Soumya Mukherjee, Bharath K. Sriperumbudur

    Abstract: Reproducing Kernel Hilbert Space (RKHS) embedding of probability distributions has proved to be an effective approach, via MMD (maximum mean discrepancy) for nonparametric hypothesis testing problems involving distributions defined over general (non-Euclidean) domains. While a substantial amount of work has been done on this topic, only recently, minimax optimal two-sample tests have been construc… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 82 pages, 10 figures, 5 tables

    MSC Class: 62G10 (Primary) 65J20; 65J22; 46E22; 47A52 (Secondary)

  3. arXiv:2502.07369  [pdf, other

    stat.ML cs.LG math.ST

    Uniform Kernel Prober

    Authors: Soumya Mukherjee, Bharath K. Sriperumbudur

    Abstract: The ability to identify useful features or representations of the input data based on training data that achieves low prediction error on test data across multiple prediction tasks is considered the key to multitask learning success. In practice, however, one faces the issue of the choice of prediction tasks and the availability of test data from the chosen tasks while comparing the relative perfo… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 34 pages, 10 figures

  4. arXiv:2407.11800  [pdf, other

    math.AP math.OC stat.ML

    Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry

    Authors: Zhengxin Zhang, Ziv Goldfeld, Kristjan Greenewald, Youssef Mroueh, Bharath K. Sriperumbudur

    Abstract: The Wasserstein space of probability measures is known for its intricate Riemannian structure, which underpins the Wasserstein geometry and enables gradient flow algorithms. However, the Wasserstein geometry may not be suitable for certain tasks or data modalities. Motivated by scenarios where the global structure of the data needs to be preserved, this work initiates the study of gradient flows a… ▽ More

    Submitted 21 May, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 74 pages

  5. arXiv:2406.10005  [pdf, ps, other

    math.ST

    Optimal Rates for Functional Linear Regression with General Regularization

    Authors: Naveen Gupta, S. Sivananthan, Bharath K. Sriperumbudur

    Abstract: Functional linear regression is one of the fundamental and well-studied methods in functional data analysis. In this work, we investigate the functional linear regression model within the context of reproducing kernel Hilbert space by employing general spectral regularization to approximate the slope function with certain smoothness assumptions. We establish optimal convergence rates for estimatio… ▽ More

    Submitted 11 December, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2406.08401  [pdf, other

    stat.ML cs.LG math.ST

    Nyström Kernel Stein Discrepancy

    Authors: Florian Kalinke, Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the kernel Stein discrepancy (KSD), which combines Stein's method with the flexibility of kernel techniques, gained considerable attention. Through the Stein operator,… ▽ More

    Submitted 18 March, 2025; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Add limitations; accepted for publication at AISTATS 2025

    MSC Class: 46E22 (Primary) 62G10 (Secondary) ACM Class: G.3; I.2.6

  7. arXiv:2310.02607  [pdf, ps, other

    math.ST

    Convergence Analysis of Kernel Conjugate Gradient for Functional Linear Regression

    Authors: Naveen Gupta, S. Sivananthan, Bharath K. Sriperumbudur

    Abstract: In this paper, we discuss the convergence analysis of the conjugate gradient-based algorithm for the functional linear model in the reproducing kernel Hilbert space framework, utilizing early stopping results in regularization against over-fitting. We establish the convergence rates depending on the regularity condition of the slope function and the decay rate of the eigenvalues of the operator co… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    MSC Class: 62R10; 62G20; 65F22

  8. arXiv:2308.04561  [pdf, other

    math.ST stat.ML

    Spectral Regularized Kernel Goodness-of-Fit Tests

    Authors: Omar Hagrass, Bharath K. Sriperumbudur, Bing Li

    Abstract: Maximum mean discrepancy (MMD) has enjoyed a lot of success in many machine learning and statistical applications, including non-parametric hypothesis testing, because of its ability to handle non-Euclidean data. Recently, it has been demonstrated in Balasubramanian et al.(2021) that the goodness-of-fit test based on MMD is not minimax optimal while a Tikhonov regularized version of it is, for an… ▽ More

    Submitted 22 January, 2025; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 49 pages. arXiv admin note: text overlap with arXiv:2212.09201

    MSC Class: 62G10 (Primary); 65J20; 65J22; 46E22; 47A52 (Secondary)

    Journal ref: Journal of Machine Learning Research, 25 (309): 1-52, 2024

  9. arXiv:2306.17329  [pdf, ps, other

    stat.ML cs.LG math.ST

    Kernel $ε$-Greedy for Multi-Armed Bandits with Covariates

    Authors: Sakshi Arya, Bharath K. Sriperumbudur

    Abstract: We consider the $ε$-greedy strategy for the multi-arm bandit with covariates (MABC) problem, where the mean reward functions are assumed to lie in a reproducing kernel Hilbert space (RKHS). We propose to estimate the unknown mean reward functions using an online weighted kernel ridge regression estimator, and show the resultant estimator to be consistent under appropriate decay rates of the explor… ▽ More

    Submitted 1 June, 2025; v1 submitted 29 June, 2023; originally announced June 2023.

    MSC Class: 62L10; 62G05; 68T05

  10. arXiv:2212.12848  [pdf, other

    math.ST

    Gromov-Wasserstein Distances: Entropic Regularization, Duality, and Sample Complexity

    Authors: Zhengxin Zhang, Ziv Goldfeld, Youssef Mroueh, Bharath K. Sriperumbudur

    Abstract: The Gromov-Wasserstein (GW) distance, rooted in optimal transport (OT) theory, quantifies dissimilarity between metric measure spaces and provides a framework for aligning heterogeneous datasets. While computational aspects of the GW problem have been widely studied, a duality theory and fundamental statistical questions concerning empirical convergence rates remained obscure. This work closes the… ▽ More

    Submitted 28 September, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: 47 pages

  11. arXiv:2212.09201  [pdf, other

    math.ST cs.LG stat.ML

    Spectral Regularized Kernel Two-Sample Tests

    Authors: Omar Hagrass, Bharath K. Sriperumbudur, Bing Li

    Abstract: Over the last decade, an approach that has gained a lot of popularity to tackle nonparametric testing problems on general (i.e., non-Euclidean) domains is based on the notion of reproducing kernel Hilbert space (RKHS) embedding of probability distributions. The main goal of our work is to understand the optimality of two-sample tests constructed based on this approach. First, we show the popular M… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: 75 pages, to be published in the Annals of Statistics

    MSC Class: Primary: 62G10; Secondary: 65J20; 65J22; 46E22; 47A52

  12. arXiv:2211.07861  [pdf, other

    stat.ML cs.LG math.AP math.NA math.ST stat.CO

    Regularized Stein Variational Gradient Flow

    Authors: Ye He, Krishnakumar Balasubramanian, Bharath K. Sriperumbudur, Jianfeng Lu

    Abstract: The Stein Variational Gradient Descent (SVGD) algorithm is a deterministic particle method for sampling. However, a mean-field analysis reveals that the gradient flow corresponding to the SVGD algorithm (i.e., the Stein Variational Gradient Flow) only provides a constant-order approximation to the Wasserstein Gradient Flow corresponding to the KL-divergence minimization. In this work, we propose t… ▽ More

    Submitted 8 May, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

  13. arXiv:2207.06357  [pdf, ps, other

    math.ST stat.ME stat.ML

    Shrinkage Estimation of Higher Order Bochner Integrals

    Authors: Saiteja Utpala, Bharath K. Sriperumbudur

    Abstract: We consider shrinkage estimation of higher order Hilbert space valued Bochner integrals in a non-parametric setting. We propose estimators that shrink the $U$-statistic estimator of the Bochner integral towards a pre-specified target element in the Hilbert space. Depending on the degeneracy of the kernel of the $U$-statistic, we construct consistent shrinkage estimators with fast rates of converge… ▽ More

    Submitted 21 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: 33 pages; Under Review

    MSC Class: 62G05(Primary); 62F10; 62J07(Secondary)

  14. arXiv:2206.03975  [pdf, other

    math.ST

    Functional linear and single-index models: A unified approach via Gaussian Stein identity

    Authors: Krishnakumar Balasubramanian, Hans-Georg Müller, Bharath K. Sriperumbudur

    Abstract: Functional linear and single-index models are core regression methods in functional data analysis and are widely used for performing regression in a wide range of applications when the covariates are random functions coupled with scalar responses. In the existing literature, however, the construction of associated estimators and the study of their theoretical properties is invariably carried out o… ▽ More

    Submitted 26 March, 2024; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: To appear in Bernoulli Journal

  15. arXiv:2206.01795  [pdf, other

    math.ST cs.CG cs.LG math.AT stat.ML

    Adversarially Robust Topological Inference

    Authors: Siddharth Vishwanath, Bharath K. Sriperumbudur, Kenji Fukumizu, Satoshi Kuriki

    Abstract: The distance function to a compact set plays a crucial role in the paradigm of topological data analysis. In particular, the sublevel sets of the distance function are used in the computation of persistent homology -- a backbone of the topological data analysis pipeline. Despite its stability to perturbations in the Hausdorff distance, persistent homology is highly sensitive to outliers. In this w… ▽ More

    Submitted 28 March, 2025; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: 54 pages, 13 figures

    MSC Class: 62R40; 55N31; 68T09

  16. arXiv:2010.08071  [pdf, other

    math.ST

    Shrinkage Estimation for the Diagonal Multivariate Exponential Families

    Authors: Nikolas Siapoutis, Donald Richards, Bharath K. Sriperumbudur

    Abstract: We study shrinkage estimation of the mean parameters of a class of multivariate distributions for which the diagonal entries of the corresponding covariance matrix are certain quadratic functions of the mean parameter. This class of distributions includes the diagonal multivariate natural exponential families. We propose two classes of semi-parametric shrinkage estimators for the mean and construc… ▽ More

    Submitted 1 July, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 36 pages, 2 figures

    MSC Class: 62F12; 62H05 (Primary) 62J07; 62G05 (Secondary)

  17. arXiv:1912.01103  [pdf, ps, other

    math.ST stat.ML

    On Distance and Kernel Measures of Conditional Independence

    Authors: Tianhong Sheng, Bharath K. Sriperumbudur

    Abstract: Measuring conditional independence is one of the important tasks in statistical inference and is fundamental in causal discovery, feature selection, dimensionality reduction, Bayesian network learning, and others. In this work, we explore the connection between conditional independence measures induced by distances on a metric space and reproducing kernels associated with a reproducing kernel Hilb… ▽ More

    Submitted 17 August, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

  18. arXiv:1908.05818  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Gaussian Sketching yields a J-L Lemma in RKHS

    Authors: Samory Kpotufe, Bharath K. Sriperumbudur

    Abstract: The main contribution of the paper is to show that Gaussian sketching of a kernel-Gram matrix $\boldsymbol K$ yields an operator whose counterpart in an RKHS $\mathcal H$, is a \emph{random projection} operator---in the spirit of Johnson-Lindenstrauss (J-L) lemma. To be precise, given a random matrix $Z$ with i.i.d. Gaussian entries, we show that a sketch $Z\boldsymbol{K}$ corresponds to a particu… ▽ More

    Submitted 11 March, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: 16 pages

  19. arXiv:1902.01219  [pdf, ps, other

    math.ST cs.IT cs.LG stat.ML

    Local minimax rates for closeness testing of discrete distributions

    Authors: Joseph Lam-Weil, Alexandra Carpentier, Bharath K. Sriperumbudur

    Abstract: We consider the closeness testing problem for discrete distributions. The goal is to distinguish whether two samples are drawn from the same unspecified distribution, or whether their respective distributions are separated in $L_1$-norm. In this paper, we focus on adapting the rate to the shape of the underlying distributions, i.e. we consider \textit{a local minimax setting}. We provide, to the b… ▽ More

    Submitted 19 January, 2021; v1 submitted 1 February, 2019; originally announced February 2019.

    MSC Class: 62F03; 62G10; 62F35 ACM Class: G.3; I.2.6

  20. arXiv:1810.05207  [pdf, ps, other

    stat.ML cs.LG math.PR

    On Kernel Derivative Approximation with Random Fourier Features

    Authors: Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Random Fourier features (RFF) represent one of the most popular and wide-spread techniques in machine learning to scale up kernel algorithms. Despite the numerous successful applications of RFFs, unfortunately, quite little is understood theoretically on their optimality and limitations of their performance. Only recently, precise statistical-computational trade-offs have been established for RFFs… ▽ More

    Submitted 9 February, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: AISTATS-2019

    MSC Class: 60E10; 42Bxx; 46E22 ACM Class: G.3; I.2.6

  21. arXiv:1803.11451  [pdf, ps, other

    math.ST cs.IT stat.ML

    Minimax Estimation of Quadratic Fourier Functionals

    Authors: Shashank Singh, Bharath K. Sriperumbudur, Barnabás Póczos

    Abstract: We study estimation of (semi-)inner products between two nonparametric probability distributions, given IID samples from each distribution. These products include relatively well-studied classical $\mathcal{L}^2$ and Sobolev inner products, as well as those induced by translation-invariant reproducing kernels, for which we believe our results are the first. We first propose estimators for these qu… ▽ More

    Submitted 1 September, 2018; v1 submitted 30 March, 2018; originally announced March 2018.

  22. arXiv:1709.00147  [pdf, other

    math.NA stat.ML

    Convergence Analysis of Deterministic Kernel-Based Quadrature Rules in Misspecified Settings

    Authors: Motonobu Kanagawa, Bharath K. Sriperumbudur, Kenji Fukumizu

    Abstract: This paper presents a convergence analysis of kernel-based quadrature rules in misspecified settings, focusing on deterministic quadrature in Sobolev spaces. In particular, we deal with misspecified settings where a test integrand is less smooth than a Sobolev RKHS based on which a quadrature rule is constructed. We provide convergence guarantees based on two different assumptions on a quadrature… ▽ More

    Submitted 30 October, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: 36 pages

    MSC Class: 65D30 (Primary); 65D32; 65D05; 46E35; 46E22 (Secondary)

  23. arXiv:1506.02155  [pdf, ps, other

    math.ST cs.LG math.FA stat.ML

    Optimal Rates for Random Fourier Features

    Authors: Bharath K. Sriperumbudur, Zoltan Szabo

    Abstract: Kernel methods represent one of the most powerful tools in machine learning to tackle problems expressed in terms of function values and derivatives due to their capability to represent and model complex relations. While these methods show good versatility, they are computationally intensive and have poor scalability to large data as they require operations on Gram matrices. In order to mitigate t… ▽ More

    Submitted 4 November, 2015; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: To appear at NIPS-2015

    MSC Class: 60E10; 62Gxx; 62Exx; 62H12; 42Bxx; 46E22 ACM Class: G.3; I.2.6; F.2

  24. arXiv:1003.0887  [pdf, ps, other

    stat.ML math.ST

    Universality, Characteristic Kernels and RKHS Embedding of Measures

    Authors: Bharath K. Sriperumbudur, Kenji Fukumizu, Gert R. G. Lanckriet

    Abstract: A Hilbert space embedding for probability measures has recently been proposed, wherein any probability measure is represented as a mean element in a reproducing kernel Hilbert space (RKHS). Such an embedding has found applications in homogeneity testing, independence testing, dimensionality reduction, etc., with the requirement that the reproducing kernel is characteristic, i.e., the embedding i… ▽ More

    Submitted 3 March, 2010; originally announced March 2010.

    Comments: 30 pages, 1 figure

  25. arXiv:0907.5309  [pdf, ps, other

    stat.ML math.ST

    Hilbert space embeddings and metrics on probability measures

    Authors: Bharath K. Sriperumbudur, Arthur Gretton, Kenji Fukumizu, Bernhard Schölkopf, Gert R. G. Lanckriet

    Abstract: A Hilbert space embedding for probability measures has recently been proposed, with applications including dimensionality reduction, homogeneity testing, and independence testing. This embedding represents any probability measure as a mean element in a reproducing kernel Hilbert space (RKHS). A pseudometric on the space of probability measures can be defined as the distance between distribution… ▽ More

    Submitted 29 January, 2010; v1 submitted 30 July, 2009; originally announced July 2009.

    Comments: 48 pages