Skip to main content

Showing 1–28 of 28 results for author: Mukherjee, S S

Searching in archive math. Search in all archives.
.
  1. arXiv:2509.03048  [pdf, ps, other

    math.PR math-ph

    Elephant random walks on infinite Cayley trees

    Authors: Soumendu Sundar Mukherjee

    Abstract: In this article, we initiate the study of elephant random walks on finitely generated infinite groups whose Cayley graphs are homogeneous trees of degree $d \ge 3$ (e.g., groups of the form $\mathbb{Z}^{* d_1} * \mathbb{Z}_2^{*d_2}$ with $2d_1 + d_2 \ge 3$). We show that the asymptotic speed of the walk does not depend on the memory parameter $p \in [0, 1)$ and equals $\frac{d - 2}{d}$, the asympt… ▽ More

    Submitted 3 September, 2025; originally announced September 2025.

    Comments: 12 pages, 1 figure

    MSC Class: 60G50; 82C41; 60K99; 60G42

  2. arXiv:2507.15097  [pdf, ps, other

    stat.ML cs.LG math.ST stat.ME

    Learning under Latent Group Sparsity via Diffusion on Networks

    Authors: Subhroshekhar Ghosh, Soumendu Sundar Mukherjee

    Abstract: Group or cluster structure on explanatory variables in machine learning problems is a very general phenomenon, which has attracted broad interest from practitioners and theoreticians alike. In this work we contribute an approach to sparse learning under such group structure, that does not require prior information on the group identities. Our paradigm is motivated by the Laplacian geometry of an u… ▽ More

    Submitted 20 July, 2025; originally announced July 2025.

    Comments: 49 pages, 4 figures, 2 tables; this submission subsumes the earlier preprint arXiv:2201.08326

  3. arXiv:2505.10555  [pdf, ps, other

    math.PR

    Spectra of contractions of the Gaussian Orthogonal Tensor Ensemble

    Authors: Soumendu Sundar Mukherjee, Himasish Talukdar

    Abstract: In this article, we study the spectra of matrix-valued contractions of the Gaussian Orthogonal Tensor Ensemble (GOTE). Let $\mathcal{G}$ denote a random tensor of order $r$ and dimension $n$ drawn from the density \[ f(\mathcal{G}) \propto \exp\bigg(-\frac{1}{2r}\|\mathcal{G}\|^2_{\mathrm{F}}\bigg). \] For $\mathbf{w} \in \mathbb{S}^{n - 1}$, the unit-sphere in $\mathbb{R}^n$, we consider the ma… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 45 pages, 1 figure; abstract shortened to meet arXiv requirements

  4. arXiv:2504.07720  [pdf, ps, other

    eess.SP math.AT

    Filtering through a topological lens: homology for point processes on the time-frequency plane

    Authors: Juan Manuel Miramont, Kin Aun Tan, Soumendu Sundar Mukherjee, Rémi Bardenet, Subhroshekhar Ghosh

    Abstract: We introduce a very general approach to the analysis of signals from their noisy measurements from the perspective of Topological Data Analysis (TDA). While TDA has emerged as a powerful analytical tool for data with pronounced topological structures, here we demonstrate its applicability for general problems of signal processing, without any a-priori geometric feature. Our methods are well-suited… ▽ More

    Submitted 25 July, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

  5. arXiv:2412.19802  [pdf, other

    stat.ML cs.LG math.PR math.ST stat.ME

    A new approach to locally adaptive polynomial regression

    Authors: Sabyasachi Chatterjee, Subhajit Goswami, Soumendu Sundar Mukherjee

    Abstract: Adaptive bandwidth selection is a fundamental challenge in nonparametric regression. This paper introduces a new bandwidth selection procedure inspired by the optimality criteria for $\ell_0$-penalized regression. Although similar in spirit to Lepski's method and its variants in selecting the largest interval satisfying an admissibility criterion, our approach stems from a distinct philosophy, uti… ▽ More

    Submitted 20 May, 2025; v1 submitted 27 December, 2024; originally announced December 2024.

    Comments: 29 pages, 4 figures; in this version, the title has been updated and the exposition significantly expanded

  6. arXiv:2409.11381  [pdf, other

    math.PR math-ph math.CO math.ST

    Edge spectra of Gaussian random symmetric matrices with correlated entries

    Authors: Debapratim Banerjee, Soumendu Sundar Mukherjee, Dipranjan Pal

    Abstract: We study the largest eigenvalue of a Gaussian random symmetric matrix $X_n$, with zero-mean, unit variance entries satisfying the condition $\sup_{(i, j) \ne (i', j')}|\mathbb{E}[X_{ij} X_{i'j'}]| = O(n^{-(1 + \varepsilon)})$, where $\varepsilon > 0$. It follows from Catalano et al. (2024) that the empirical spectral distribution of $n^{-1/2} X_n$ converges weakly almost surely to the standard sem… ▽ More

    Submitted 7 February, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: 27 pages, 2 figures; abstract shortened to meet arXiv requirements

  7. arXiv:2409.03756  [pdf, other

    math.PR math.CO

    Spectra of adjacency and Laplacian matrices of Erdős-Rényi hypergraphs

    Authors: Soumendu Sundar Mukherjee, Dipranjan Pal, Himasish Talukdar

    Abstract: We study adjacency and Laplacian matrices of Erdős-Rényi $r$-uniform hypergraphs on $n$ vertices with hyperedge inclusion probability $p$, in the setting where $r$ can vary with $n$ such that $r / n \to c \in [0, 1)$. Adjacency matrices of hypergraphs are contractions of adjacency tensors and their entries exhibit long range correlations. We show that under the Erdős-Rényi model, the expected empi… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  8. arXiv:2409.02911  [pdf, other

    math.ST math.PR

    Bulk Spectra of Truncated Sample Covariance Matrices

    Authors: Subhroshekhar Ghosh, Soumendu Sundar Mukherjee, Himasish Talukdar

    Abstract: Determinantal Point Processes (DPPs), which originate from quantum and statistical physics, are known for modelling diversity. Recent research [Ghosh and Rigollet (2020)] has demonstrated that certain matrix-valued $U$-statistics (that are truncated versions of the usual sample covariance matrix) can effectively estimate parameters in the context of Gaussian DPPs and enhance dimension reduction te… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 26 pages, 2 figures

  9. arXiv:2312.12428  [pdf, other

    math.PR cond-mat.dis-nn math-ph math.NT

    The "visible" Wigner matrix

    Authors: Arup Bose, Soumendu Sundar Mukherjee

    Abstract: We consider the ``visible'' Wigner matrix, a Wigner matrix whose $(i, j)$-th entry is coerced to zero if $i, j$ are co-prime. Using a recent result from elementary number theory on co-primality patterns in integers, we show that the limiting spectral distribution of this matrix exists, and give explicit descriptions of its moments in terms of infinite products over primes $p$ of certain polynomial… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 12 pages, 5 figures, 1 table

    MSC Class: 60B20

  10. arXiv:2312.07839  [pdf, ps, other

    math.ST cs.LG math.PR stat.ML

    Minimax-optimal estimation for sparse multi-reference alignment with collision-free signals

    Authors: Subhro Ghosh, Soumendu Sundar Mukherjee, Jing Bin Pan

    Abstract: The Multi-Reference Alignment (MRA) problem aims at the recovery of an unknown signal from repeated observations under the latent action of a group of cyclic isometries, in the presence of additive noise of high intensity $σ$. It is a more tractable version of the celebrated cryo EM model. In the crucial high noise regime, it is known that its sample complexity scales as $σ^6$. Recent investigatio… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  11. arXiv:2309.10864  [pdf, other

    stat.ME math.PR math.ST physics.soc-ph

    A dynamic mean-field statistical model of academic collaboration

    Authors: Soumendu Sundar Mukherjee, Tamojit Sadhukhan, Shirshendu Chatterjee

    Abstract: There is empirical evidence that collaboration in academia has increased significantly during the past few decades, perhaps due to the breathtaking advancements in communication and technology during this period. Multi-author articles have become more frequent than single-author ones. Interdisciplinary collaboration is also on the rise. Although there have been several studies on the dynamical asp… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 27 pages, 20 figures

  12. arXiv:2308.02344  [pdf, ps, other

    math.ST cs.LG stat.CO stat.ME stat.ML

    Learning Networks from Gaussian Graphical Models and Gaussian Free Fields

    Authors: Subhro Ghosh, Soumendu Sundar Mukherjee, Hoang-Son Tran, Ujan Gangopadhyay

    Abstract: We investigate the problem of estimating the structure of a weighted network from repeated measurements of a Gaussian Graphical Model (GGM) on the network. In this vein, we consider GGMs whose covariance structures align with the geometry of the weighted network on which they are based. Such GGMs have been of longstanding interest in statistical physics, and are referred to as the Gaussian Free Fi… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  13. arXiv:2307.12982  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Consistent model selection in the spiked Wigner model via AIC-type criteria

    Authors: Soumendu Sundar Mukherjee

    Abstract: Consider the spiked Wigner model \[ X = \sum_{i = 1}^k λ_i u_i u_i^\top + σG, \] where $G$ is an $N \times N$ GOE random matrix, and the eigenvalues $λ_i$ are all spiked, i.e. above the Baik-Ben Arous-Péché (BBP) threshold $σ$. We consider AIC-type model selection criteria of the form \[ -2 \, (\text{maximised log-likelihood}) + γ\, (\text{number of parameters}) \] for estimating the number… ▽ More

    Submitted 7 February, 2025; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 25 pages, 2 figures, 5 tables

  14. arXiv:2304.01145  [pdf, other

    math.PR math.CO

    On a generalisation of the coupon collector problem

    Authors: Siva Athreya, Satyaki Mukherjee, Soumendu Sundar Mukherjee

    Abstract: We consider a generalisation of the classical coupon collector problem. We define a super-coupon to be any $s$-subset of a universe of $n$ coupons. In each round, a random $r$-subset from the universe is drawn and all its $s$-subsets are marked as collected. We show that the time to collect all super-coupons is $\binom{r}{s}^{-1}\binom{n}{s} \log \binom{n}{s}(1 + o(1))$ on average and has a Gumbel… ▽ More

    Submitted 12 September, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 16 pages, 4 figures

  15. arXiv:2302.12693  [pdf, ps, other

    cs.LG math.ST stat.ML

    Wasserstein Projection Pursuit of Non-Gaussian Signals

    Authors: Satyaki Mukherjee, Soumendu Sundar Mukherjee, Debarghya Ghoshdastidar

    Abstract: We consider the general dimensionality reduction problem of locating in a high-dimensional data cloud, a $k$-dimensional non-Gaussian subspace of interesting features. We use a projection pursuit approach -- we search for mutually orthogonal unit directions which maximise the 2-Wasserstein distance of the empirical distribution of data-projections along these directions from a standard Gaussian. U… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  16. arXiv:2208.01365  [pdf, other

    math.ST math.PR stat.ML

    Concentration inequalities for correlated network-valued processes with applications to community estimation and changepoint analysis

    Authors: Sayak Chatterjee, Shirshendu Chatterjee, Soumendu Sundar Mukherjee, Anirban Nath, Sharmodeep Bhattacharyya

    Abstract: Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time s… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 27 pages, 4 figures

  17. arXiv:2201.08326  [pdf, other

    stat.ME cs.LG econ.EM math.ST stat.CO stat.ML

    Learning with latent group sparsity via heat flow dynamics on networks

    Authors: Subhroshekhar Ghosh, Soumendu Sundar Mukherjee

    Abstract: Group or cluster structure on explanatory variables in machine learning problems is a very general phenomenon, which has attracted broad interest from practitioners and theoreticians alike. In this work we contribute an approach to learning under such group structure, that does not require prior information on the group identities. Our paradigm is motivated by the Laplacian geometry of an underlyi… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 36 pages, 3 figures, 3 tables

  18. arXiv:2102.05839  [pdf, ps, other

    math.PR

    Distribution of Eigenvalues of Matrix Ensembles arising from Wigner and Palindromic Toeplitz Blocks

    Authors: Keller Blackwell, Neelima Borade, Arup Bose, Charles Devlin VI, Noah Luntzlara, Renyuan Ma, Steven J. Miller, Soumendu Sundar Mukherjee, Mengxi Wang, Wanqiao Xu

    Abstract: Random Matrix Theory (RMT) has successfully modeled diverse systems, from energy levels of heavy nuclei to zeros of $L$-functions; this correspondence has allowed RMT to successfully predict many number theoretic behaviors. However there are some operations which to date have no RMT analogue. Our motivation is to find an RMT analogue of Rankin-Selberg convolution, which constructs a new $L$-functi… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: 14 pages, 5 figures. arXiv admin note: text overlap with arXiv:1908.03834

    MSC Class: 15A52 (primary); 60F99; 62H10 (secondary)

  19. arXiv:2101.04105  [pdf, other

    math.PR math.OA math.ST

    Some characterization results on classical and free Poisson thinning

    Authors: Soumendu Sundar Mukherjee

    Abstract: Poisson thinning is an elementary result in probability, which is of great importance in the theory of Poisson point processes. In this article, we record a couple of characterization results on Poisson thinning. We also consider several free probability analogues of Poisson thinning, which we collectively dub as \emph{free Poisson thinning}, and prove characterization results for them, similar to… ▽ More

    Submitted 4 September, 2022; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: 19 pages, 1 figure, to appear in RMTA

    MSC Class: 46L54; 60E05; 62E10

  20. arXiv:2011.04470  [pdf, other

    math.ST stat.ME

    High dimensional PCA: a new model selection criterion

    Authors: Abhinav Chakraborty, Soumendu Sundar Mukherjee, Arijit Chakrabarti

    Abstract: Given a random sample from a multivariate population, estimating the number of large eigenvalues of the population covariance matrix is an important problem in Statistics with wide applications in many areas. In the context of Principal Component Analysis (PCA), the linear combinations of the original variables having the largest amounts of variation are determined by this number. In this paper, w… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 37 pages, 6 figures, 2 tables

    MSC Class: 62H12; 62H25

  21. arXiv:2008.05916  [pdf, other

    math.PR math.OA

    On $*$-Convergence of Schur-Hadamard Products of Independent Nonsymmetric Random Matrices

    Authors: Soumendu Sundar Mukherjee

    Abstract: Let $\{x_α\}_{α\in \mathbb{Z}}$ and $\{y_α\}_{α\in \mathbb{Z}}$ be two independent collections of zero mean, unit variance random variables with uniformly bounded moments of all orders. Consider a nonsymmetric Toeplitz matrix $X_n = ((x_{i - j}))_{1 \le i, j \le n}$ and a Hankel matrix $Y_n = ((y_{i + j}))_{1 \le i, j \le n}$, and let $M_n = X_n \odot Y_n$ be their elementwise/Schur-Hadamard produ… ▽ More

    Submitted 5 September, 2022; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: 18 pages, 2 figures, to appear in IMRN

    MSC Class: 46L54; 60B20

  22. arXiv:2007.10989  [pdf, ps, other

    math.OA math.CO math.PR

    Construction of product $*$-probability spaces via free cumulants

    Authors: Arup Bose, Soumendu Sundar Mukherjee

    Abstract: It is well known that free independence is equivalent to the vanishing of mixed free cumulants. The purpose of this short note is to build free products of $*$-probability spaces using this as the definition of freeness and relying on free cumulants instead of moments.

    Submitted 7 February, 2025; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: 8 pages

    MSC Class: 46L54

  23. arXiv:1905.06661  [pdf, other

    math.ST

    When random initializations help: a study of variational inference for community detection

    Authors: Purnamrita Sarkar, Y. X. Rachel Wang, Soumendu Sundar Mukherjee

    Abstract: Variational approximation has been widely used in large-scale Bayesian inference recently, the simplest kind of which involves imposing a mean field assumption to approximate complicated latent structures. Despite the computational scalability of mean field, theoretical studies of its loss function surface and the convergence behavior of iterative updates for optimizing the loss are far from compl… ▽ More

    Submitted 18 May, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: 32 pages, 5 figures

  24. arXiv:1708.05573  [pdf, other

    stat.ML math.ST stat.CO stat.ME

    Two provably consistent divide and conquer clustering algorithms for large networks

    Authors: Soumendu Sundar Mukherjee, Purnamrita Sarkar, Peter J. Bickel

    Abstract: In this article, we advance divide-and-conquer strategies for solving the community detection problem in networks. We propose two algorithms which perform clustering on a number of small subgraphs and finally patches the results into a single clustering. The main advantage of these algorithms is that they bring down significantly the computational cost of traditional algorithms, including spectral… ▽ More

    Submitted 18 August, 2017; originally announced August 2017.

    Comments: 41 pages, comments are most welcome

  25. arXiv:1408.0874  [pdf, other

    math.PR

    Limiting spectral distribution of a class of Hankel type random matrices

    Authors: Anirban Basak, Arup Bose, Soumendu Sundar Mukherjee

    Abstract: We consider an indexed class of real symmetric random matrices which generalize the symmetric Hankel and Reverse Circulant matrices. We show that the limiting spectral distributions of these matrices exist almost surely and the limit is continuous in the index. We also study other properties of the limit.

    Submitted 5 August, 2014; originally announced August 2014.

    Comments: 19 pages, 2 figures, 1 table

    MSC Class: Primary 15B52; 60B20; secondary 60B10; 60F99; 60B99

  26. arXiv:1402.3683  [pdf, other

    math.PR

    Bulk behaviour of skew-symmetric patterned random matrices

    Authors: Arup Bose, Soumendu Sundar Mukherjee

    Abstract: Limiting Spectral Distributions (LSD) of real symmetric patterned matrices have been well-studied. In this article, we consider skew-symmetric/anti-symmetric patterned random matrices and establish the LSDs of several common matrices. For the skew-symmetric Wigner, skew-symmetric Toeplitz and the skew-symmetric Circulant, the LSDs (on the imaginary axis) are the same as those in the symmetric case… ▽ More

    Submitted 15 February, 2014; originally announced February 2014.

    Comments: 21 pages, 2 figures

    MSC Class: Primary 15B52; 60B20; secondary 60B10; 60F99; 60B99

  27. arXiv:1402.2207  [pdf, other

    math.PR

    Bulk behaviour of Schur-Hadamard products of symmetric random matrices

    Authors: Arup Bose, Soumendu Sundar Mukherjee

    Abstract: We develop a general method for establishing the existence of the Limiting Spectral Distributions (LSD) of Schur-Hadamard products of independent symmetric patterned random matrices. We apply this method to show that the LSDs of Schur-Hadamard products of some common patterned matrices exist and identify the limits. In particular, the Schur-Hadamard product of independent Toeplitz and Hankel matri… ▽ More

    Submitted 15 March, 2014; v1 submitted 10 February, 2014; originally announced February 2014.

    Comments: 27 pages, 1 figure; to appear, Random Matrices: Theory and Applications. This is the final version, incorporating referee comments

    MSC Class: Primary 15B52; 60B20; secondary 60B10; 60F99; 60B99

  28. arXiv:1303.4251  [pdf, ps, other

    math.CA

    An Approximation Inequality for Continued Radicals and Power Forms

    Authors: Soumendu Sundar Mukherjee

    Abstract: In this article we derive an approximation inequality for continued radicals, generalizing an inequality of Herschfeld for continued square roots to arbitrary radicals, which is useful in exploring convergence issues and obtaining convergence rates. In fact, we generalize this inequality further to encompass the more general continued power forms. We demonstrate the use of this inequality by obtai… ▽ More

    Submitted 24 October, 2013; v1 submitted 18 March, 2013; originally announced March 2013.

    Comments: 12 pages

    MSC Class: 40A25; 40A05; 26D20