-
Study of higher-order interactions in unweighted, undirected networks using persistent homology
Authors:
Udit Raj,
Slobodan Maletić,
Sudeepto Bhattacharya
Abstract:
Persistent homology has been studied to better understand the structural properties and topological features of weighted networks. It can reveal hidden layers of information about the higher-order structures formed by non-pairwise interactions in a network. Studying the higher-order interactions (HoIs) of a system provides a more comprehensive understanding of the complex system; moreover, it gives a more precise depiction of the system, as many complex systems, such as ecological and biological systems, exhibit HoIs. In this study, a weighted simplicial adjacency matrix is constructed using the concept of adjacency strength of simplices in a clique complex obtained from an unweighted, undirected network. This matrix is used to calculate a global measure, called generalised weighted betweenness centrality, which in turn is used to compute persistent homology on the given simplicial complex by constructing a filtration on it. Moreover, a local measure called maximal generalised degree centrality is introduced for a better understanding of the network topology of the studied simplicial complex. All the generalizations given in this work reduce to the graph-theoretic case, i.e., a simplicial complex of dimension 1. Three different filtration schemes for constructing the sequence of simplicial complexes are given using both the global and local measures, and these measures are used to compare the topology of the higher-order structures that arise in the studied network from the interactions of its vertices. Finally, the definitions are illustrated on a real-life network by calculating Betti numbers up to dimension two.
Submitted 24 June, 2025;
originally announced June 2025.
-
Two-dimensional Rademacher walk
Authors:
Satyaki Bhattacharya,
Stanislav Volkov
Abstract:
We study a generalisation of the one-dimensional Rademacher random walk introduced in Bhattacharya and Volkov (2023) to $\mathbb{Z}^2$ (for $d\ge 3$, the Rademacher random walk is always transient, as follows from Theorem 8.8 in Englander and Volkov (2025)). This walk is defined as the sum of a sequence of independent steps, where each step goes in one of the four possible directions with equal probability, and the size of the $n$th step is $a_n$ where $\{a_n\}$ is a given sequence of positive integers. We establish some general conditions under which the walk is recurrent, respectively, transient.
Submitted 19 June, 2025;
originally announced June 2025.
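The walk described in this abstract can be simulated directly. The following is a minimal sketch, assuming a hypothetical step-size sequence; the function names and the choice of `a_n` are illustrative and do not come from the paper. It says nothing about recurrence or transience by itself, it only generates sample paths.

```python
import random

# The four lattice directions on Z^2; each is chosen with probability 1/4.
DIRECTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def rademacher_walk_2d(step_sizes, rng):
    """Return the positions visited by the walk, starting at the origin.

    step_sizes: iterable of positive integers a_1, a_2, ... (the n-th step has size a_n).
    rng: a random.Random instance, passed in so that runs are reproducible.
    """
    x, y = 0, 0
    path = [(x, y)]
    for a_n in step_sizes:
        dx, dy = rng.choice(DIRECTIONS)  # independent, uniform direction
        x, y = x + a_n * dx, y + a_n * dy
        path.append((x, y))
    return path


def count_returns_to_origin(path):
    """Count how many times the walk is at the origin after time 0."""
    return sum(1 for position in path[1:] if position == (0, 0))


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical slowly growing step sizes; any positive integer sequence works.
    sizes = [1 + n // 10 for n in range(1, 1001)]
    path = rademacher_walk_2d(sizes, rng)
    print("final position:", path[-1])
    print("returns to origin:", count_returns_to_origin(path))
```

Passing the random generator explicitly keeps experiments with different sequences `a_n` comparable under the same seed.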
-
The superspace coinvariant ring of type B
Authors:
Sutanay Bhattacharya
Abstract:
Given the rank $n$ superspace $Ω_n$, the ring of polynomial-valued differential forms on $\mathbb C^n$, one can define an action of the hyperoctahedral group $\mathfrak B_n$ on it. This leads to a superspace coinvariant ring $SR_n^B$, defined as the quotient of $Ω_n$ by the two-sided ideal generated by all $\mathfrak B_n$-invariants with vanishing constant term. We derive the Hilbert series of $SR^B_n$ conjectured by Sagan and Swanson, and prove an operator theorem that yields a concrete description of the superharmonic space $SH^B_n$ associated to $SR^B_n$, as conjectured by Swanson and Wallach. We also derive an explicit basis of $SR^B_n$ using the theory of hyperplane arrangements.
Submitted 29 May, 2025;
originally announced May 2025.
-
ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model
Authors:
Sagnik Bhattacharya,
Abhiram Gorle,
Ahsan Bilal,
Connor Ding,
Amit Kumar Singh Yadav,
Tsachy Weissman
Abstract:
Generative modeling of non-negative, discrete data, such as symbolic music, remains challenging due to two persistent limitations in existing methods. Firstly, many approaches rely on modeling continuous embeddings, which is suboptimal for inherently discrete data distributions. Secondly, most models optimize variational bounds rather than the exact data likelihood, resulting in inaccurate likelihood estimates and degraded sampling quality. While recent diffusion-based models have addressed these issues separately, we tackle them jointly. In this work, we introduce the Information-Theoretic Discrete Poisson Diffusion Model (ItDPDM), inspired by the photon arrival process, which combines exact likelihood estimation with fully discrete-state modeling. Central to our approach is an information-theoretic Poisson Reconstruction Loss (PRL) that has a provable exact relationship with the true data likelihood. ItDPDM achieves improved likelihood and sampling performance over prior discrete and continuous diffusion models on a variety of synthetic discrete datasets. Furthermore, on real-world datasets such as symbolic music and images, ItDPDM attains superior likelihood estimates and competitive generation quality, demonstrating a proof of concept for distribution-robust discrete generative modeling.
Submitted 27 May, 2025; v1 submitted 8 May, 2025;
originally announced May 2025.
-
Constant Rate Isometric Embeddings of Hamming Metric into Edit Metric
Authors:
Sudatta Bhattacharya,
Sanjana Dey,
Elazar Goldenberg,
Mursalin Habib,
Bernhard Haeupler,
Karthik C. S.,
Michal Koucký
Abstract:
A function $\varphi: \{0,1\}^n \to \{0,1\}^N$ is called an isometric embedding of the $n$-dimensional Hamming metric space to the $N$-dimensional edit metric space if, for all $x, y \in \{0,1\}^n$, the Hamming distance between $x$ and $y$ is equal to the edit distance between $\varphi(x)$ and $\varphi(y)$. The rate of such an embedding is defined as the ratio $n/N$. It is well known in the literature how to construct isometric embeddings with a rate of $Ω(\frac{1}{\log n})$. However, achieving even near-isometric embeddings with a positive constant rate has remained elusive until now.
In this paper, we present an isometric embedding with a rate of 1/8 by discovering connections to synchronization strings, which were studied in the context of insertion-deletion codes (Haeupler-Shahrasbi [JACM'21]). At a technical level, we introduce a framework for obtaining high-rate isometric embeddings using a novel object called misaligners. As an immediate consequence of our constant rate isometric embedding, we improve known conditional lower bounds for various optimization problems in the edit metric, but now with optimal dependency on the dimension.
We complement our results by showing that no isometric embedding $\varphi:\{0, 1\}^n \to \{0, 1\}^N$ can have rate greater than 15/32 for all positive integers $n$. En route to proving this upper bound, we uncover fundamental structural properties necessary for every Hamming-to-edit isometric embedding. We also prove similar upper and lower bounds for embeddings over larger alphabets.
Finally, we consider embeddings $\varphi:Σ_{\text{in}}^n\to Σ_{\text{out}}^N$ between different input and output alphabets, where the rate is given by $\frac{n\log|Σ_{\text{in}}|}{N\log|Σ_{\text{out}}|}$. In this setting, we show that the rate can be made arbitrarily close to 1.
Submitted 4 April, 2025;
originally announced April 2025.
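The definition of an isometric embedding in this abstract can be checked by brute force for very small $n$. The sketch below is illustrative only: it assumes the edit metric is the Levenshtein distance (insertions, deletions, and substitutions), and the helper names are hypothetical. It does not implement the paper's misaligner construction; it only tests a candidate map against the definition.

```python
from itertools import product


def hamming(u, v):
    """Number of positions in which two equal-length strings differ."""
    if len(u) != len(v):
        raise ValueError("Hamming distance needs strings of equal length")
    return sum(a != b for a, b in zip(u, v))


def edit_distance(u, v):
    """Levenshtein distance via the standard row-by-row dynamic program."""
    prev = list(range(len(v) + 1))  # distances from the empty prefix of u
    for i, a in enumerate(u, start=1):
        cur = [i]
        for j, b in enumerate(v, start=1):
            cur.append(min(prev[j] + 1,              # delete a
                           cur[j - 1] + 1,           # insert b
                           prev[j - 1] + (a != b)))  # substitute or match
        prev = cur
    return prev[-1]


def is_isometric(phi, n):
    """Check Hamming(x, y) == edit(phi(x), phi(y)) for all x, y in {0,1}^n."""
    words = ["".join(bits) for bits in product("01", repeat=n)]
    return all(hamming(x, y) == edit_distance(phi(x), phi(y))
               for x in words for y in words)


def identity(x):
    """Rate-1 candidate map; it stops being isometric already at n = 3."""
    return x


if __name__ == "__main__":
    for n in (2, 3):
        print(n, is_isometric(identity, n))
```

Under this assumption the identity map passes the check for $n = 2$ but fails for $n = 3$: the strings 010 and 101 are at Hamming distance 3 but edit distance 2 (delete the leading 0, append a 1). This is the kind of misalignment that forces an isometric embedding to use $N > n$.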
-
Uncertainty principles on $C^{*}$-algebras
Authors:
Saptak Bhattacharya
Abstract:
In this paper we prove some uncertainty bounds for commutators and anti-commutators of observables in a $C^*$-algebra. We give a short, elementary proof of Robertson's Standard Uncertainty Principle in this setting. We also prove some other uncertainty relations for which the lower bound doesn't vanish for any number of observables.
Submitted 22 March, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
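For reference, the two-observable form of Robertson's inequality can be written as follows. This is the standard textbook statement, given under assumed notation ($φ$ a state on the $C^*$-algebra, $a$ and $b$ self-adjoint elements, $V_φ$ the variance); the paper's own formulation and proof may differ.

```latex
V_φ(a) := φ(a^2) - φ(a)^2, \qquad [a,b] := ab - ba,
\qquad
V_φ(a)\, V_φ(b) \;\ge\; \tfrac{1}{4}\,\bigl| φ([a,b]) \bigr|^2 .
```

The lower bound vanishes whenever $φ([a,b]) = 0$, which is the situation the non-vanishing bounds mentioned in the abstract are meant to avoid.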
-
Trace of Multi-variable Matrix Functions and its Application to Functions of Graph Spectrum
Authors:
Subhrajit Bhattacharya
Abstract:
The matrix extension of a scalar function of a single variable is well studied in the literature. Of particular interest is the trace of such functions. It is known that for diagonalizable matrices $M$, the function $g(M) = \text{Tr}(f(M)) = \sum_{j=1}^n f(μ_j)$ (where $\{μ_j\}_{j=1,2,\cdots,n}$ are the eigenvalues of $M$) inherits the monotonicity and convexity properties of $f$ (i.e., for $g$ to be convex, $f$ need not be operator convex -- convexity is sufficient). In this paper we formalize the idea of the matrix extension of a function of multiple variables, study the monotonicity and convexity properties of the trace, and thus show that a function of the form $g(M) = \sum_{j_1=1}^n \sum_{j_2=1}^n \cdots \sum_{j_m=1}^n f(μ_{j_1}, μ_{j_2},\cdots, μ_{j_m})$ also inherits the monotonicity and convexity properties of the multi-variable function $f$. We apply these results to functions of the spectrum of the weighted Laplacian matrix of undirected, simple graphs.
Submitted 27 January, 2025; v1 submitted 24 January, 2025;
originally announced January 2025.
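The multi-index sum $g(M)$ in this abstract is straightforward to evaluate once the eigenvalues are known. The sketch below is an illustration under stated assumptions: it uses the unweighted path graph $P_n$, whose Laplacian eigenvalues have the closed form $2 - 2\cos(πk/n)$ for $k = 0, \dots, n-1$, so no linear-algebra library is needed. The function names are hypothetical and the example graph is not taken from the paper.

```python
import math
from itertools import product


def path_laplacian_eigenvalues(n):
    """Closed-form Laplacian spectrum of the unweighted path graph on n vertices."""
    return [2.0 - 2.0 * math.cos(math.pi * k / n) for k in range(n)]


def trace_of_extension(f, eigenvalues, m):
    """Evaluate g = sum over all index tuples (j_1, ..., j_m) of f(mu_{j_1}, ..., mu_{j_m})."""
    return sum(f(*combo) for combo in product(eigenvalues, repeat=m))


if __name__ == "__main__":
    mu = path_laplacian_eigenvalues(5)
    # m = 1, f(x) = x recovers the ordinary trace of the Laplacian.
    print(trace_of_extension(lambda x: x, mu, 1))
    # m = 2, f(x, y) = x * y gives the square of the trace.
    print(trace_of_extension(lambda x, y: x * y, mu, 2))
```

As a sanity check, the single-variable case with $f(x) = x$ returns the trace of the Laplacian, which equals the sum of the vertex degrees, $2(n-1)$ for a path. The sum has $n^m$ terms, so this direct evaluation is only practical for small $n$ and $m$.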
-
Twist like behavior in non-twist patterns of triods
Authors:
Sourav Bhattacharya,
Ashish Yadav
Abstract:
We prove a sufficient condition for a \emph{pattern} $π$ on a \emph{triod} $T$ to have \emph{rotation number} $ρ_π$ coincide with an endpoint of its \emph{forced rotation interval} $I_π$. Then, we demonstrate the existence of peculiar \emph{patterns} on \emph{triods} that are neither \emph{triod twists} nor possess a \emph{block structure} over a \emph{triod twist pattern}, but whose \emph{rotation numbers} are an endpoint of their respective \emph{forced rotation intervals}, mimicking the behavior of \emph{triod twist patterns}. These \emph{patterns}, absent in circle maps (see \cite{almBB}), highlight a key difference between the rotation theories for \emph{triods} (introduced in \cite{BMR}) and that of circle maps. We name these \emph{patterns} ``\emph{strangely ordered}'' and show that they are semi-conjugate to circle rotations via a piecewise monotone map. We conclude by providing an algorithm to construct unimodal \emph{strangely ordered patterns} with arbitrary \emph{rotation pairs}.
Submitted 24 December, 2024;
originally announced December 2024.
-
Forest Fire Model on $\mathbb{Z}_{+}$ with Delays
Authors:
Satyaki Bhattacharya,
Stanislav Volkov
Abstract:
We consider a generalization of the forest fire model on $\mathbb{Z}_+$ with ignition at zero only, studied in [arXiv:0907.1821]. Unlike that model, we allow delays in the spread of the fires as well as a non-zero burning time of individual ``trees''. We obtain some general properties for this model, which cover, among others, the phenomenon of an ``infinite fire'', not present in the original model.
Submitted 31 March, 2025; v1 submitted 20 November, 2024;
originally announced November 2024.
-
Spectrum Optimization of Dynamic Networks for Reduction of Vulnerability Against Adversarial Resonance Attacks
Authors:
Alp Sahin,
Nicolas Kozachuk,
Rick S. Blum,
Subhrajit Bhattacharya
Abstract:
Resonance is a well-known phenomenon that occurs in systems with second-order dynamics. In this paper we address the fundamental question of making a network robust to a signal being periodically pumped into it at or near a resonant frequency by an adversarial agent with the aim of saturating the network with the signal. Towards this goal, we develop the notion of network vulnerability, which is measured by the expected resonance amplitude on the network under a stochastically modeled adversarial attack. Assuming a second-order dynamics model based on the network graph Laplacian matrix and a known stochastic model for the adversarial attack, we propose two methods for minimizing the network vulnerability through optimization of the spectrum of the network graph. We provide extensive numerical results analyzing the effects of both methods.
Submitted 29 January, 2025; v1 submitted 30 September, 2024;
originally announced October 2024.
-
Primes and polygonal numbers
Authors:
Soumya Bhattacharya,
Habibur Rahaman
Abstract:
For all integers $r,s>2$, each linear combination of an \mbox{$r$-gonal} number and an $s$-gonal number with coprime positive integer coefficients produces infinitely many primes. However, for any given pair of coprime positive integer coefficients of the polygonal numbers, the sum of the reciprocals of such primes diverges if and only if $r=s=4$.
Submitted 20 February, 2025; v1 submitted 24 August, 2024;
originally announced August 2024.
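The objects in this abstract are easy to enumerate for small values. The following sketch lists the primes of the form $a\,P_r(x) + b\,P_s(y)$ below a bound, using the standard formula $P_r(k) = \bigl((r-2)k^2 - (r-4)k\bigr)/2$ for the $k$-th $r$-gonal number. The function names, the restriction to indices $x, y \ge 1$, and the search bound are assumptions made for illustration; the enumeration is not evidence for the theorem, only a way to look at examples.

```python
def polygonal(r, k):
    """k-th r-gonal number: ((r - 2) * k^2 - (r - 4) * k) / 2 (always an integer)."""
    return ((r - 2) * k * k - (r - 4) * k) // 2


def is_prime(n):
    """Trial division; adequate for the small bounds used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def primes_from_combination(r, s, a, b, limit):
    """Sorted primes p <= limit of the form a*P_r(x) + b*P_s(y) with x, y >= 1."""
    found = set()
    x = 1
    while a * polygonal(r, x) <= limit:
        y = 1
        while True:
            value = a * polygonal(r, x) + b * polygonal(s, y)
            if value > limit:
                break  # polygonal numbers increase with the index, so stop here
            if is_prime(value):
                found.add(value)
            y += 1
        x += 1
    return sorted(found)


if __name__ == "__main__":
    # r = s = 4 with coefficients 1, 1: primes that are sums of two positive squares.
    print(primes_from_combination(4, 4, 1, 1, 50))
    # Triangular plus pentagonal numbers, as a second example.
    print(primes_from_combination(3, 5, 1, 1, 50))
```

For $r = s = 4$ and $a = b = 1$ the output up to 50 is 2, 5, 13, 17, 29, 37, 41, the primes that are sums of two positive squares. This is the case singled out in the abstract as the only one where the sum of reciprocals diverges.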
-
Forcing Minimal Interval Patterns as Interval Exchange Transformations
Authors:
Sourav Bhattacharya
Abstract:
We prove that any over-twist pattern is conjugate to an interval exchange transformation with a bounded number of segments of isometry, restricted to one of its cycles. The bound is independent of the period and over-rotation number of the over-twist pattern and depends only on its modality.
Submitted 18 August, 2024;
originally announced August 2024.
-
Causal effect estimation under network interference with mean-field methods
Authors:
Sohom Bhattacharya,
Subhabrata Sen
Abstract:
We study causal effect estimation from observational data under interference. The interference pattern is captured by an observed network. We adopt the chain graph framework of Tchetgen Tchetgen et al. (2021), which allows (i) interaction among the outcomes of distinct study units connected along the graph and (ii) long-range interference, whereby the outcome of a unit may depend on the treatments assigned to distant units connected along the interference network. For ``mean-field'' interaction networks, we develop a new scalable iterative algorithm to estimate the causal effects. For Gaussian weighted networks, we introduce a novel causal effect estimation algorithm based on Approximate Message Passing (AMP). Our algorithms are provably consistent under a ``high-temperature'' condition on the underlying model. We estimate the (unknown) parameters of the model from data using maximum pseudo-likelihood and establish $\sqrt{n}$-consistency of this estimator in all parameter regimes. Finally, we prove that the downstream estimators obtained by plugging estimated parameters into the aforementioned algorithms are consistent at high temperature. Our methods can accommodate dense interactions among the study units -- a setting beyond reach using existing techniques. Our algorithms originate from the study of variational inference approaches in high-dimensional statistics; overall, we demonstrate the usefulness of these ideas in the context of causal effect estimation under interference.
Submitted 28 July, 2024;
originally announced July 2024.
-
A description of the integral depth-$r$ Bernstein center
Authors:
Tsao-Hsien Chen,
Sarbartha Bhattacharya
Abstract:
In this paper we give a description of the depth-$r$ Bernstein center for non-negative integers $r$ of a reductive simply connected group $G$ over a non-archimedean local field as a limit of depth-$r$ standard parahoric Hecke algebras. Using the description, we construct maps from the algebra of stable functions on the $r$-th Moy-Prasad filtration quotient of hyperspecial parahorics to the depth-$r$ Bernstein center and use them to attach to each depth-$r$ irreducible representation $π$ an invariant $θ(π)$, called the depth-$r$ Deligne-Lusztig parameter of $π$. We show that $θ(π)$ is equal to the semi-simple part of minimal $K$-types of $π$.
Submitted 21 July, 2024;
originally announced July 2024.
-
Generalization error of min-norm interpolators in transfer learning
Authors:
Yanke Song,
Sohom Bhattacharya,
Pragya Sur
Abstract:
This paper establishes the generalization error of pooled min-$\ell_2$-norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during training. However, in many applications, a limited amount of test data may be available during training, yet properties of min-norm interpolation in this setting are not well understood. We address this gap by characterizing the bias and variance of pooled min-$\ell_2$-norm interpolation under covariate and model shifts. The pooled interpolator captures both early fusion and a form of intermediate fusion. Our results have several implications: under model shift, for low signal-to-noise ratio (SNR), adding data always hurts. For higher SNR, transfer learning helps as long as the shift-to-signal ratio (SSR) lies below a threshold that we characterize explicitly. By consistently estimating these ratios, we provide a data-driven method to determine: (i) when the pooled interpolator outperforms the target-based interpolator, and (ii) the optimal number of target samples that minimizes the generalization error. Under covariate shift, if the source sample size is small relative to the dimension, heterogeneity between domains improves the risk, and vice versa. We establish a novel anisotropic local law to achieve these characterizations, which may be of independent interest in random matrix theory. We supplement our theoretical characterizations with comprehensive simulations that demonstrate the finite-sample efficacy of our results.
Submitted 19 June, 2024;
originally announced June 2024.
-
A proof of Sylvester's theorem
Authors:
Saptak Bhattacharya
Abstract:
We give a new elementary proof of the existence and uniqueness of a solution to the Sylvester equation $AX-XB=Y$.
Submitted 29 February, 2024;
originally announced March 2024.
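For orientation, the classical solvability criterion for this equation can be stated through vectorization. What follows is the standard textbook formulation, not the proof given in the paper; the matrix sizes $n$, $m$ and the column-stacking operator $\mathrm{vec}$ are notation assumed here.

```latex
A \in \mathbb{C}^{n\times n},\quad B \in \mathbb{C}^{m\times m},\quad X, Y \in \mathbb{C}^{n\times m},
\qquad
AX - XB = Y
\;\Longleftrightarrow\;
\bigl(I_m \otimes A \;-\; B^{T} \otimes I_n\bigr)\,\mathrm{vec}(X) = \mathrm{vec}(Y).
```

The eigenvalues of the coefficient matrix are the differences $λ_i(A) - μ_j(B)$, so a unique solution $X$ exists for every $Y$ exactly when $A$ and $B$ have no eigenvalue in common.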
-
A new invariant for a cycle of an interval map
Authors:
Sourav Bhattacharya
Abstract:
We \emph{propose} a new \emph{invariant} for a \emph{cycle} of an \emph{interval map} $f:[0,1] \to [0,1]$, called its \emph{unfolding number}.
Submitted 2 June, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Nonsense associations in Markov random fields with pairwise dependence
Authors:
Sohom Bhattacharya,
Rajarshi Mukherjee,
Elizabeth Ogburn
Abstract:
Yule (1926) identified the issue of "nonsense correlations" in time series data, where dependence within each of two random vectors causes overdispersion -- i.e. variance inflation -- for measures of dependence between the two. During the near century since then, much has been written about nonsense correlations -- but nearly all of it confined to the time series literature. In this paper we provide the first, to our knowledge, rigorous study of this phenomenon for more general forms of (positive) dependence, specifically for Markov random fields on lattices and graphs. We consider both binary and continuous random vectors and three different measures of association: correlation, covariance, and the ordinary least squares coefficient that results from projecting one random vector onto the other. In some settings we find variance inflation consistent with Yule's nonsense correlation. However, surprisingly, we also find variance deflation in some settings, and in others the variance is unchanged under dependence. Perhaps most notably, we find general conditions under which OLS inference that ignores dependence is valid despite positive dependence in the regression errors, contradicting the presentation of OLS in countless textbooks and courses.
Submitted 5 February, 2024;
originally announced February 2024.
-
Evaluating the consequences: Impact of sex-selective harvesting on fish population and identifying tipping points via life-history parameters
Authors:
Joydeb Bhattacharyya,
Arnab Chattopadhyay,
Anurag Sau,
Sabyasachi Bhattacharya
Abstract:
Fish harvesting often targets larger individuals, which can be sex-specific due to size dimorphism or differences in behaviors like migration and spawning. Sex-selective harvesting can have dire consequences in the long run, potentially pushing fish populations towards collapse much earlier due to skewed sex ratios and reduced reproduction. To investigate this pressing issue, we use a single-species sex-structured mathematical model with a weak Allee effect on the fish population. Additionally, we incorporate a realistic harvesting mechanism resembling the Michaelis-Menten function. Our analysis illuminates the intricate interplay between life history traits, harvesting intensity, and population stability. The results demonstrate that fish life history traits, such as a higher reproductive rate, early maturation of juveniles, and increased longevity, confer advantages under intensive harvesting. To anticipate potential population collapse, we employ a novel early warning tool (EWT) based on the concept of basin stability to pinpoint tipping points before they occur. Harvesting yield at our proposed early indicator can act as a potential pathway to achieve optimal yield while keeping the population safely away from the brink of collapse, rather than relying solely on the established maximum sustainable yield (MSY), where the population dangerously approaches the point of no return. Furthermore, we show that density-dependent female stocking upon receiving an EWT signal significantly shifts the tipping point, allowing safe harvesting even at MSY levels, and can thus act as a potential intervention strategy.
Submitted 29 January, 2024;
originally announced January 2024.
-
Weighted Combinatorial Laplacian and its Application to Coverage Repair in Sensor Networks
Authors:
Shunsaku Yadokoro,
Subhrajit Bhattacharya
Abstract:
We define the weighted combinatorial Laplacian operators on a simplicial complex and investigate their spectral properties. Eigenvalues close to zero and their corresponding eigenvectors are of particular interest, and we show that they can detect almost $n$-dimensional holes in the given complex. Real-valued weights on simplices allow gradient-descent-based optimization, which in turn gives an efficient dynamic coverage repair algorithm for the sensor network of a mobile robot team. Using the theory of relative homology, we also extend the problem of dynamic coverage repair to environments with obstacles.
Submitted 14 April, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
On monic abelian trace-one cubic polynomials
Authors:
Shubhrajit Bhattacharya,
Andrew O'Desky
Abstract:
We compute the asymptotic number of monic trace-one integral polynomials with Galois group $C_3$ and bounded height. For such polynomials we compute a height function coming from toric geometry and introduce a parametrization using the quadratic cyclotomic field $\mathbb Q(\sqrt{-3})$. We also give a formula for the number of polynomials of the form $t^3 -t^2 + at + b \in \mathbb Z[t]$ with Galois group $C_3$ for a fixed integer $a$.
Submitted 26 October, 2023;
originally announced October 2023.
-
Inferences on Mixing Probabilities and Ranking in Mixed-Membership Models
Authors:
Sohom Bhattacharya,
Jianqing Fan,
Jikai Hou
Abstract:
Network data is prevalent in numerous big data applications, including economics and health networks, where it is of prime importance to understand the latent structure of the network. In this paper, we model the network using the Degree-Corrected Mixed Membership (DCMM) model. In the DCMM model, for each node $i$, there exists a membership vector $\boldsymbol{π}_i = (\boldsymbol{π}_i(1), \boldsymbol{π}_i(2),\ldots, \boldsymbol{π}_i(K))$, where $\boldsymbol{π}_i(k)$ denotes the weight that node $i$ puts on community $k$. We derive a novel finite-sample expansion for the $\boldsymbol{π}_i(k)$s, which allows us to obtain asymptotic distributions and confidence intervals for the membership mixing probabilities and other related population quantities. This fills an important gap in uncertainty quantification for the membership profile. We further develop a ranking scheme of the vertices based on the membership mixing probabilities on certain communities and perform relevant statistical inferences. A multiplier bootstrap method is proposed for ranking inference of an individual member's profile with respect to a given community. The validity of our theoretical results is further demonstrated via numerical experiments on both real and synthetic data examples.
Submitted 28 August, 2023;
originally announced August 2023.
-
Gibbs Measures with Multilinear Forms
Authors:
Sohom Bhattacharya,
Nabarun Deb,
Sumit Mukherjee
Abstract:
In this paper, we study a class of multilinear Gibbs measures with Hamiltonian given by a generalized $\mathrm{U}$-statistic and with a general base measure. Expressing the asymptotic free energy as an optimization problem over a space of functions, we obtain necessary and sufficient conditions for replica-symmetry. Utilizing this, we obtain weak limits for a large class of statistics of interest, which includes the ''local fields/magnetization'', the Hamiltonian, the global magnetization, etc. An interesting consequence is a universal weak law for contrasts under replica symmetry, namely, $n^{-1}\sum_{i=1}^n c_i X_i\to 0$ weakly, if $\sum_{i=1}^n c_i=o(n)$. Our results yield a probabilistic interpretation for the optimizers arising out of the limiting free energy. We also prove the existence of a sharp phase transition point in terms of the temperature parameter, thereby generalizing existing results that were only known for quadratic Hamiltonians. As a by-product of our proof technique, we obtain exponential concentration bounds on local and global magnetizations, which are of independent interest.
Submitted 28 July, 2023; v1 submitted 26 July, 2023;
originally announced July 2023.
-
Transience of continuous-time conservative random walks
Authors:
Satyaki Bhattacharya,
Stanislav Volkov
Abstract:
We consider two continuous-time generalizations of conservative random walks introduced in [J.Englander and S.Volkov (2022)], an orthogonal and a spherically-symmetrical one; the latter model is known as {\em random flights}. For both models, we show the transience of the walks when $d\ge 2$ and the rate of changing of direction follows power law $t^{-α}$, $0<α\le 1$, or the law $(\ln t)^{-β}$ where $β>2$.
Submitted 27 August, 2024; v1 submitted 19 June, 2023;
originally announced June 2023.
-
Deep Neural Networks for Nonparametric Interaction Models with Diverging Dimension
Authors:
Sohom Bhattacharya,
Jianqing Fan,
Debarghya Mukherjee
Abstract:
Deep neural networks have achieved tremendous success due to their representation power and adaptation to low-dimensional structures. Their potential for estimating structured regression functions has been recently established in the literature. However, most of the studies require the input dimension to be fixed; consequently, they ignore the effect of dimension on the rate of convergence, which hampers their application to modern big data with high dimensionality. In this paper, we bridge this gap by analyzing a $k^{th}$ order nonparametric interaction model both in the growing dimension scenario ($d$ grows with $n$ but at a slower rate) and in high dimension ($d \gtrsim n$). In the latter case, sparsity assumptions and associated regularization are required in order to obtain optimal rates of convergence. A new challenge in the diverging dimension setting arises in calculating the mean-square error: the covariance terms among estimated additive components are an order of magnitude larger than the variances, and they can deteriorate statistical properties without proper care. We introduce a critical debiasing technique to amend the problem. We show that under certain standard assumptions, debiased deep neural networks achieve a minimax optimal rate in terms of both $n$ and $d$. Our proof techniques rely crucially on a novel debiasing technique that makes the covariances of additive components negligible in the mean-square error calculation. In addition, we establish the matching lower bounds.
Submitted 11 February, 2023;
originally announced February 2023.
-
LDP for Inhomogeneous U-Statistics
Authors:
Sohom Bhattacharya,
Nabarun Deb,
Sumit Mukherjee
Abstract:
In this paper we derive a Large Deviation Principle (LDP) for inhomogeneous U/V-statistics of a general order. Using this, we derive a LDP for two types of statistics: random multilinear forms, and number of monochromatic copies of a subgraph. We show that the corresponding rate functions in these cases can be expressed as a variational problem over a suitable space of functions. We use the tools developed to study Gibbs measures with the corresponding Hamiltonians, which include tensor generalizations of both Ising (with non-compact base measure) and Potts models. For these Gibbs measures, we establish scaling limits of log normalizing constants, and weak laws in terms of weak* topology, which are of possible independent interest.
Submitted 7 December, 2022;
originally announced December 2022.
-
On hierarchically closed fractional intersecting families
Authors:
Niranjan Balachandran,
Srimanta Bhattacharya,
Krishn Vishwas Kher,
Rogers Mathew,
Brahadeesh Sankarnarayanan
Abstract:
For a set $L$ of positive proper fractions and a positive integer $r \geq 2$, a fractional $r$-closed $L$-intersecting family is a collection $\mathcal{F} \subset \mathcal{P}([n])$ with the property that for any $2 \leq t \leq r$ and $A_1, \dotsc, A_t \in \mathcal{F}$ there exists $θ\in L$ such that $\lvert A_1 \cap \dotsb \cap A_t \rvert \in \{ θ\lvert A_1 \rvert, \dotsc, θ\lvert A_t \rvert\}$. In this paper we show that for $r \geq 3$ and $L = \{θ\}$ any fractional $r$-closed $θ$-intersecting family has size at most linear in $n$, and this is best possible up to a constant factor. We also show that in the case $θ= 1/2$ we have a tight upper bound of $\lfloor \frac{3n}{2} \rfloor - 2$ and that a maximal $r$-closed $(1/2)$-intersecting family is determined uniquely up to isomorphism.
Submitted 11 April, 2024; v1 submitted 4 November, 2022;
originally announced November 2022.
-
PC Adjusted Testing for Low Dimensional Parameters
Authors:
Sohom Bhattacharya,
Rounak Dey,
Rajarshi Mukherjee
Abstract:
In this paper, we investigate the impact of high-dimensional Principal Component (PC) adjustments on inferring the effects of variables on outcomes, with a focus on applications in genetic association studies where PC adjustment is commonly used to account for population stratification. We consider high-dimensional linear regression in the regime where the number of covariates grows proportionally to the number of samples. In this setting, we provide an asymptotically precise understanding of when PC adjustments yield valid tests with controlled Type I error rates. Our results demonstrate that, under both fixed and diverging signal strengths, PC regression often fails to control the Type I error at the desired nominal level. Furthermore, we establish necessary and sufficient conditions for Type I error inflation based on covariate distributions. These theoretical findings are further supported by a series of numerical experiments.
Submitted 27 June, 2025; v1 submitted 22 September, 2022;
originally announced September 2022.
-
IID Sampling from Posterior Dirichlet Process Mixtures
Authors:
Sourabh Bhattacharya
Abstract:
The influence of Dirichlet process mixture is ubiquitous in the Bayesian nonparametrics literature. But sampling from its posterior distribution remains a challenge, despite the advent of various Markov chain Monte Carlo methods. The primary challenge is the infinite-dimensional setup, and even if the infinite-dimensional random measure is integrated out, high-dimensionality and discreteness still remain difficult issues to deal with.
In this article, exploiting the key ideas proposed in Bhattacharya (2021b), we propose a novel methodology for drawing iid realizations from posteriors of Dirichlet process mixtures. We focus in particular on the more general and flexible model of Bhattacharya (2008), so that the methods developed here are simply applicable to the traditional Dirichlet process mixture.
We illustrate our ideas on the well-known enzyme, acidity and galaxy datasets, which are usually considered benchmark datasets for mixture applications. Generating 10,000 iid realizations from the Dirichlet process mixture posterior of Bhattacharya (2008) given these datasets took 19 minutes, 8 minutes and 5 minutes, respectively, in our parallel implementation.
Submitted 18 June, 2022;
originally announced June 2022.
-
Sequential Bayesian Neural Subnetwork Ensembles
Authors:
Sanket Jantre,
Shrijita Bhattacharya,
Nathan M. Urban,
Byung-Jun Yoon,
Tapabrata Maiti,
Prasanna Balaprakash,
Sandeep Madireddy
Abstract:
Deep ensembles have emerged as a powerful technique for improving predictive performance and enhancing model robustness across various applications by leveraging model diversity. However, traditional deep ensemble methods are often computationally expensive and rely on deterministic models, which may limit their flexibility. Additionally, while sparse subnetworks of dense models have shown promise in matching the performance of their dense counterparts and even enhancing robustness, existing methods for inducing sparsity typically incur training costs comparable to those of training a single dense model, as they either gradually prune the network during training or apply thresholding post-training. In light of these challenges, we propose an approach for sequential ensembling of dynamic Bayesian neural subnetworks that consistently maintains reduced model complexity throughout the training process while generating diverse ensembles in a single forward pass. Our approach involves an initial exploration phase to identify high-performing regions within the parameter space, followed by multiple exploitation phases that take advantage of the compactness of the sparse model. These exploitation phases quickly converge to different minima in the energy landscape, corresponding to high-performing subnetworks that together form a diverse and robust ensemble. We empirically demonstrate that our proposed approach outperforms traditional dense and sparse deterministic and Bayesian ensemble models in terms of prediction accuracy, uncertainty estimation, out-of-distribution detection, and adversarial robustness.
Submitted 19 August, 2024; v1 submitted 1 June, 2022;
originally announced June 2022.
-
Recurrence and transience of Rademacher series
Authors:
Satyaki Bhattacharya,
Stanislav Volkov
Abstract:
We introduce the notion of {\bf a}-walk $S(n)=a_1 X_1+\dots+a_n X_n$, based on a sequence of positive numbers ${\bf a}=(a_1,a_2,\dots)$ and a Rademacher sequence $X_1,X_2,\dots$. We study recurrence/transience (properly defined) of such walks for various sequences of ${\bf a}$. In particular, we establish the classification in the cases where $a_k=\lfloor k^β\rfloor$, $β>0$, as well as in the case $a_k=\lceil \log_γk \rceil$ or $a_k=\log_γk$ for $γ>1$.
Submitted 13 October, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
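As a rough illustration of the ${\bf a}$-walk defined in the abstract above, the following sketch simulates $S(n)=a_1 X_1+\dots+a_n X_n$ with $a_k=\lfloor k^β\rfloor$ and independent fair signs $X_k=\pm 1$, then counts returns to zero along one sample path. The function names, the seed, and the choices $n=1000$ and $β=0.5$ are assumptions made for illustration only; the paper's precise definition of recurrence/transience is not reproduced here, so the count is only an informal proxy.

```python
import math
import random


def floor_power_weights(n, beta):
    """Weights a_k = floor(k**beta) for k = 1..n (illustrative helper)."""
    return [math.floor(k ** beta) for k in range(1, n + 1)]


def a_walk(weights, rng):
    """Return the partial sums S(1), ..., S(n) of sum a_k * X_k,
    where each X_k is an independent fair +1/-1 (Rademacher) sign."""
    position = 0
    path = []
    for a_k in weights:
        sign = 1 if rng.random() < 0.5 else -1
        position += sign * a_k
        path.append(position)
    return path


# Assumed parameters, chosen only for the demonstration.
rng = random.Random(0)
weights = floor_power_weights(1000, 0.5)
path = a_walk(weights, rng)

# Informal proxy for recurrence: how often the sampled path hits 0.
visits_to_zero = sum(1 for s in path if s == 0)
print(visits_to_zero)
```

A single path says nothing rigorous about recurrence; averaging the count over many seeds and several values of $β$ would give a slightly more informative picture.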
-
Simplicial structures in ecological networks
Authors:
Udit Raj,
Shashankaditya Upadhyay,
Moumita Karmakar,
Sudeepto Bhattacharya
Abstract:
An ecological network is a formal representation of a specific type of interaction in a corresponding ecosystem. Such networks have traditionally been modelled as encoding exclusively pairwise interactions among the fundamental units of ecosystems and have been represented and analysed using graph-theoretic methods. However, many real-world ecosystems may entertain non-binary, polyadic relations between their units, which cannot be captured by the pairwise interaction methods, but require higher-order interaction framework, and consequently the corresponding ecological networks cannot be modelled using graph-theoretic framework. This work gives a structural definition of ecological network suitable for modelling all orders of interactions between the fundamental units of the corresponding ecological system, including and going beyond the pairwise interaction framework. Carbon mediation between units of some select ecosystems are studied by modelling the corresponding ecological networks as simplicial complexes following the definition. The concept of graph centrality measure has been extended to simplicial centrality, and some important centrality measures of these networks at various structural levels of the complexes have been calculated. The centrality measures reveal valuable structural information including information about those vertices that are more likely to participate in higher-order interactions, as well as inform whether there is a difference in the ranks of vertices for these higher-order networks based on graph centrality and simplicial centrality measures.
Submitted 25 March, 2022;
originally announced March 2022.
-
Higher-order social-ecological network as a simplicial complex
Authors:
Sudeepto Bhattacharya
Abstract:
A social-ecological network is a formal representation of a corresponding social-ecological system, and encodes a relation within a given system as an interaction. Conventionally, such networks have been defined as encoding and representing pairwise interactions among the fundamental units of the system. This work proposes a combinatorial definition of social-ecological network by means of its structure as a simplicial complex. The proposed definition is a comprehensive one that takes into account the heterogeneity of interactions within a given SES, and the higher-order social-ecological network modelled using this definition is able to represent the modelled SES by capturing all orders of interactions within the system. Such a social-ecological network consequently, is better equipped to capture and represent the structural details of the real-world SES, and is thus capable of facilitating a deeper insight into the complex behaviour of the represented SES emergent through the higher-order interactions within the system, as compared to the conventional graph-theoretic network that exclusively models pairwise interactions.
Submitted 16 February, 2022; v1 submitted 31 December, 2021;
originally announced December 2021.
-
The lattice of nil-Hecke algebras over real and complex reflection groups
Authors:
Sutanay Bhattacharya,
Apoorva Khare
Abstract:
Associated to every complex reflection group, we construct a lattice of quotients of its braid monoid-algebra, which we term nil-Hecke algebras, and which are obtained by killing all braid words that are "sufficiently long", as well as some integer power of each generator. These include usual nil-Coxeter algebras, nil-Temperley-Lieb algebras, and their variants, and lead to symmetric semigroup module categories which necessarily cannot be monoidal.
Motivated by classical work of Coxeter (1957) and the Broue-Malle-Rouquier freeness conjecture [Crelle 1998], and continuing beyond work of the second author [Trans. Amer. Math. Soc. 2018], we obtain a complete classification of the finite-dimensional nil-Hecke algebras for all complex reflection groups $W$. These comprise the usual nil-Coxeter algebras for $W$ of finite type, their "fully commutative" analogues for $W$ of FC-finite type, three exceptional algebras (of types $F_4,H_3,H_4$), and three exceptional series (of types $B_n$ and $A_n$, two of them novel). In particular, we find the first - and only two - finite-dimensional nil-Hecke algebras over discrete complex reflection groups; this breaks from the nil-Coxeter case (where no braid words are further killed, and) where Marin [J. Pure Appl. Alg. 2014] and Khare [Trans. Amer. Math. Soc. 2018] showed that such algebras do not exist.
In addition to these algebras, and also algebraic connections (to PBW deformations and non-monoidal tensor categories), we further uncover combinatorial bases of algebras, both known (fully commutative elements) and novel ($\bar{12}$-avoiding signed permutations). Our classification draws from and brings together results of Popov [Comm. Math. Inst. Utrecht 1982], Stembridge [J. Alg. Combin. 1996, 1998], Malle [Transform. Groups 1996], Postnikov via Gowravaram-Khovanova (2015), Hart [J. Group Th. 2017], and Khare [Trans. Amer. Math. Soc. 2018].
Submitted 18 May, 2022; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Forcing minimal patterns of triods
Authors:
Sourav Bhattacharya
Abstract:
\emph{Rotation numbers} for some maps of \emph{triods} were introduced in \cite{BMR}. The goal of this paper is to study \emph{patterns} of \emph{triods} which don't force other \emph{patterns} with the same \emph{rotation number}, which we name \emph{triod twists}. We obtain their complete characterization and show that these \emph{patterns} can be conjugated to a \emph{circle rotation} by a \emph{piecewise monotone} map. We also describe the dynamics of \emph{unimodal triod twist patterns} with a given rational \emph{rotation number}.
Submitted 9 March, 2023; v1 submitted 13 November, 2021;
originally announced November 2021.
-
Sharp Signal Detection Under Ferromagnetic Ising Models
Authors:
Sohom Bhattacharya,
Rajarshi Mukherjee,
Gourab Ray
Abstract:
In this paper we study the effect of dependence on detecting a class of structured signals in Ferromagnetic Ising models. Natural examples of our class include Ising Models on lattices, and Mean-Field type Ising Models such as dense Erdős-Rényi, and dense random regular graphs. Our results not only provide sharp constants of detection in each of these cases and thereby pinpoint the precise relationship of the detection problem with the underlying dependence, but also demonstrate how to be agnostic over the strength of dependence present in the respective models.
Submitted 6 October, 2021;
originally announced October 2021.
-
Variational Bayes algorithm and posterior consistency of Ising model parameter estimation
Authors:
Minwoo Kim,
Shrijita Bhattacharya,
Tapabrata Maiti
Abstract:
Ising models originated in statistical physics and are widely used in modeling spatial data and computer vision problems. However, statistical inference for this model remains challenging due to the intractable nature of the normalizing constant in the likelihood. Here, we use a pseudo-likelihood instead to study Bayesian estimation of the two-parameter Ising model (inverse temperature and magnetization) with a fully specified coupling matrix. We develop a computationally efficient variational Bayes procedure for model estimation. Under the Gaussian mean-field variational family, we derive posterior contraction rates of the variational posterior obtained under the pseudo-likelihood. We also discuss the loss incurred by using the variational posterior in place of the true posterior for the pseudo-likelihood approach. Extensive simulation studies validate the efficacy of mean-field Gaussian and bivariate Gaussian families as possible choices of the variational family for inference of Ising model parameters.
Submitted 3 September, 2021;
originally announced September 2021.
-
Birkhoff-James extensions of continuous functions on metric spaces
Authors:
Saptak Bhattacharya
Abstract:
In this paper, we extend the investigations regarding Birkhoff-James orthogonality of linear operators to bounded continuous functions on metric spaces. We introduce Birkhoff-James extensions of continuous functions and study them in detail, in the separate contexts of compact and non-compact metric spaces. We conclude by discussing an application of our ideas to the study of Birkhoff-James orthogonality in $C(X)$ with the supremum norm, where $X$ is a compact metric space.
Submitted 28 August, 2021;
originally announced August 2021.
-
An ensemble of high rank matrices arising from tournaments
Authors:
Niranjan Balachandran,
Srimanta Bhattacharya,
Brahadeesh Sankarnarayanan
Abstract:
Suppose $\mathbb{F}$ is a field and let $\mathbf{a} := (a_1, a_2, \dotsc)$ be a sequence of non-zero elements in $\mathbb{F}$. For $\mathbf{a}_n := (a_1, \dotsc, a_n)$, we consider the family $\mathcal{M}_n(\mathbf{a})$ of $n \times n$ symmetric matrices $M$ over $\mathbb{F}$ with all diagonal entries zero and the $(i, j)$th element of $M$ either $a_i$ or $a_j$ for $i < j$. In this short paper, we show that all matrices in a certain subclass of $\mathcal{M}_n(\mathbf{a})$ -- which can be naturally associated with transitive tournaments -- have rank at least $\lfloor 2n/3 \rfloor - 1$. We also show that if $\operatorname{char}(\mathbb{F}) \neq 2$ and $M$ is a matrix chosen uniformly at random from $\mathcal{M}_n(\mathbf{a})$, then with high probability $\operatorname{rank}(M) \geq \bigl(\frac{1}{2} - o(1)\bigr)n$.
Submitted 15 July, 2023; v1 submitted 24 August, 2021;
originally announced August 2021.
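To make the family $\mathcal{M}_n(\mathbf{a})$ from the abstract above concrete, the sketch below draws one member at random and computes its rank over the rationals (a field of characteristic $0 \neq 2$). It assumes that flipping an independent fair coin for each pair $i < j$ is an adequate stand-in for "uniformly at random", and it uses a plain Gaussian elimination written for this example; the helper names, the seed, and the sequence $a = (1, \dotsc, 6)$ are all hypothetical choices, not taken from the paper.

```python
import random
from fractions import Fraction


def random_member(a, rng):
    """Draw a symmetric matrix with zero diagonal whose (i, j) entry,
    for i < j, is a[i] or a[j], chosen by an independent fair coin."""
    n = len(a)
    m = [[Fraction(0)] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            value = Fraction(a[i] if rng.random() < 0.5 else a[j])
            m[i][j] = value
            m[j][i] = value
    return m


def rank(matrix):
    """Rank via exact Gaussian elimination (entries should be Fractions)."""
    rows = [row[:] for row in matrix]
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    rank_count = 0
    for col in range(n_cols):
        # Find a row at or below the current pivot row with a nonzero entry.
        pivot = next(
            (r for r in range(rank_count, n_rows) if rows[r][col] != 0), None
        )
        if pivot is None:
            continue
        rows[rank_count], rows[pivot] = rows[pivot], rows[rank_count]
        pivot_value = rows[rank_count][col]
        for r in range(rank_count + 1, n_rows):
            factor = rows[r][col] / pivot_value
            if factor != 0:
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rank_count])]
        rank_count += 1
    return rank_count


# Assumed example sequence of non-zero elements and a fixed seed.
rng = random.Random(0)
a = [1, 2, 3, 4, 5, 6]
m = random_member(a, rng)
print(rank(m))
```

Repeating the draw for growing $n$ and comparing the observed ranks with $n/2$ is one informal way to see the high-probability bound stated in the abstract.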
-
Matrix completion with data-dependent missingness probabilities
Authors:
Sohom Bhattacharya,
Sourav Chatterjee
Abstract:
The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independently with probability $p$ and missing otherwise. This assumption may not be realistic for many applications. In this work, we replace it with the assumption that the probability that an entry is available is an unknown function $f$ of the entry itself. For example, if the entry is the rating given to a movie by a viewer, then it seems plausible that high value entries have greater probability of being available than low value entries. We propose two new estimators, based on singular value thresholding and nuclear norm minimization, to recover the matrix under this assumption. The estimators involve no tuning parameters, and are shown to be consistent under a low rank assumption. We also provide a consistent estimator of the unknown function $f$.
Submitted 22 April, 2022; v1 submitted 4 June, 2021;
originally announced June 2021.
-
On Some Bounds on the Perturbation of Invariant Subspaces of Normal Matrices with Application to a Graph Connection Problem
Authors:
Subhrajit Bhattacharya
Abstract:
We provide upper bounds on the perturbation of invariant subspaces of normal matrices, measured using a metric on the space of vector subspaces of $\mathbb{C}^n$, in terms of the spectrum of both the unperturbed and perturbed matrices, as well as in terms of the spectrum of the unperturbed matrix only. The results presented give tighter bounds than the Davis-Kahan $\sin\Theta$ theorem. We apply the result to a graph perturbation problem.
Submitted 19 June, 2021; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Monotonicity of the over-rotation intervals for bimodal maps
Authors:
Sourav Bhattacharya,
Alexander Blokh
Abstract:
We show the connectedness of the set of parameters for which the over-rotation interval of a bimodal interval map is constant. In other words, the over-rotation interval is a monotone function of a bimodal interval map.
Submitted 4 March, 2021;
originally announced March 2021.
-
Unicritical Laminations
Authors:
Sourav Bhattacharya,
Alexander Blokh,
Dierk Schleicher
Abstract:
Thurston introduced \emph{invariant (quadratic) laminations} in his 1984 preprint as a vehicle for understanding the connected Julia sets and the parameter space of quadratic polynomials. Important ingredients of his analysis of the angle doubling map $σ_2$ on the unit circle $\mathbb{S}^1$ were the Central Strip Lemma, non-existence of wandering polygons, the transitivity of the first return map on vertices of periodic polygons, and the non-crossing of minors of quadratic invariant laminations. We use Thurston's methods to prove similar results for \emph{unicritical} laminations of arbitrary degree $d$ and to show that the set of so-called \emph{minors} of unicritical laminations themselves form a \emph{Unicritical Minor Lamination} $\mathrm{UML}_d$. In the end we verify the \emph{Fatou conjecture} for the unicritical laminations and extend the \emph{Lavaurs algorithm} onto $\mathrm{UML}_d$.
Submitted 20 January, 2021;
originally announced January 2021.
-
Geometric ergodicity of Gibbs samplers for the Horseshoe and its regularized variants
Authors:
Suman K. Bhattacharya,
Kshitij Khare,
Subhadip Pal
Abstract:
The Horseshoe is a widely used and popular continuous shrinkage prior for high-dimensional Bayesian linear regression. Recently, regularized versions of the Horseshoe prior have also been introduced in the literature. Various Gibbs sampling Markov chains have been developed in the literature to generate approximate samples from the corresponding intractable posterior densities. Establishing geometric ergodicity of these Markov chains provides crucial technical justification for the accuracy of asymptotic standard errors for Markov chain based estimates of posterior quantities. In this paper, we establish geometric ergodicity for various Gibbs samplers corresponding to the Horseshoe prior and its regularized variants in the context of linear regression. First, we establish geometric ergodicity of a Gibbs sampler for the original Horseshoe posterior under strictly weaker conditions than existing analyses in the literature. Second, we consider the regularized Horseshoe prior introduced in Piironen and Vehtari (2017), and prove geometric ergodicity for a Gibbs sampling Markov chain to sample from the corresponding posterior without any truncation constraint on the global and local shrinkage parameters. Finally, we consider a variant of this regularized Horseshoe prior introduced in Nishimura and Suchard (2020), and again establish geometric ergodicity for a Gibbs sampling Markov chain to sample from the corresponding posterior.
Submitted 1 January, 2021;
originally announced January 2021.
-
Variational Bayes Neural Network: Posterior Consistency, Classification Accuracy and Computational Challenges
Authors:
Shrijita Bhattacharya,
Zihuan Liu,
Tapabrata Maiti
Abstract:
Bayesian neural network models (BNN) have re-surged in recent years due to the advancement of scalable computations and their utility in solving complex prediction problems in a wide variety of applications. Despite the popularity and usefulness of BNN, the conventional Markov Chain Monte Carlo based implementation suffers from high computational cost, limiting the use of this powerful technique in large scale studies. The variational Bayes inference has become a viable alternative to circumvent some of the computational issues. Although the approach is popular in machine learning, its application in statistics is somewhat limited. This paper develops a variational Bayesian neural network estimation methodology and related statistical theory. The numerical algorithms and their implementation are discussed in detail. The theory for posterior consistency, a desirable property in nonparametric Bayesian statistics, is also developed. This theory provides an assessment of prediction accuracy and guidelines for characterizing the prior distributions and variational family. The loss of using a variational posterior over the true posterior has also been quantified. The development is motivated by an important biomedical engineering application, namely building predictive tools for the transition from mild cognitive impairment to Alzheimer's disease. The predictors are multi-modal and may involve complex interactive relations.
Submitted 18 November, 2020;
originally announced November 2020.
-
Function Optimization with Posterior Gaussian Derivative Process
Authors:
Sucharita Roy,
Sourabh Bhattacharya
Abstract:
In this article, we propose and develop a novel Bayesian algorithm for optimization of functions whose first and second partial derivatives are known. The basic premise is the Gaussian process representation of the function which induces a first derivative process that is also Gaussian. The Bayesian posterior solutions of the derivative process set equal to zero, given data consisting of suitable choices of input points in the function domain and their function values, emulate the stationary points of the function, which can be fine-tuned by setting restrictions on the prior in terms of the first and second derivatives of the objective function. These observations motivate us to propose a general and effective algorithm for function optimization that attempts to get closer to the true optima adaptively with in-built iterative stages. We provide theoretical foundation to this algorithm, proving almost sure convergence to the true optima as the number of iterative stages tends to infinity. The theoretical foundation hinges upon our proofs of almost sure uniform convergence of the posteriors associated with Gaussian and Gaussian derivative processes to the underlying function and its derivatives in appropriate fixed-domain infill asymptotics setups; rates of convergence are also available. We also provide Bayesian characterization of the number of optima using information inherent in our optimization algorithm. We illustrate our Bayesian optimization algorithm with five different examples involving maxima, minima, saddle points and even inconclusiveness. Our examples range from simple, one-dimensional problems to challenging 50 and 100-dimensional problems.
Submitted 26 October, 2020;
originally announced October 2020.
-
Quantile Regression Neural Networks: A Bayesian Approach
Authors:
Sanket R. Jantre,
Shrijita Bhattacharya,
Tapabrata Maiti
Abstract:
This article introduces a Bayesian neural network estimation method for quantile regression assuming an asymmetric Laplace distribution (ALD) for the response variable. It is shown that the posterior distribution for feedforward neural network quantile regression is asymptotically consistent under a misspecified ALD model. This consistency proof embeds the problem from density estimation domain and uses bounds on the bracketing entropy to derive the posterior consistency over Hellinger neighborhoods. This consistency result is shown in the setting where the number of hidden nodes grows with the sample size. The Bayesian implementation utilizes the normal-exponential mixture representation of the ALD density. The algorithm uses a Markov chain Monte Carlo (MCMC) simulation technique: Gibbs sampling coupled with the Metropolis-Hastings algorithm. We have addressed the issue of complexity associated with the aforementioned MCMC implementation in the context of chain convergence, choice of starting values, and step sizes. We have illustrated the proposed method with simulation studies and real data examples.
Submitted 28 September, 2020;
originally announced September 2020.
-
Bayesian Appraisal of Random Series Convergence with Application to Climate Change
Authors:
Sucharita Roy,
Sourabh Bhattacharya
Abstract:
Roy and Bhattacharya (2020) provided Bayesian characterization of infinite series, and their most important application, namely, to the Dirichlet series characterizing the (in)famous Riemann Hypothesis, revealed insights that are not in support of the most celebrated conjecture for over 150 years.
In contrast with deterministic series considered by Roy and Bhattacharya (2020), in this article we take up random infinite series for our investigation. Remarkably, our method does not require any simplifying assumption. Albeit the Bayesian characterization theory for random series is no different from that for the deterministic setup, construction of effective upper bounds for partial sums, required for implementation, turns out to be a challenging undertaking in the random setup. In this article, we construct parametric and nonparametric upper bound forms for the partial sums of random infinite series and demonstrate the generality of the latter in comparison to the former. Simulation studies exhibit high accuracy and efficiency of the nonparametric bound in all the setups that we consider.
Finally, exploiting the property that the summands tend to zero in the case of series convergence, we consider application of our nonparametric bound driven Bayesian method to global climate change analysis. Specifically, analyzing the global average temperature record over the years 1850--2016 and Holocene global average temperature reconstruction data 12,000 years before present, we conclude, in spite of the current global warming situation, that global climate dynamics is subject to temporary variability only, the current global warming being an instance, and long term global warming or cooling either in the past or in the future, are highly unlikely.
Submitted 14 September, 2020;
originally announced September 2020.
-
Community structures in simplicial complexes: an application to wildlife corridor designing in Central India -- Eastern Ghats landscape complex, India
Authors:
Saurabh Shanu,
Shashankaditya Upadhyay,
Arijit Roy,
Raghunandan Chundawat,
Sudeepto Bhattacharya
Abstract:
The concept of simplicial complex from Algebraic Topology is applied to understand and model the flow of genetic information, processes and organisms between the areas of unimpaired habitats to design a network of wildlife corridors for tigers (Panthera tigris tigris) in the Central India Eastern Ghats landscape complex. The work extends and improves on a previous work that has made use of the concept of minimum spanning tree obtained from the weighted graph in the focal landscape, which suggested a viable corridor network for the tiger population of the Protected Areas (PAs) in the landscape complex. Centralities of the network identify the habitat patches and the critical parameters that are central to the process of tiger movement across the network. We extend the concept of vertex centrality to that of the simplicial centrality yielding inter-vertices adjacency and connection. As a result, the ecological information propagates expeditiously and even on a local scale in these networks representing a well-integrated and self-explanatory model as a community structure. A simplicial complex network based on the network centralities calculated in the landscape matrix presents a tiger corridor network in the landscape complex that is proposed to correspond better to reality than the previously proposed model. Because of the aforementioned functional and structural properties of the network, the work proposes an ecological network of corridors for the most tenable usage by the tiger populations both in the PAs and outside the PAs in the focal landscape.
Submitted 6 August, 2020;
originally announced August 2020.
-
A Bayesian Multiple Testing Paradigm for Model Selection in Inverse Regression Problems
Authors:
Debashis Chatterjee,
Sourabh Bhattacharya
Abstract:
In this article, we propose a novel Bayesian multiple testing formulation for model and variable selection in inverse setups, judiciously embedding the idea of inverse reference distributions proposed by Bhattacharya (2013) in a mixture framework consisting of the competing models. We develop the theory and methods in the general context encompassing parametric and nonparametric competing models, dependent data, as well as misspecifications. Our investigation shows that asymptotically the multiple testing procedure almost surely selects the best possible inverse model that minimizes the minimum Kullback-Leibler divergence from the true model. We also show that the error rates, namely, versions of the false discovery rate and the false non-discovery rate converge to zero almost surely as the sample size goes to infinity. Asymptotic α-control of versions of the false discovery rate and its impact on the convergence of false non-discovery rate versions, are also investigated.
Our simulation experiments involve small sample based selection among inverse Poisson log regression and inverse geometric logit and probit regression, where the regressions are either linear or based on Gaussian processes. Additionally, variable selection is also considered. Our multiple testing results turn out to be very encouraging in the sense of selecting the best models in all the non-misspecified and misspecified cases.
Submitted 15 July, 2020;
originally announced July 2020.