
Showing 1–15 of 15 results for author: Hajek, B

Searching in archive stat.
  1. arXiv:2504.16279  [pdf, ps, other]

    math.ST cs.IT stat.AP

    Detecting Correlation between Multiple Unlabeled Gaussian Networks

    Authors: Taha Ameen, Bruce Hajek

    Abstract: This paper studies the hypothesis testing problem to determine whether m > 2 unlabeled graphs with Gaussian edge weights are correlated under a latent permutation. Previously, a sharp detection threshold for the correlation parameter ρ was established by Wu, Xu and Yu for this problem when m = 2. Presently, their result is leveraged to derive necessary and sufficient conditions for general m. In do…

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 7 pages, appearing at IEEE ISIT 2025

  2. arXiv:2310.18543  [pdf, other]

    math.ST stat.AP

    Robust Graph Matching when Nodes are Corrupt

    Authors: Taha Ameen, Bruce Hajek

    Abstract: Two models are introduced to investigate graph matching in the presence of corrupt nodes. The weak model, inspired by biological networks, allows one or both networks to have a positive fraction of molecular entities interact randomly with their network. For this model, it is shown that no estimator can correctly recover a positive fraction of the corrupt nodes. Necessary conditions for any estima…

    Submitted 31 October, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 31 pages, 1 figure

  3. Regenerative Particle Thompson Sampling

    Authors: Zeyu Zhou, Bruce Hajek, Nakjung Choi, Anwar Walid

    Abstract: This paper proposes regenerative particle Thompson sampling (RPTS), a flexible variation of Thompson sampling. Thompson sampling itself is a Bayesian heuristic for solving stochastic bandit problems, but it is hard to implement in practice due to the intractability of maintaining a continuous posterior distribution. Particle Thompson sampling (PTS) is an approximation of Thompson sampling obtained…

    Submitted 22 January, 2024; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Main body 14 pages, appendix 32 pages, 16 figures

    Journal ref: "Particle Thompson Sampling with Static Particles" and "Improving Particle Thompson Sampling through Regenerative Particles," 2023 57th Annual Conference on Information Sciences and Systems (CISS), Baltimore, MD, USA, 2023

  4. arXiv:1801.06818  [pdf, other]

    stat.ML cs.SI math.PR

    Community Recovery in a Preferential Attachment Graph

    Authors: Bruce Hajek, Suryanarayana Sankagiri

    Abstract: A message passing algorithm is derived for recovering communities within a graph generated by a variation of the Barabási-Albert preferential attachment model. The estimator is assumed to know the arrival times, or order of attachment, of the vertices. The derivation of the algorithm is based on belief propagation under an independence assumption. Two precursors to the message passing algorithm ar…

    Submitted 20 July, 2018; v1 submitted 21 January, 2018; originally announced January 2018.

    Comments: arXiv admin note: text overlap with arXiv:1801.06816

  5. arXiv:1801.06816   

    stat.ML math.PR

    Preferential Attachment Graphs with Planted Communities

    Authors: Bruce Hajek, Suryanarayana Sankagiri

    Abstract: A variation of the preferential attachment random graph model of Barabási and Albert is defined that incorporates planted communities. The graph is built progressively, with new vertices attaching to the existing ones one-by-one. At every step, the incoming vertex is randomly assigned a label, which represents a community it belongs to. This vertex then chooses certain vertices as its neighbors, w…

    Submitted 27 January, 2018; v1 submitted 21 January, 2018; originally announced January 2018.

    Comments: Discovered large overlap with J. Jordan, Geometric preferential attachment in non-uniform metric spaces (2013) Electronic J. Prob, Vol. 18, no. 8, pp 1-15. New aspects of our approach will be moved to: Recovering a Hidden Community in a Preferential Attachment Graph, arXiv:1801.06818

  6. arXiv:1602.06410  [pdf, other]

    stat.ML cs.IT cs.SI math.ST

    Semidefinite Programs for Exact Recovery of a Hidden Community

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: We study a semidefinite programming (SDP) relaxation of the maximum likelihood estimation for exactly recovering a hidden community of cardinality $K$ from an $n \times n$ symmetric data matrix $A$, where for distinct indices $i,j$, $A_{ij} \sim P$ if $i, j$ are both in the community and $A_{ij} \sim Q$ otherwise, for two known probability distributions $P$ and $Q$. We identify a sufficient condit…

    Submitted 3 June, 2016; v1 submitted 20 February, 2016; originally announced February 2016.

  7. arXiv:1510.09219  [pdf, other]

    stat.ML cs.IT cs.SI math.PR math.ST

    Submatrix localization via message passing

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: The principal submatrix localization problem deals with recovering a $K\times K$ principal submatrix of elevated mean $\mu$ in a large $n\times n$ symmetric matrix subject to additive standard Gaussian noise. This problem serves as a prototypical example for community detection, in which the community corresponds to the support of the submatrix. The main result of this paper is that in the regime…

    Submitted 30 October, 2015; originally announced October 2015.

  8. arXiv:1510.02786  [pdf, ps, other]

    stat.ML cs.CC cs.SI math.PR

    Recovering a Hidden Community Beyond the Kesten-Stigum Threshold in $O(|E| \log^*|V|)$ Time

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: Community detection is considered for a stochastic block model graph of n vertices, with K vertices in the planted community, edge probability p for pairs of vertices both in the community, and edge probability q for other pairs of vertices. The main focus of the paper is on weak recovery of the community based on the graph G, with o(K) misclassified vertices on average, in the sublinear regime…

    Submitted 15 January, 2018; v1 submitted 9 October, 2015; originally announced October 2015.

    Comments: New title replaces spectral limit by Kesten-Stigum threshold

  9. arXiv:1509.07859  [pdf, ps, other]

    stat.ML cs.IT

    Information Limits for Recovering a Hidden Community

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: We study the problem of recovering a hidden community of cardinality $K$ from an $n \times n$ symmetric data matrix $A$, where for distinct indices $i,j$, $A_{ij} \sim P$ if $i, j$ both belong to the community and $A_{ij} \sim Q$ otherwise, for two known probability distributions $P$ and $Q$ depending on $n$. If $P={\rm Bern}(p)$ and $Q={\rm Bern}(q)$ with $p>q$, it reduces to the problem of findi…

    Submitted 24 January, 2016; v1 submitted 25 September, 2015; originally announced September 2015.

    Comments: v2 establishes information limits of both weak and exact recovery with sharp constants for general P and Q

  10. arXiv:1502.07738  [pdf, ps, other]

    stat.ML cs.SI math.PR

    Achieving Exact Cluster Recovery Threshold via Semidefinite Programming: Extensions

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: Resolving a conjecture of Abbe, Bandeira and Hall, the authors have recently shown that the semidefinite programming (SDP) relaxation of the maximum likelihood estimator achieves the sharp threshold for exactly recovering the community structure under the binary stochastic block model of two equal-sized clusters. The same was shown for the case of a single cluster and outliers. Extending the proof…

    Submitted 14 June, 2016; v1 submitted 26 February, 2015; originally announced February 2015.

    Comments: This paper was accepted to IEEE Transactions on Information Theory on April 25, 2016. The material was presented in part at the 2015 49th Asilomar Conference on Signals, Systems and Computers and the 2015 IEEE Information Theory Workshop. This work was also in part presented at the Workshop on Community Detection, February 26-27, Institut Henri Poincaré, Paris

  11. arXiv:1502.04631  [pdf, other]

    stat.ML

    Clustering and Inference From Pairwise Comparisons

    Authors: Rui Wu, Jiaming Xu, R. Srikant, Laurent Massoulié, Marc Lelarge, Bruce Hajek

    Abstract: Given a set of pairwise comparisons, the classical ranking problem computes a single ranking that best represents the preferences of all users. In this paper, we study the problem of inferring individual preferences, arising in the context of making personalized recommendations. In particular, we assume that there are $n$ users of $r$ types; users of the same type provide similar pairwise comparis…

    Submitted 17 December, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

    Comments: Corrected typos in the abstract

  12. arXiv:1412.6156  [pdf, other]

    stat.ML cs.DS math.PR

    Achieving Exact Cluster Recovery Threshold via Semidefinite Programming

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: The binary symmetric stochastic block model deals with a random graph of $n$ vertices partitioned into two equal-sized clusters, such that each pair of vertices is connected independently with probability $p$ within clusters and $q$ across clusters. In the asymptotic regime of $p=a \log n/n$ and $q=b \log n/n$ for fixed $a,b$ and $n \to \infty$, we show that the semidefinite programming relaxation…

    Submitted 5 January, 2016; v1 submitted 24 November, 2014; originally announced December 2014.

    Comments: This paper was accepted to IEEE Transactions on Information Theory on January 3, 2016

  13. arXiv:1406.6625  [pdf, ps, other]

    math.ST cs.CC stat.ML

    Computational Lower Bounds for Community Detection on Random Graphs

    Authors: Bruce Hajek, Yihong Wu, Jiaming Xu

    Abstract: This paper studies the problem of detecting the presence of a small dense community planted in a large Erdős-Rényi random graph $\mathcal{G}(N,q)$, where the edge probability within the community exceeds $q$ by a constant factor. Assuming the hardness of the planted clique detection problem, we show that the computational complexity of detecting the community exhibits the following phase transitio…

    Submitted 11 March, 2015; v1 submitted 25 June, 2014; originally announced June 2014.

    Comments: 28 pages

  14. arXiv:1406.5638  [pdf, other]

    stat.ML math.ST

    Minimax-optimal Inference from Partial Rankings

    Authors: Bruce Hajek, Sewoong Oh, Jiaming Xu

    Abstract: This paper studies the problem of inferring a global preference based on the partial rankings provided by many users over different subsets of items according to the Plackett-Luce model. A question of particular interest is how to optimally assign items to users for ranking and how many item assignments are needed to achieve a target estimation error. For a given assignment of items to users, we f…

    Submitted 21 June, 2014; originally announced June 2014.

    Comments: 16 pages, 2 figures

  15. arXiv:1310.0512  [pdf, other]

    stat.ML

    Jointly Clustering Rows and Columns of Binary Matrices: Algorithms and Trade-offs

    Authors: Jiaming Xu, Rui Wu, Kai Zhu, Bruce Hajek, R. Srikant, Lei Ying

    Abstract: In standard clustering problems, data points are represented by vectors, and by stacking them together, one forms a data matrix with row or column cluster structure. In this paper, we consider a class of binary matrices, arising in many applications, which exhibit both row and column cluster structure, and our goal is to exactly recover the underlying row and column clusters by observing only a sm…

    Submitted 4 February, 2014; v1 submitted 1 October, 2013; originally announced October 2013.