
Showing 1–22 of 22 results for author: Lyzinski, V

Searching in archive cs.
  1. arXiv:2506.02825  [pdf, ps, other]

    stat.ML cs.LG

    Asymptotically perfect seeded graph matching without edge correlation (and applications to inference)

    Authors: Tong Qi, Vera Andersson, Peter Viechnicki, Vince Lyzinski

    Abstract: We present the OmniMatch algorithm for seeded multiple graph matching. In the setting of $d$-dimensional Random Dot Product Graphs (RDPG), we prove that under mild assumptions, OmniMatch with $s$ seeds asymptotically and efficiently perfectly aligns $O(s^α)$ unseeded vertices -- for $α<2\wedge d/4$ -- across multiple networks even in the presence of no edge correlation. We demonstrate the effectiv…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 10 figures, 35 pages

  2. arXiv:2506.00077  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Gaussian mixture models as a proxy for interacting language models

    Authors: Edward L. Wang, Tianyu Wang, Avanti Athreya, Vince Lyzinski, Carey E. Priebe

    Abstract: Large language models (LLMs) are a powerful tool with the ability to match human capabilities and behavior in many settings. Retrieval-augmented generation (RAG) further allows LLMs to generate diverse output depending on the contents of their RAG database. This motivates their use in the social sciences to study human behavior between individuals when large-scale experiments are infeasible. Howev…

    Submitted 3 June, 2025; v1 submitted 29 May, 2025; originally announced June 2025.

    MSC Class: 62R07

  3. arXiv:2409.17544  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Optimizing the Induced Correlation in Omnibus Joint Graph Embeddings

    Authors: Konstantinos Pantazis, Michael Trosset, William N. Frost, Carey E. Priebe, Vince Lyzinski

    Abstract: Theoretical and empirical evidence suggests that joint graph embedding algorithms induce correlation across the networks in the embedding space. In the Omnibus joint graph embedding framework, previous results explicitly delineated the dual effects of the algorithm-induced and model-inherent correlations on the correlation across the embedded networks. Accounting for and mitigating the algorithm-i…

    Submitted 30 September, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: 34 pages, 8 figures

  4. arXiv:2308.13451  [pdf, other]

    stat.ML cs.LG math.CO stat.AP stat.ME

    Gotta match 'em all: Solution diversification in graph matching matched filters

    Authors: Zhirui Li, Ben Johnson, Daniel L. Sussman, Carey E. Priebe, Vince Lyzinski

    Abstract: We present a novel approach for finding multiple noisily embedded template graphs in a very large background graph. Our method builds upon the graph-matching-matched-filter technique proposed in Sussman et al., with the discovery of multiple diverse matchings being achieved by iteratively penalizing a suitable node-pair similarity matrix in the matched filter algorithm. In addition, we propose alg…

    Submitted 4 July, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 27 pages, 12 figures, 3 tables

  5. arXiv:2208.09710  [pdf, other]

    stat.ML cs.IR cs.LG

    Adversarial contamination of networks in the setting of vertex nomination: a new trimming method

    Authors: Sheyda Peyman, Minh Tang, Vince Lyzinski

    Abstract: As graph data becomes more ubiquitous, the need for robust inferential graph algorithms to operate in these complex data domains is crucial. In many cases of interest, inference is further complicated by the presence of adversarial data contamination. The effect of the adversary is frequently to change the data distribution in ways that negatively affect statistical and algorithmic performance. We…

    Submitted 20 August, 2022; originally announced August 2022.

  6. arXiv:2205.03486  [pdf, other]

    stat.ML cs.LG stat.ME

    Clustered Graph Matching for Label Recovery and Graph Classification

    Authors: Zhirui Li, Jesus Arroyo, Konstantinos Pantazis, Vince Lyzinski

    Abstract: Given a collection of vertex-aligned networks and an additional label-shuffled network, we propose procedures for leveraging the signal in the vertex-aligned collection to recover the labels of the shuffled network. We consider matching the shuffled network to averages of the networks in the vertex-aligned collection at different levels of granularity. We demonstrate both in theory and practice th…

    Submitted 29 March, 2023; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: 22 pages, 8 figures, 5 tables

  7. arXiv:2112.12316  [pdf, ps, other]

    cs.IT

    Signed and Unsigned Partial Information Decompositions of Continuous Network Interactions

    Authors: Jesse Milzman, Vince Lyzinski

    Abstract: We investigate the partial information decomposition (PID) framework as a tool for edge nomination. We consider both the $I_{\cap}^{\text{min}}$ and $I_{\cap}^{\text{PM}}$ PIDs, from arXiv:1004.2515 and arXiv:1801.09010 respectively, and we both numerically and analytically investigate the utility of these frameworks for discovering significant edge interactions. In the course of our work, we exte…

    Submitted 22 December, 2021; originally announced December 2021.

  8. arXiv:2106.12621  [pdf, other]

    cs.LG cs.IR stat.ME

    Leveraging semantically similar queries for ranking via combining representations

    Authors: Hayden S. Helm, Marah Abdin, Benjamin D. Pedigo, Shweti Mahajan, Vince Lyzinski, Youngser Park, Amitabh Basu, Piali Choudhury, Christopher M. White, Weiwei Yang, Carey E. Priebe

    Abstract: In modern ranking problems, different and disparate representations of the items to be ranked are often available. It is sensible, then, to try to combine these representations to improve ranking. Indeed, learning to rank via combining representations is both principled and practical for learning a ranking function for a particular query. In extremely data-scarce settings, however, the amount of l…

    Submitted 23 June, 2021; originally announced June 2021.

  9. arXiv:2101.12430  [pdf, other]

    cs.LG cs.IR cs.SI stat.ML

    Subgraph nomination: Query by Example Subgraph Retrieval in Networks

    Authors: Al-Fahad M. Al-Qadhi, Carey E. Priebe, Hayden S. Helm, Vince Lyzinski

    Abstract: This paper introduces the subgraph nomination inference task, in which example subgraphs of interest are used to query a network for similarly interesting subgraphs. This type of problem appears time and again in real world problems connected to, for example, user recommendation systems and structural retrieval tasks in social and biological/connectomic networks. We formally define the subgraph no…

    Submitted 19 December, 2022; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: 37 pages, 11 figures

  10. arXiv:2010.14622  [pdf, other]

    cs.SI stat.ME

    Vertex nomination between graphs via spectral embedding and quadratic programming

    Authors: Runbing Zheng, Vince Lyzinski, Carey E. Priebe, Minh Tang

    Abstract: Given a network and a subset of interesting vertices whose identities are only partially known, the vertex nomination problem seeks to rank the remaining vertices in such a way that the interesting vertices are ranked at the top of the list. An important variant of this problem is vertex nomination in the multi-graphs setting. Given two graphs $G_1, G_2$ with common vertices and a vertex of intere…

    Submitted 27 March, 2022; v1 submitted 24 October, 2020; originally announced October 2020.

  11. arXiv:2005.02151  [pdf, other]

    cs.IR cs.LG math.ST stat.ML

    Vertex Nomination in Richly Attributed Networks

    Authors: Keith Levin, Carey E. Priebe, Vince Lyzinski

    Abstract: Vertex nomination is a lightly-supervised network information retrieval task in which vertices of interest in one graph are used to query a second graph to discover vertices of interest in the second graph. Similar to other information retrieval tasks, the output of a vertex nomination scheme is a ranked list of the vertices in the second graph, with the heretofore unknown vertices of interest ide…

    Submitted 4 May, 2023; v1 submitted 29 April, 2020; originally announced May 2020.

    Comments: 46 pages, 5 figures

  12. arXiv:2002.01648  [pdf, other]

    stat.ML cs.LG stat.ME

    Graph matching between bipartite and unipartite networks: to collapse, or not to collapse, that is the question

    Authors: Jesús Arroyo, Carey E. Priebe, Vince Lyzinski

    Abstract: Graph matching consists of aligning the vertices of two unlabeled graphs in order to maximize the shared structure across networks; when the graphs are unipartite, this is commonly formulated as minimizing their edge disagreements. In this paper, we address the common setting in which one of the graphs to match is a bipartite network and one is unipartite. Commonly, the bipartite networks are coll…

    Submitted 12 April, 2021; v1 submitted 5 February, 2020; originally announced February 2020.

  13. arXiv:1908.02572  [pdf, other]

    cs.SI math.CO

    Multiplex graph matching matched filters

    Authors: Konstantinos Pantazis, Daniel L. Sussman, Youngser Park, Zhirui Li, Carey E. Priebe, Vince Lyzinski

    Abstract: We consider the problem of detecting a noisy induced multiplex template network in a larger multiplex background network. Our approach, which extends the framework of Sussman et al. (2019) to the multiplex setting, leverages a multiplex analogue of the classical graph matching problem to use the template as a matched filter for efficiently searching the background for candidate template matches. T…

    Submitted 3 December, 2021; v1 submitted 22 July, 2019; originally announced August 2019.

    Comments: 27 pages, 10 figures

  14. arXiv:1905.01776  [pdf, other]

    stat.ML cs.LG cs.SI stat.CO

    Vertex Nomination, Consistent Estimation, and Adversarial Modification

    Authors: Joshua Agterberg, Youngser Park, Jonathan Larson, Christopher White, Carey E. Priebe, Vince Lyzinski

    Abstract: Given a pair of graphs $G_1$ and $G_2$ and a vertex set of interest in $G_1$, the vertex nomination (VN) problem seeks to find the corresponding vertices of interest in $G_2$ (if they exist) and produce a rank list of the vertices in $G_2$, with the corresponding vertices of interest in $G_2$ concentrating, ideally, at the top of the rank list. In this paper, we define and derive the analogue of B…

    Submitted 14 April, 2020; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: 34 pages, 8 figures

  15. arXiv:1812.10519  [pdf, other]

    stat.ML cs.LG math.ST

    Maximum Likelihood Estimation and Graph Matching in Errorfully Observed Networks

    Authors: Jesús Arroyo, Daniel L. Sussman, Carey E. Priebe, Vince Lyzinski

    Abstract: Given a pair of graphs with the same number of vertices, the inexact graph matching problem consists in finding a correspondence between the vertices of these graphs that minimizes the total number of induced edge disagreements. We study this problem from a statistical framework in which one of the graphs is an errorfully observed copy of the other. We introduce a corrupting channel model, and sho…

    Submitted 2 July, 2020; v1 submitted 26 December, 2018; originally announced December 2018.

  16. On a 'Two Truths' Phenomenon in Spectral Graph Clustering

    Authors: Carey E. Priebe, Youngser Park, Joshua T. Vogelstein, John M. Conroy, Vince Lyzinski, Minh Tang, Avanti Athreya, Joshua Cape, Eric Bridgeford

    Abstract: Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. Spectral graph clustering - clustering the vertices of a graph based on their spectral embedding - is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian or Adjacency spectral embedding (LSE or ASE). Recent theoretical resu…

    Submitted 11 February, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

    Journal ref: PNAS 116 (2019) 5995-6000

  17. arXiv:1803.02423  [pdf, other]

    stat.ML cs.DS

    Matched Filters for Noisy Induced Subgraph Detection

    Authors: Daniel L. Sussman, Youngser Park, Carey E. Priebe, Vince Lyzinski

    Abstract: The problem of finding the vertex correspondence between two noisy graphs with different number of vertices where the smaller graph is still large has many applications in social networks, neuroscience, and computer vision. We propose a solution to this problem via a graph matching matched filter: centering and padding the smaller adjacency matrix and applying graph matching methods to align it to…

    Submitted 1 July, 2019; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: 41 pages, 7 figures

  18. arXiv:1705.02294  [pdf, other]

    math.ST cs.SI

    Matchability of heterogeneous networks pairs

    Authors: Vince Lyzinski, Daniel L. Sussman

    Abstract: We consider the problem of graph matchability in non-identically distributed networks. In a general class of edge-independent networks, we demonstrate that graph matchability can be lost with high probability when matching the networks directly. We further demonstrate that under mild model assumptions, matchability is almost perfectly recovered by centering the networks using Universal Singular Va…

    Submitted 20 March, 2019; v1 submitted 5 May, 2017; originally announced May 2017.

    Comments: 44 pages, 10 figures

  19. arXiv:1605.02315  [pdf, other]

    stat.ML cs.IT math.CO

    Information Recovery in Shuffled Graphs via Graph Matching

    Authors: Vince Lyzinski

    Abstract: While many multiple graph inference methodologies operate under the implicit assumption that an explicit vertex correspondence is known across the vertex sets of the graphs, in practice these correspondences may only be partially or errorfully known. Herein, we provide an information theoretic foundation for understanding the practical impact that errorfully observed vertex correspondences can hav…

    Submitted 27 September, 2017; v1 submitted 8 May, 2016; originally announced May 2016.

    Comments: 55 pages, 6 figures

  20. Semi-External Memory Sparse Matrix Multiplication for Billion-Node Graphs

    Authors: Da Zheng, Disa Mhembere, Vince Lyzinski, Joshua Vogelstein, Carey E. Priebe, Randal Burns

    Abstract: Sparse matrix multiplication is traditionally performed in memory and scales to large matrices using the distributed memory of multiple nodes. In contrast, we scale sparse matrix multiplication beyond memory capacity by implementing sparse matrix dense matrix multiplication (SpMM) in a semi-external memory (SEM) fashion; i.e., we keep the sparse matrix on commodity SSDs and dense matrices in memor…

    Submitted 14 October, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: published in IEEE Transactions on Parallel and Distributed Systems

  21. arXiv:1508.04422  [pdf, other]

    stat.ML cs.LG cs.NE stat.ME

    Scalable Out-of-Sample Extension of Graph Embeddings Using Deep Neural Networks

    Authors: Aren Jansen, Gregory Sell, Vince Lyzinski

    Abstract: Several popular graph embedding techniques for representation learning and dimensionality reduction rely on performing computationally expensive eigendecompositions to derive a nonlinear transformation of the input data space. The resulting eigenvectors encode the embedding coordinates for the training samples only, and so the embedding of novel data samples requires further costly computation. In…

    Submitted 14 June, 2016; v1 submitted 18 August, 2015; originally announced August 2015.

    Comments: 10 pages, 2 figures, 1 table, this paper is under consideration for publication in Pattern Recognition Letters

  22. arXiv:1112.5507  [pdf, other]

    math.OC cs.DS q-bio.NC

    Fast Approximate Quadratic Programming for Large (Brain) Graph Matching

    Authors: Joshua T. Vogelstein, John M. Conroy, Vince Lyzinski, Louis J. Podrazik, Steven G. Kratzer, Eric T. Harley, Donniell E. Fishkind, R. Jacob Vogelstein, Carey E. Priebe

    Abstract: Quadratic assignment problems (QAPs) arise in a wide variety of domains, ranging from operations research to graph theory to computer vision to neuroscience. In the age of big data, graph valued data is becoming more prominent, and with it, a desire to run algorithms on ever larger graphs. Because QAP is NP-hard, exact algorithms are intractable. Approximate algorithms necessarily employ an accura…

    Submitted 13 September, 2014; v1 submitted 22 December, 2011; originally announced December 2011.

    Comments: 17 pages, 5 figures, 2 tables