Graph Summarization via Node Grouping: A Spectral Algorithm

Merchant, Arpit; Mathioudakis, Michael; Wang, Yanhao

doi:10.1145/3539597.3570441

arXiv:2211.04169 (cs)

[Submitted on 8 Nov 2022]

Title: Graph Summarization via Node Grouping: A Spectral Algorithm

Authors: Arpit Merchant, Michael Mathioudakis, Yanhao Wang

Abstract: Graph summarization via node grouping is a popular method to build concise graph representations by grouping nodes from the original graph into supernodes and encoding edges into superedges such that the loss of adjacency information is minimized. Such summaries have immense applications in large-scale graph analytics due to their small size and high query processing efficiency. In this paper, we reformulate the loss minimization problem for summarization into an equivalent integer maximization problem. By initially allowing relaxed (fractional) solutions for integer maximization, we analytically expose the underlying connections to the spectral properties of the adjacency matrix. Consequently, we design an algorithm called SpecSumm that consists of two phases. In the first phase, motivated by spectral graph theory, we apply k-means clustering on the eigenvectors of the adjacency matrix corresponding to its k largest (in magnitude) eigenvalues to assign nodes to supernodes. In the second phase, we propose a greedy heuristic that updates the initial assignment to further improve summary quality. Finally, via extensive experiments on 11 datasets, we show that SpecSumm efficiently produces high-quality summaries compared to state-of-the-art summarization algorithms and scales to graphs with millions of nodes.
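
The first phase described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `spectral_grouping` and `kmeans` are hypothetical, the graph is assumed to be undirected (symmetric adjacency matrix), a plain Lloyd's k-means written in NumPy stands in for whatever clustering routine SpecSumm uses, and a dense eigendecomposition is used for brevity, which would not scale to the graph sizes mentioned above. The second-phase greedy refinement is not shown.

```python
import numpy as np


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means with farthest-point initialisation (illustrative)."""
    rng = np.random.default_rng(seed)
    # Start from one random point, then repeatedly add the point that is
    # farthest from the centers chosen so far.
    centers = [points[rng.integers(len(points))]]
    for _ in range(1, k):
        sq_dist = np.min([((points - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(points[np.argmax(sq_dist)])
    centers = np.array(centers)

    for _ in range(n_iter):
        # Assign every point to its nearest center.
        sq_dist = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = sq_dist.argmin(axis=1)
        # Recompute centers; an empty cluster keeps its previous center.
        updated = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(updated, centers):
            break
        centers = updated
    return labels


def spectral_grouping(adj, k):
    """Assign each node to one of k supernodes (phase one only, as a sketch)."""
    adj = np.asarray(adj, dtype=float)
    # Assumption: adj is symmetric, so eigh applies and eigenvalues are real.
    eigvals, eigvecs = np.linalg.eigh(adj)
    # Keep the eigenvectors whose eigenvalues are largest in magnitude.
    top = np.argsort(-np.abs(eigvals))[:k]
    embedding = eigvecs[:, top]  # one k-dimensional row per node
    return kmeans(embedding, k)


# Toy graph: two disjoint 4-cliques on nodes 0-3 and 4-7.
clique = np.ones((4, 4)) - np.eye(4)
adj = np.block([[clique, np.zeros((4, 4))], [np.zeros((4, 4)), clique]])
labels = spectral_grouping(adj, k=2)
print(labels)
```

On this toy graph the two largest-magnitude eigenvalues belong to the two cliques, so nodes 0 to 3 receive one label and nodes 4 to 7 the other; which label is 0 and which is 1 depends on the initialisation.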

Comments:	Full version of the paper published at WSDM 2023
Subjects:	Social and Information Networks (cs.SI); Machine Learning (cs.LG)
Cite as:	arXiv:2211.04169 [cs.SI]
	(or arXiv:2211.04169v1 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.2211.04169
Related DOI:	https://doi.org/10.1145/3539597.3570441

Submission history

From: Arpit Merchant
[v1] Tue, 8 Nov 2022 11:23:16 UTC (40,551 KB)
