-
Statistical network isomorphism
Authors:
Pierre Miasnikof,
Alexander Y. Shestopaloff,
Cristián Bravo,
Yuri Lawryshyn
Abstract:
Graph isomorphism is a problem for which there is no known polynomial-time solution. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. Moreover, questions of similarity are typically more general and their answers more widely applicable than the more restrictive isomorphism…
▽ More
Graph isomorphism is a problem for which there is no known polynomial-time solution. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. Moreover, questions of similarity are typically more general and their answers more widely applicable than the more restrictive isomorphism question. In this article, we offer a statistical answer to the following questions: a) {\it ``Are networks $G_1$ and $G_2$ similar?''}, b) {\it ``How different are the networks $G_1$ and $G_2$?''} and c) {\it ``Is $G_3$ more similar to $G_1$ or $G_2$?''}. Our comparisons begin with the transformation of each graph into an all-pairs distance matrix. Our node-node distance, Jaccard distance, has been shown to offer a good reflection of the graph's connectivity structure. We then model these distances as probability distributions. Finally, we use well-established statistical tools to gauge the (dis)similarities in terms of probability distribution (dis)similarity. This comparison procedure aims to detect (dis)similarities in connectivity structure, not in easily observable graph characteristics, such as degrees, edge counts or density. We validate our hypothesis that graphs can be meaningfully summarized and compared via their node-node distance distributions, using several synthetic and real-world graphs. Empirical results demonstrate its validity and the accuracy of our comparison technique.
△ Less
Submitted 20 June, 2022;
originally announced June 2022.
-
Graph clustering with Boltzmann machines
Authors:
Pierre Miasnikof,
Mohammad Bagherbeik,
Ali Sheikholeslami
Abstract:
Graph clustering is the process of grouping vertices into densely connected sets called clusters. We tailor two mathematical programming formulations from the literature, to this problem. In doing so, we obtain a heuristic approximation to the intra-cluster density maximization problem. We use two variations of a Boltzmann machine heuristic to obtain numerical solutions. For benchmarking purposes,…
▽ More
Graph clustering is the process of grouping vertices into densely connected sets called clusters. We tailor two mathematical programming formulations from the literature, to this problem. In doing so, we obtain a heuristic approximation to the intra-cluster density maximization problem. We use two variations of a Boltzmann machine heuristic to obtain numerical solutions. For benchmarking purposes, we compare solution quality and computational performances to those obtained using a commercial solver, Gurobi. We also compare clustering quality to the clusters obtained using the popular Louvain modularity maximization method. Our initial results clearly demonstrate the superiority of our problem formulations. They also establish the superiority of the Boltzmann machine over the traditional exact solver. In the case of smaller less complex graphs, Boltzmann machines provide the same solutions as Gurobi, but with solution times that are orders of magnitude lower. In the case of larger and more complex graphs, Gurobi fails to return meaningful results within a reasonable time frame. Finally, we also note that both our clustering formulations, the distance minimization and $K$-medoids, yield clusters of superior quality to those obtained with the Louvain algorithm.
△ Less
Submitted 26 September, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Market Graph Clustering Via QUBO and Digital Annealing
Authors:
Seo Hong,
Pierre Miasnikof,
Roy Kwon,
Yuri Lawryshyn
Abstract:
Our goal is to find representative nodes of a market graph that best replicate the returns of a broader market graph (index), a common task in the financial industry. We model our reference index as a market graph and express the index tracking problem in a quadratic K-medoids form. We take advantage of a purpose built hardware architecture, the Fujitsu Digital Annealer, to circumvent the NP-hard…
▽ More
Our goal is to find representative nodes of a market graph that best replicate the returns of a broader market graph (index), a common task in the financial industry. We model our reference index as a market graph and express the index tracking problem in a quadratic K-medoids form. We take advantage of a purpose built hardware architecture, the Fujitsu Digital Annealer, to circumvent the NP-hard nature of the problem and solve our formulation efficiently. In this article, we combine three separate areas of the literature, market graph models, K-medoid clustering and quadratic binary optimization modeling, to formulate the index-tracking problem as a quadratic K-medoid graph-clustering problem. Our initial results show we accurately replicate the returns of a broad market index, using only a small subset of its constituent assets. Moreover, our quadratic formulation allows us to take advantage of recent hardware advances, to overcome the NP-hard nature of the problem.
△ Less
Submitted 18 June, 2020;
originally announced June 2020.