Null-adjusted persistence function for high-resolution community detection
Authors:
Alessandro Avellone,
Paolo Bartesaghi,
Stefano Benati,
Christos Charalambous,
Rosanna Grassi
Abstract:
Modularity and persistence probability are two widely used quality functions for detecting communities in complex networks. In this paper, we introduce a new objective function called null-adjusted persistence, which combines features of both: it compares the persistence probability against the same null model used by modularity. We prove key analytic properties of this new function. We show that null-adjusted persistence overcomes the limitations of modularity, such as scaling behavior and resolution limits, as well as the limitation of persistence probability, which is an increasing function of cluster size. We propose finding the partition that maximizes null-adjusted persistence with a variation of the Louvain method, and we test its effectiveness on benchmark and real networks. We find that maximizing null-adjusted persistence outperforms modularity maximization, as it detects higher-resolution partitions in dense and large networks.
Submitted 19 May, 2025;
originally announced May 2025.
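The abstract above does not state the formula for null-adjusted persistence. As a hedged illustration only, the sketch below assumes one plausible reading for an undirected graph: for each cluster, take the persistence probability (internal weight divided by cluster volume) and subtract its expected value under the modularity null model, which works out to the cluster volume divided by the total volume 2m; the per-cluster terms are then summed. The function names `build_adj` and `null_adjusted_persistence` are hypothetical, and the authors' definition may differ.

```python
def build_adj(edges):
    """Build an undirected, unit-weight adjacency dict from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, {})[v] = 1.0
        adj.setdefault(v, {})[u] = 1.0
    return adj


def null_adjusted_persistence(adj, partition):
    """Sum over clusters of (persistence - null-model expectation).

    ASSUMPTION: the null term is vol(C) / 2m, i.e. the persistence a
    cluster would have if edge weights followed k_i * k_j / 2m, the null
    model used by modularity. This is an illustrative reading of the
    abstract, not a formula quoted from the paper.
    """
    two_m = sum(sum(nbrs.values()) for nbrs in adj.values())
    total = 0.0
    for cluster in partition:
        members = set(cluster)
        # Cluster volume: sum of the (weighted) degrees of its nodes.
        volume = sum(sum(adj[u].values()) for u in members)
        if volume == 0:
            continue  # isolated nodes contribute nothing
        # Internal weight, counting each internal edge in both directions.
        internal = sum(
            w for u in members for v, w in adj[u].items() if v in members
        )
        total += internal / volume - volume / two_m
    return total


# Two triangles joined by a single bridge edge (2-3).
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
adj = build_adj(edges)
two_triangles = [{0, 1, 2}, {3, 4, 5}]
one_block = [{0, 1, 2, 3, 4, 5}]
score_split = null_adjusted_persistence(adj, two_triangles)
score_whole = null_adjusted_persistence(adj, one_block)
```

Under this assumed form, the trivial one-cluster partition scores exactly zero (persistence 1 minus null expectation 1), while the two-triangle split scores 5/7, so the adjustment removes the reward that plain persistence probability gives to ever larger clusters.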
On Finding the Community with Maximum Persistence Probability
Authors:
Alessandro Avellone,
Stefano Benati,
Rosanna Grassi,
Giorgio Rizzini
Abstract:
The persistence probability is a statistical index that has been proposed to detect one or more communities embedded in a network. Even though its definition is straightforward, i.e., the probability that a random walker remains in a group of nodes, it has seldom been applied, possibly because of the difficulty of developing an efficient algorithm to calculate it. Here, we propose a new mathematical programming model to find the community with the largest persistence probability. The model is an integer fractional program, but it can be reduced to a mixed-integer linear program with an appropriate variable substitution. Nevertheless, the problem can be solved in a reasonable time only for small networks; therefore, we developed some heuristic procedures to approximate the optimal solution. First, we devised a randomized greedy-ascent method, taking advantage of a particular data structure to generate feasible solutions quickly. After analyzing the greedy output and determining where the optimal solution may be located, we implemented improvement procedures based on local exchange, applying different long-term diversification principles based on variable neighborhood search and random restart. Next, we applied the algorithms to simulated graphs that accurately reproduce the clustering characteristics found in real networks, to determine the reliability and effectiveness of our methodology. Finally, we applied our method to two real networks, comparing our findings with those of two well-known alternative community detection procedures.
Submitted 17 June, 2022;
originally announced June 2022.
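To make the definition in the abstract above concrete, here is a minimal sketch of the persistence probability of a node set. It assumes an undirected, weighted graph stored as a dict of dicts and a random walker in its stationary regime, in which case the one-step probability of staying inside the set equals the internal weight divided by the set's volume. The function name `persistence_probability` and the data layout are hypothetical and are not taken from the paper.

```python
def persistence_probability(adj, cluster):
    """Probability that a stationary random walker inside `cluster`
    is still inside it after one step.

    ASSUMPTION: `adj` is a symmetric dict of dicts, adj[u][v] = weight,
    so the stationary distribution is proportional to node strength.
    """
    members = set(cluster)
    internal = 0.0  # weight of steps that stay inside the cluster
    volume = 0.0    # total weight of steps leaving any cluster node
    for u in members:
        for v, w in adj[u].items():
            volume += w
            if v in members:
                internal += w
    if volume == 0:
        raise ValueError("cluster has no incident edges")
    return internal / volume


# Two triangles joined by a single bridge edge (2-3), unit weights.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
adj = {}
for u, v in edges:
    adj.setdefault(u, {})[v] = 1.0
    adj.setdefault(v, {})[u] = 1.0

p_triangle = persistence_probability(adj, {0, 1, 2})
p_pair = persistence_probability(adj, {0, 1})
p_all = persistence_probability(adj, set(adj))
```

In this toy graph the triangle {0, 1, 2} has persistence 6/7, the pair {0, 1} has 1/2, and the whole node set has 1, which illustrates why a search for the single best community needs more than unconstrained maximization of this index.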