Tolerating the Community Detection Resolution Limit with Edge Weighting
Authors:
Jonathan W. Berry,
Bruce Hendrickson,
Randall A. LaViolette,
Cynthia A. Phillips
Abstract:
Communities of vertices within a giant network such as the World-Wide Web are likely to be vastly smaller than the network itself. However, Fortunato and Barthélemy have proved that modularity maximization algorithms for community detection may fail to resolve communities with fewer than $\sqrt{L/2}$ edges, where $L$ is the number of edges in the entire network. This resolution limit leads modularity maximization algorithms to have notoriously poor accuracy on many real networks. Fortunato and Barthélemy's argument can be extended to networks with weighted edges as well, and we derive this corollary argument. We conclude that weighted modularity algorithms may fail to resolve communities with fewer than $\sqrt{W ε/2}$ total edge weight, where $W$ is the total edge weight in the network and $ε$ is the maximum weight of an inter-community edge. If $ε$ is small, then small communities can be resolved.
Given a weighted or unweighted network, we describe how to derive new edge weights that achieve a low $ε$, modify the "CNM" community detection algorithm to maximize weighted modularity, and show that the resulting algorithm has greatly improved accuracy. In experiments with an emerging community standard benchmark, we find that our simple CNM variant is competitive with the most accurate community detection methods yet proposed.
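To make the two thresholds in the abstract concrete, the following is a minimal sketch, not the authors' code. It evaluates $\sqrt{L/2}$ and $\sqrt{W ε/2}$ for given inputs and computes weighted modularity in its standard form (the sum over communities of the internal weight fraction minus the squared strength fraction), which is assumed here and may differ in detail from the paper's formulation. The function names, the toy graph, and the numeric inputs are all hypothetical.

```python
import math


def unweighted_resolution_threshold(num_edges):
    """sqrt(L/2): communities with fewer edges than this may go unresolved."""
    return math.sqrt(num_edges / 2)


def weighted_resolution_threshold(total_weight, epsilon):
    """sqrt(W*eps/2), with eps the maximum inter-community edge weight."""
    return math.sqrt(total_weight * epsilon / 2)


def weighted_modularity(edges, community):
    """Standard weighted modularity (assumed form, not taken from the paper).

    edges: list of (u, v, weight) for an undirected graph, each edge once.
    community: dict mapping each vertex to a community label.
    """
    total = sum(w for _, _, w in edges)
    internal = {}   # weight of edges with both ends inside a community
    strength = {}   # sum of weighted degrees of a community's vertices
    for u, v, w in edges:
        cu, cv = community[u], community[v]
        strength[cu] = strength.get(cu, 0.0) + w
        strength[cv] = strength.get(cv, 0.0) + w
        if cu == cv:
            internal[cu] = internal.get(cu, 0.0) + w
    return sum(
        internal.get(c, 0.0) / total - (s / (2 * total)) ** 2
        for c, s in strength.items()
    )


# Hypothetical toy graph: two triangles joined by a single bridge edge.
def two_triangles(bridge_weight):
    return [
        ("a", "b", 1.0), ("b", "c", 1.0), ("a", "c", 1.0),
        ("d", "e", 1.0), ("e", "f", 1.0), ("d", "f", 1.0),
        ("c", "d", bridge_weight),
    ]


labels = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1, "f": 1}
q_equal = weighted_modularity(two_triangles(1.0), labels)
q_downweighted = weighted_modularity(two_triangles(0.1), labels)
```

With hypothetical inputs of L = 20000 edges, the unweighted threshold is 100 edges. With W = 20000 and ε = 0.01 the weighted threshold drops to about 10, which illustrates why a small ε lets smaller communities be resolved. In the toy graph, lowering the weight of the bridge edge raises the modularity of the two-triangle partition.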
Submitted 7 October, 2009; v1 submitted 5 March, 2009;
originally announced March 2009.
Community Detection via Facility Location
Authors:
Jonathan W. Berry,
Bruce Hendrickson,
Randall A. LaViolette,
Vitus J. Leung,
Cynthia A. Phillips
Abstract:
In this paper we apply theoretical and practical results from facility location theory to the problem of community detection in networks. The result is an algorithm that computes bounds on a minimization variant of local modularity. We also define the concept of an edge support and a new measure of the goodness of community structures with respect to this concept. We present preliminary results and note that our methods are massively parallelizable.
Submitted 9 November, 2007; v1 submitted 19 October, 2007;
originally announced October 2007.