Modularity resolution limit

🚨 This page is a work in progress.

Fortunato and Barthélemy have shown that modularity suffers from a resolution limit, implying that optimizing for modularity does not necessarily reveal the actual community structure of the network (see here). Two communities might get combined into a larger one, yielding a higher modularity. This is the case for communities where , i.e. the number of edges inside community is too small for the grouping of vertices to be recognized as one community.

Note that the notion of “community” is defined in the “weak” sense as in this paper: “In a weak community the sum of all degrees within [the subgraph] is larger than the sum of all degrees toward the rest of the network”. In the mathematical definition of modularity, however, what is considered a community is dependent on the size of the whole network (number of edges ) – which is not ideal.

Fortunato and Barthélemy point out the importance of further investigation of the modules found by an algorithm that optimizes for modularity since one does not know if a claimed “module” is really just one community or a combination of multiple communities. Special care is required for communities with

as they are likely to be composed of two or more smaller1 ones (this number results from merging the two biggest, but still unrecognizable modules together). Yet, even bigger modules could suffer from this constraint of modularity if they are – as the authors call it – “fuzzy”, meaning that they have an increasing number of external edges (edges to other modules), making the difference between internal and external edges smaller.

Graph illustrating the resolution limit of modularity
Network consisting of two and one clique. The former are not correctly identified as individual communities when optimizing for modularity. ( cliques identified as single communities), ( cliques grouped together as a pair).

The above figure illustrates the resolution limit in an extreme example. The two cliques (complete graphs ) on the right are interconnected by one single edge (“bridge”) and share just one edge with the rest of the network, yet they are not recognized as two individual communities; optimal modularity assigns all eight vertices to the same community. The clique may be an arbitrary graph with edges, so that the whole network consists of edges. This ensures that for the cliques (composed of edges each) the condition is met. A complete graph holds edges, thus, our network consists of edges. , yet still, which happens to be sufficiently close to 2.

1

The terms “small” and “big” are used here not to indicate the number of nodes, but the number of intra-edges in the respective communities. How nodes are distributed in the network does not have any impact on modularity, i.e. whether a community has more or less nodes is irrelevant.

2

Fortunato and Barthélemy point out that they calculate as if the variables were continuous, which they are not. Thus, for the given formula of the resolution limit, one has to test the closest integer values.