Delta modularity calculation (singleton)

🚨 This page is a work in progress. TODO: Maybe add a graphic here to explain the various variables.

In the modularity optimization phase, we only rely on local information. Recalculating the global modularity value for every possible neighbor of each vertex would significantly degrade the performance of the Louvain algorithm. This is why Blondel et al. employ a delta modularity formula that can be applied to millions of nodes. In this section, we will explain and derive the formula and show how it can be simplified for usage in a program.

We have seen here that the modularity $Q (C)$ for a partition $C$ is given by:

$Q (C) = \frac{1}{2 m} c \in C \sum (Σ_{c} - \frac{( Σ _{\overset{c}{^}} ) ^{2}}{2 m})$

We are only intrested in the contribution of a single community $c$ to the overall modularity. With a slight abuse of notation, let $Q (c)$ denote modularity of community $c$ ¹, such that $Q (C) = \sum_{c \in C} Q (c)$ . Therefore, for the modularity of a community $c$ , we get:

$Q (c) = \frac{Σ _{c}}{2 m} - (\frac{Σ _{\overset{c}{^}}}{2 m})^{2}$

Next, we look closely at the process of “moving” a vertex $u$ to the community of one of its neighbors $N (u)$ . We assume the vertex to be located in a singleton community. The formula Blondel et al. presented is for exactly this case, where we remove $u$ from its singleton community and then insert it into the neighbor’s community. This is actually only applicable for the first iteration of the modularity optimization phase where in the beginning of every new pass we deal with singleton communities. The authors state that they use a similar formula to calculate the modularity change when $u$ is removed from any community (also those with more than one vertex in it), but do not reveal the expression used. We will therefore derive the generalized version in the next section.

For now, let us consider the case where we move a vertex $u$ from its singleton community that we will denote by $\overset{c}{˚}_{u}$ to any other community $c$ . The modularity change (delta) is then given by

$Δ Q (u, \overset{c}{˚}_{u}, c) = Q^{'} (c) - Q (c) - Q (\overset{c}{˚}_{u})$

where $Q (c)$ is the quality of community $c$ before the merge (and hence without vertex $u$ in that community) and $Q^{'} (c)$ is the quality of $c$ after the merge (and hence after vertex $u$ was integrated into community $c$ ). As the singleton community does not exist anymore after the merge, we also subtract the modularity of the singleton community $Q (\overset{c}{˚}_{u})$ . In the following, let the prime symbol always denote the state of a variable after the merge.

With the above formula, this gives us:

$Δ Q (u, \overset{c}{˚}_{u}, c) = Q^{'} (c) - Q (c) - Q (\overset{c}{˚}_{u}) = [\frac{Σ _{c}^{'}}{2 m} - (\frac{Σ _{\overset{c}{^}}^{'}}{2 m})^{2}] - [\frac{Σ _{c}}{2 m} - (\frac{Σ _{\overset{c}{^}}}{2 m})^{2}] - [\frac{Σ _{\overset{c}{˚}_{u}}}{2 m} - (\frac{k _{u}}{2 m})^{2}]$

which is the equation presented in the original Louvain paper. If we consider that after the merging process, the sum of the weights of the edges between node $u$ and community $c$ – that we will denote² by $k_{u}^{\to c}$ – now adds to the community $c$ accommodating vertex $u$ , we obtain

$Σ_{c}^{'} = Σ_{c} + 2 k_{u}^{\to c}$

Likewise, we find that the weighted vertex degree $k_{u}$ adds to the total weighted vertex degree of $c$ after the merge, thus:

$Σ_{\overset{c}{^}}^{'} = Σ_{\overset{c}{^}} + k_{u}$

With this, we can simplify our expression for the modularity change. Note that we use $Σ_{\overset{c}{˚}_{u}} = 0$ , as there are no edges inside a singleton community $\overset{c}{˚}_{u}$ since we do not allow self-loops in the original graph.

$Δ Q (u, \overset{c}{˚}_{u}, c) = [\frac{Σ _{c} + 2 k _{u}^{\to c}}{2 m} - (\frac{Σ _{\overset{c}{^}} + k _{u}}{2 m})^{2}] - [\frac{Σ _{c}}{2 m} - (\frac{Σ _{\overset{c}{^}}}{2 m})^{2}] - [\frac{Σ _{\overset{c}{˚}_{u}}}{2 m} - (\frac{k _{u}}{2 m})^{2}] = [\frac{Σ _{c} + 2 k _{u}^{\to c}}{2 m} - (\frac{Σ _{\overset{c}{^}} + k _{u}}{2 m})^{2}] - [\frac{Σ _{c}}{2 m} - (\frac{Σ _{\overset{c}{^}}}{2 m})^{2} - (\frac{k _{u}}{2 m})^{2}] = [\frac{Σ _{c} + 2 k _{u}^{\to c}}{2 m} - (\frac{Σ _{\overset{c}{^}} + k _{u}}{2 m})^{2}] - [\frac{Σ _{c}}{2 m} - (\frac{Σ _{\overset{c}{^}}}{2 m})^{2} - (\frac{k _{u}}{2 m})^{2}] = \frac{2 k _{u}^{\to c}}{2 m} - (\frac{Σ _{\overset{c}{^}}}{2 m})^{2} - 2 \frac{Σ _{\overset{c}{^}}}{2 m} \frac{k _{u}}{2 m} - (\frac{k _{u}}{2 m})^{2} + (\frac{Σ _{\overset{c}{^}}}{2 m})^{2} + (\frac{k _{u}}{2 m})^{2}] = \frac{k _{u}^{\to c}}{m} - 2 \frac{Σ _{\overset{c}{^}} k _{u}}{m \cdot 2 m} = \frac{1}{m} (k_{u}^{\to c} - \frac{Σ _{\overset{c}{^}} k _{u}}{2 m})$

Finally, we have

$Δ Q (u, \overset{c}{˚}_{u}, c) \propto k_{u}^{\to c} - \frac{Σ _{\overset{c}{^}} k _{u}}{2 m}$

We can ignore the constant $\frac{1}{m}$ since we only compare different modularity increases $Δ Q (u, \overset{c}{˚}_{u}, c$ with each other and therefore merely require a relative measure. This saves us one division in the algorithm. After one complete pass, the new global modularity is calculated using the formula from here³.

Our short formula is used in the algorithm to efficiently calculate the delta modularity gain $Δ Q$ . To remove a vertex $u$ from its previous community $c_{u}$ or to insert it into a new community $c$ , only $Σ_{c}$ and $Σ_{\overset{c}{^}}$ have to be updated. $Σ_{c}$ is adjusted for the global modularity calculation and not for the delta modularity $Δ Q$ . Note that we can precalculate the sum of the weights of edges between $u$ and community $c_{u}$ or $c$ ( $k_{u}^{\to c_{u}}$ and $k_{u}^{\to c}$ ) before calling the remove- or insert-function. This is crucial for the speed of Louvain as $Δ Q$ has to be computed frequently. As stated above, we also dropped the factor $\frac{1}{m}$ to save one division. For the calculation of global modularity, we do not omit the factor $\frac{1}{2 m}$ in order to obtain the absolute global modularity $Q$ .

We can distinguish the two functions by looking at the argument, which is either a community $c$ or a partition $C$ , i.e. a set of communities $C = {c_{1}, \dots, c_{k}}$ .

This is done in conformity with these notes on modularity. The arrow does not indicate a direction; we still deal with an undirected graph.

In newer versions of the algorithm, this is not the case anymore and we only use the relative modularity calculation. TODO

Fast Louvain

Delta modularity calculation (singleton)