Unknown

Dataset Information

0

From Louvain to Leiden: guaranteeing well-connected communities.


ABSTRACT: Community detection is often used to understand the structure of large and complex networks. One of the most popular algorithms for uncovering community structure is the so-called Louvain algorithm. We show that this algorithm has a major defect that largely went unnoticed until now: the Louvain algorithm may yield arbitrarily badly connected communities. In the worst case, communities may even be disconnected, especially when running the algorithm iteratively. In our experimental analysis, we observe that up to 25% of the communities are badly connected and up to 16% are disconnected. To address this problem, we introduce the Leiden algorithm. We prove that the Leiden algorithm yields communities that are guaranteed to be connected. In addition, we prove that, when the Leiden algorithm is applied iteratively, it converges to a partition in which all subsets of all communities are locally optimally assigned. Furthermore, by relying on a fast local move approach, the Leiden algorithm runs faster than the Louvain algorithm. We demonstrate the performance of the Leiden algorithm for several benchmark and real-world networks. We find that the Leiden algorithm is faster than the Louvain algorithm and uncovers better partitions, in addition to providing explicit guarantees.

SUBMITTER: Traag VA 

PROVIDER: S-EPMC6435756 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

From Louvain to Leiden: guaranteeing well-connected communities.

Traag V A VA   Waltman L L   van Eck N J NJ  

Scientific reports 20190326 1


Community detection is often used to understand the structure of large and complex networks. One of the most popular algorithms for uncovering community structure is the so-called Louvain algorithm. We show that this algorithm has a major defect that largely went unnoticed until now: the Louvain algorithm may yield arbitrarily badly connected communities. In the worst case, communities may even be disconnected, especially when running the algorithm iteratively. In our experimental analysis, we o  ...[more]

Similar Datasets

2017-07-13 | GSE93080 | GEO
2020-08-13 | GSE140782 | GEO
| S-EPMC5240969 | biostudies-literature
| S-EPMC5362157 | biostudies-literature
| S-EPMC8068632 | biostudies-literature
| S-EPMC4517897 | biostudies-literature
| S-EPMC5586368 | biostudies-literature
| S-EPMC5549760 | biostudies-literature
| S-EPMC5898763 | biostudies-literature
| S-EPMC10914749 | biostudies-literature