
Dataset Information


Embedding-based Silhouette community detection.


ABSTRACT: Mining complex data in the form of networks is of increasing interest in many scientific disciplines. Network communities correspond to densely connected subnetworks and often represent key functional parts of real-world systems. This paper proposes embedding-based Silhouette community detection (SCD), an approach that detects communities by clustering network node embeddings, i.e., real-valued representations of nodes derived from their neighborhoods. We investigate the performance of the proposed SCD approach on 234 synthetic networks, as well as on a real-life social network. Even though SCD is not based on any form of modularity optimization, it performs comparably to or better than state-of-the-art community detection algorithms such as InfoMap and Louvain. Further, we demonstrate that SCD's outputs can be used along with domain ontologies in semantic subgroup discovery, yielding human-understandable explanations of communities detected in a real-life protein interaction network. Being embedding-based, SCD is widely applicable and can be tested out-of-the-box as part of many existing network learning and exploration pipelines.
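The general idea in the abstract (embed each node from its neighborhood, cluster the embeddings, and use the Silhouette score to choose among clusterings) can be illustrated with a minimal sketch. This record does not say which embedding or clustering algorithm SCD actually uses, so everything below is an assumption made for illustration: the one-step neighborhood vectors, the small deterministic k-means, the toy graph, and all function names are hypothetical stand-ins, not the authors' implementation.

```python
import math

def neighborhood_embedding(adj):
    """Assumed embedding: row-normalised adjacency with self-loops."""
    n = len(adj)
    emb = []
    for i in range(n):
        row = [1.0 if (j == i or j in adj[i]) else 0.0 for j in range(n)]
        total = sum(row)
        emb.append([x / total for x in row])
    return emb

def dist(a, b):
    """Euclidean distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(points, k, iters=50):
    """Small k-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        new_centers = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centers.append(centers[c])  # keep the centre of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    return labels

def silhouette(points, labels):
    """Mean Silhouette score; singleton clusters contribute 0."""
    clusters = {}
    for idx, l in enumerate(labels):
        clusters.setdefault(l, []).append(idx)
    if len(clusters) < 2:
        return -1.0  # score is undefined for a single cluster
    scores = []
    for i, l in enumerate(labels):
        own = [j for j in clusters[l] if j != i]
        if not own:
            scores.append(0.0)
            continue
        a = sum(dist(points[i], points[j]) for j in own) / len(own)
        b = min(
            sum(dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items() if other != l
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)

def detect_communities(adj, k_values):
    """Return (best_k, labels, score) for the k with the highest Silhouette."""
    emb = neighborhood_embedding(adj)
    best = None
    for k in k_values:
        labels = kmeans(emb, k)
        score = silhouette(emb, labels)
        if best is None or score > best[2]:
            best = (k, labels, score)
    return best

# Toy graph (hypothetical): two 4-node cliques joined by the edge 3-4.
toy_graph = {
    0: {1, 2, 3}, 1: {0, 2, 3}, 2: {0, 1, 3}, 3: {0, 1, 2, 4},
    4: {3, 5, 6, 7}, 5: {4, 6, 7}, 6: {4, 5, 7}, 7: {4, 5, 6},
}
best_k, labels, score = detect_communities(toy_graph, range(2, 5))
print(best_k, labels, round(score, 3))
```

On this toy graph the Silhouette score is highest for two clusters, which correspond to the two cliques. Swapping in a different node embedding only requires replacing `neighborhood_embedding`; the selection loop stays the same.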

SUBMITTER: Skrlj B 

PROVIDER: S-EPMC7652809 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature


Publications

Embedding-based Silhouette community detection.

Škrlj B, Kralj J, Lavrač N

Machine Learning, 2020-07-27 (11)



Similar Datasets

| S-EPMC7485691 | biostudies-literature
| S-EPMC10900708 | biostudies-literature
| S-EPMC9916288 | biostudies-literature
| S-EPMC6416296 | biostudies-literature
| S-EPMC4828636 | biostudies-literature
| S-EPMC555759 | biostudies-literature
2012-02-21 | E-GEOD-34592 | biostudies-arrayexpress
2012-02-21 | GSE34592 | GEO
| S-EPMC4066938 | biostudies-other
| S-EPMC5524321 | biostudies-literature