Unknown

Dataset Information

0

Communication-Optimal Distributed Dynamic Graph Clustering.


ABSTRACT: We consider the problem of clustering graph nodes over large-scale dynamic graphs, such as citation networks, images and web networks, when graph updates such as node/edge insertions/deletions are observed distributively. We propose communication-efficient algorithms for two well-established communication models namely the message passing and the blackboard models. Given a graph with n nodes that is observed at s remote sites over time [1, t], the two proposed algorithms have communication costs Õ(ns) and Õ(n + s) (Õ hides a polylogarithmic factor), almost matching their lower bounds, Ω(ns) and Ω (n + s), respectively, in the message passing and the blackboard models. More importantly, we prove that at each time point in [1, t] our algorithms generate clustering quality nearly as good as that of centralizing all updates up to that time and then applying a standard centralized clustering algorithm. We conducted extensive experiments on both synthetic and real-life datasets which confirmed the communication efficiency of our approach over baseline algorithms while achieving comparable clustering results.

SUBMITTER: Zhu CJ 

PROVIDER: S-EPMC9275443 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Communication-Optimal Distributed Dynamic Graph Clustering.

Zhu Chun Jiang CJ   Zhu Tan T   Lam Kam-Yiu KY   Han Song S   Bi Jinbo J  

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence 20190701


We consider the problem of clustering graph nodes over large-scale dynamic graphs, such as citation networks, images and web networks, when graph updates such as node/edge insertions/deletions are observed distributively. We propose communication-efficient algorithms for two well-established communication models namely the message passing and the blackboard models. Given a graph with <i>n</i> nodes that is observed at <i>s</i> remote sites over time [1, <i>t</i>], the two proposed algorithms hav  ...[more]

Similar Datasets

| S-EPMC2799515 | biostudies-literature
| S-EPMC10805810 | biostudies-literature
| S-EPMC2077249 | biostudies-literature
| S-EPMC8213183 | biostudies-literature
| S-EPMC8819718 | biostudies-literature
| S-EPMC8282031 | biostudies-literature
| S-EPMC6168142 | biostudies-literature
| S-EPMC7959621 | biostudies-literature
| S-EPMC8128468 | biostudies-literature
| S-EPMC8289385 | biostudies-literature