Unknown

Dataset Information

0

Stability of similarity measurements for bipartite networks.


ABSTRACT: Similarity is a fundamental measure in network analyses and machine learning algorithms, with wide applications ranging from personalized recommendation to socio-economic dynamics. We argue that an effective similarity measurement should guarantee the stability even under some information loss. With six bipartite networks, we investigate the stabilities of fifteen similarity measurements by comparing the similarity matrixes of two data samples which are randomly divided from original data sets. Results show that, the fifteen measurements can be well classified into three clusters according to their stabilities, and measurements in the same cluster have similar mathematical definitions. In addition, we develop a top-n-stability method for personalized recommendation, and find that the unstable similarities would recommend false information to users, and the performance of recommendation would be largely improved by using stable similarity measurements. This work provides a novel dimension to analyze and evaluate similarity measurements, which can further find applications in link prediction, personalized recommendation, clustering algorithms, community detection and so on.

SUBMITTER: Liu JG 

PROVIDER: S-EPMC4698667 | biostudies-literature | 2016 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Stability of similarity measurements for bipartite networks.

Liu Jian-Guo JG   Hou Lei L   Pan Xue X   Guo Qiang Q   Zhou Tao T  

Scientific reports 20160104


Similarity is a fundamental measure in network analyses and machine learning algorithms, with wide applications ranging from personalized recommendation to socio-economic dynamics. We argue that an effective similarity measurement should guarantee the stability even under some information loss. With six bipartite networks, we investigate the stabilities of fifteen similarity measurements by comparing the similarity matrixes of two data samples which are randomly divided from original data sets.  ...[more]

Similar Datasets

| S-EPMC3622082 | biostudies-other
| S-EPMC4736915 | biostudies-literature
| S-EPMC3069038 | biostudies-other
| S-EPMC8559930 | biostudies-literature
| S-EPMC8099108 | biostudies-literature
| S-EPMC4488376 | biostudies-literature
| S-EPMC4149528 | biostudies-literature
| S-EPMC4450581 | biostudies-literature
| S-EPMC3048397 | biostudies-literature
| S-EPMC7300072 | biostudies-literature