Dataset Information

Stacking models for nearly optimal link prediction in complex networks.

ABSTRACT: Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speed up network data collection and improve network model validation. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability varies across methods and networks from different domains, and how close to optimality current methods are. We answer these questions by systematically evaluating 203 individual link predictor algorithms, representing three popular families of methods, applied to a large corpus of 550 structurally diverse networks from six scientific domains. We first show that individual algorithms exhibit a broad diversity of prediction errors, such that no one predictor or family is best, or worst, across all realistic inputs. We then exploit this diversity using network-based metalearning to construct a series of "stacked" models that combine predictors into a single algorithm. Applied to a broad range of synthetic networks, for which we may analytically calculate optimal performance, these stacked models achieve optimal or nearly optimal levels of accuracy. Applied to real-world networks, stacked models are superior, but their accuracy varies strongly by domain, suggesting that link prediction may be fundamentally easier in social networks than in biological or technological networks. These results indicate that the state of the art for link prediction comes from combining individual algorithms, which can achieve nearly optimal predictions. We close with a brief discussion of limitations and opportunities for further improvements.
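The stacking idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: three textbook topological scores (common neighbors, Jaccard, Adamic-Adar) stand in for the 203 predictors, a hand-written logistic regression stands in for the meta-learner, and the ten-node toy graph and every function name below are hypothetical choices made only for illustration.

```python
import math
from itertools import combinations


def build_adjacency(n, edges):
    """Undirected adjacency sets for nodes 0..n-1."""
    adj = {i: set() for i in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def pair_features(adj, u, v):
    """Scores from three individual predictors for the pair (u, v).

    The (u, v) edge itself is ignored, so observed edges and
    non-edges are scored on the same footing.
    """
    nu, nv = adj[u] - {v}, adj[v] - {u}
    common = nu & nv
    union = nu | nv
    jaccard = len(common) / len(union) if union else 0.0
    # A common neighbor has degree >= 2, so the logarithm is positive.
    adamic_adar = sum(1.0 / math.log(len(adj[z])) for z in common)
    return [float(len(common)), jaccard, adamic_adar]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict(w, b, x):
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))


def fit_logistic(X, y, epochs=2000, lr=0.1):
    """Batch gradient descent on the logistic loss (assumed meta-learner)."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = predict(w, b, xi) - yi
            for j, xj in enumerate(xi):
                gw[j] += err * xj
            gb += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, gw)]
        b -= lr * gb / len(X)
    return w, b


# Toy network: two 5-cliques joined by one bridge, with two links hidden.
n = 10
true_edges = (list(combinations(range(0, 5), 2))
              + list(combinations(range(5, 10), 2))
              + [(4, 5)])
held_out = {(0, 1), (6, 7)}
observed = [e for e in true_edges if e not in held_out]
adj = build_adjacency(n, observed)

# Train the meta-learner: observed edges are positives, non-edges negatives.
observed_set = set(observed)
pairs = list(combinations(range(n), 2))
X = [pair_features(adj, u, v) for u, v in pairs]
y = [1.0 if p in observed_set else 0.0 for p in pairs]
w, b = fit_logistic(X, y)

# Rank the unobserved pairs by the stacked score.
candidates = [p for p in pairs if p not in observed_set]
ranked = sorted(candidates,
                key=lambda p: predict(w, b, pair_features(adj, *p)),
                reverse=True)
for p in ranked[:5]:
    print(p, round(predict(w, b, pair_features(adj, *p)), 3))
```

The point of the sketch is the structure: each individual predictor becomes one input feature, and a supervised model learns how to weight them for the network at hand. Any other classifier could be swapped in for the logistic regression without changing the rest of the code.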

SUBMITTER: Ghasemian A

PROVIDER: S-EPMC7519231 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

Publications

Stacking models for nearly optimal link prediction in complex networks.

Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, Aaron Clauset

Proceedings of the National Academy of Sciences of the United States of America, 2020-09-04, issue 38

PMID: 32887799

Similar Datasets

Link-Prediction Enhanced Consensus Clustering for Complex Networks. | S-EPMC4874693 | biostudies-literature

On the complexity of quantum link prediction in complex networks. | S-EPMC10781705 | biostudies-literature

An information-theoretic model for link prediction in complex networks. | S-EPMC4558573 | biostudies-literature

LinkPred: a high performance library for link prediction in complex networks. | S-EPMC8157017 | biostudies-literature

Path-based extensions of local link prediction methods for complex networks. | S-EPMC7670409 | biostudies-literature

Similarity-based future common neighbors model for link prediction in complex networks. | S-EPMC6242980 | biostudies-literature

Toward link predictability of complex networks. | S-EPMC4345601 | biostudies-literature

Link prediction in multiplex online social networks. | S-EPMC5367313 | biostudies-literature

Optimal blending of multiple independent prediction models. | S-EPMC9998929 | biostudies-literature

Optimal control of aging in complex networks. | S-EPMC7456090 | biostudies-literature