Unknown

Dataset Information

0

Negative Sampling for Hyperlink Prediction in Networks


ABSTRACT: While graphs capture pairwise relations between entities, hypergraphs deal with higher-order ones, thereby ensuring losslessness. However, in hyperlink (i.e., higher-order link) prediction, where hyperlinks and non-hyperlinks are treated as “positive” and “negative” classes respectively, hypergraphs suffer from the problem of extreme class imbalance. Given this context, “negative sampling”—under-sampling the negative class of non-hyperlinks—becomes mandatory for performing hyperlink prediction. No prior work on hyperlink prediction deals with this problem. In this work, which is the first of its kind, we deal with this problem in the context of hyperlink prediction. More specifically, we leverage graph sampling techniques for sampling non-hyperlinks in hyperlink prediction. Our analysis clearly establishes the effect of random sampling, which is the norm in both link- as well as hyperlink-prediction. Further, we formalize the notion of “hardness” of non-hyperlinks via a measure of density, and analyze its distribution over various negative sampling techniques. We experiment with some real-world hypergraph datasets and provide both qualitative and quantitative results on the effects of negative sampling. We also establish its importance in evaluating hyperlink prediction algorithms.

SUBMITTER: Lauw H 

PROVIDER: S-EPMC7206280 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7068866 | biostudies-literature
| S-EPMC6829266 | biostudies-literature
| S-EPMC5365094 | biostudies-literature
| S-EPMC6731643 | biostudies-literature
| S-EPMC5379509 | biostudies-literature
| S-EPMC4203686 | biostudies-literature
| S-EPMC4675424 | biostudies-literature
| S-EPMC4933923 | biostudies-literature
| S-EPMC9044246 | biostudies-literature
| S-EPMC555505 | biostudies-literature