Dataset Information

Principled approach to the selection of the embedding dimension of networks.


ABSTRACT: Network embedding is a general-purpose machine learning technique that encodes network structure in vector spaces with tunable dimension. Choosing an appropriate embedding dimension - small enough to be efficient and large enough to be effective - is challenging but necessary to generate embeddings applicable to a multitude of tasks. Existing strategies for the selection of the embedding dimension rely on performance maximization in downstream tasks. Here, we propose a principled method such that all structural information of a network is parsimoniously encoded. The method is validated on various embedding algorithms and a large corpus of real-world networks. The embedding dimension selected by our method in real-world networks suggests that efficient encoding in low-dimensional spaces is usually possible.

SUBMITTER: Gu W 

PROVIDER: S-EPMC8213704 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7968242 | biostudies-literature
| S-EPMC10282331 | biostudies-literature
2020-05-18 | GSE125020 | GEO
| S-EPMC9204564 | biostudies-literature
| S-EPMC9048673 | biostudies-literature
| S-EPMC6586941 | biostudies-literature
| S-EPMC10448701 | biostudies-literature
| S-EPMC7189270 | biostudies-literature
| S-EPMC9713410 | biostudies-literature
| S-EPMC9997680 | biostudies-literature