Unknown

Dataset Information

0

PecanPy: a fast, efficient, and parallelized Python implementation of node2vec.


ABSTRACT: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C ++ implementations scale poorly with network density, failing for dense biological networks with hundreds of millions of edges. We have developed PecanPy, a new Python implementation of node2vec that uses cache-optimized compact graph data structures and precomputing/parallelization to result in fast, high-quality node embeddings for biological networks of all sizes and densities. PecanPy software is freely available at https://github.com/krishnanlab/PecanPy. Supplementary data are available at Bioinformatics online.

SUBMITTER: Liu R 

PROVIDER: S-EPMC8504639 | biostudies-literature | 2021 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

PecanPy: a fast, efficient and parallelized Python implementation of node2vec.

Liu Renming R   Krishnan Arjun A  

Bioinformatics (Oxford, England) 20211001 19


<h4>Summary</h4>Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network density, failing for dense biological networks with hundreds of millions of edges. We have developed PecanPy, a new Python implementation of node2vec that uses cache-optimized compact graph data str  ...[more]

Similar Datasets

| S-EPMC5609504 | biostudies-literature
| S-EPMC10779519 | biostudies-literature
| S-EPMC5561639 | biostudies-other
| S-EPMC7141843 | biostudies-literature
2024-09-06 | GSE276553 | GEO
| S-EPMC8479650 | biostudies-literature
| S-EPMC10418261 | biostudies-literature
| S-EPMC11549839 | biostudies-literature
| S-EPMC10611056 | biostudies-literature
| S-EPMC2375128 | biostudies-literature