Unknown

Dataset Information

0

Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities.


ABSTRACT: BACKGROUND:The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. RESULTS:We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572. CONCLUSIONS:Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and multiple types, which will accelerate progress in biological experimental research and industrial product development.

SUBMITTER: Guo ZH 

PROVIDER: S-EPMC7293023 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities.

Guo Zhen-Hao ZH   Guo Zhen-Hao ZH   You Zhu-Hong ZH   Wang Yan-Bin YB   Huang De-Shuang DS   Yi Hai-Cheng HC   Chen Zhan-Heng ZH  

GigaScience 20200601 6


<h4>Background</h4>The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems.<h4>Results</h4>We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on t  ...[more]

Similar Datasets

| S-EPMC5562722 | biostudies-literature
| S-EPMC4685756 | biostudies-literature
| S-EPMC4198697 | biostudies-literature
| S-EPMC10619403 | biostudies-literature
| S-EPMC6185864 | biostudies-literature
| S-EPMC2748682 | biostudies-literature
| S-EPMC7136970 | biostudies-literature
| S-EPMC4139289 | biostudies-literature
| S-EPMC3377689 | biostudies-literature
| S-EPMC6381841 | biostudies-literature