Unknown

Dataset Information

0

Arete - candidate gene prioritization using biological network topology with additional evidence types.


ABSTRACT: BACKGROUND:Refinement of candidate gene lists to select the most promising candidates for further experimental verification remains an essential step between high-throughput exploratory analysis and the discovery of specific causal genes. Given the qualitative and semantic complexity of biological data, successfully addressing this challenge requires development of flexible and interoperable solutions for making the best possible use of the largest possible fraction of all available data. RESULTS:We have developed an easily accessible framework that links two established network-based gene prioritization approaches with a supporting isolation forest-based integrative ranking method. The defining feature of the method is that both topological information of the biological networks and additional sources of evidence can be considered at the same time. The implementation was realized as an app extension for the Cytoscape graph analysis suite, and therefore can further benefit from the synergy with other analysis methods available as part of this system. CONCLUSIONS:We provide efficient reference implementations of two popular gene prioritization algorithms - DIAMOnD and random walk with restart for the Cytoscape system. An extension of those methods was also developed that allows outputs of these algorithms to be combined with additional data. To demonstrate the utility of our software, we present two example disease gene prioritization application cases and show how our tool can be used to evaluate these different approaches.

SUBMITTER: Lysenko A 

PROVIDER: S-EPMC5501438 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Arete - candidate gene prioritization using biological network topology with additional evidence types.

Lysenko Artem A   Boroevich Keith Anthony KA   Tsunoda Tatsuhiko T  

BioData mining 20170706


<h4>Background</h4>Refinement of candidate gene lists to select the most promising candidates for further experimental verification remains an essential step between high-throughput exploratory analysis and the discovery of specific causal genes. Given the qualitative and semantic complexity of biological data, successfully addressing this challenge requires development of flexible and interoperable solutions for making the best possible use of the largest possible fraction of all available data  ...[more]

Similar Datasets

| S-EPMC2945940 | biostudies-literature
| S-EPMC2194797 | biostudies-other
| S-EPMC3501465 | biostudies-literature
| S-EPMC4808289 | biostudies-literature
| S-EPMC3896815 | biostudies-other
| S-EPMC2657789 | biostudies-literature
| S-EPMC6434278 | biostudies-literature
| S-EPMC4987917 | biostudies-literature
| S-EPMC9681200 | biostudies-literature
| S-EPMC2896186 | biostudies-literature