Unknown

Dataset Information

0

Candidate gene prioritization with Endeavour.


ABSTRACT: Genomic studies and high-throughput experiments often produce large lists of candidate genes among which only a small fraction are truly relevant to the disease, phenotype or biological process of interest. Gene prioritization tackles this problem by ranking candidate genes by profiling candidates across multiple genomic data sources and integrating this heterogeneous information into a global ranking. We describe an extended version of our gene prioritization method, Endeavour, now available for six species and integrating 75 data sources. The performance (Area Under the Curve) of Endeavour on cross-validation benchmarks using 'gold standard' gene sets varies from 88% (for human phenotypes) to 95% (for worm gene function). In addition, we have also validated our approach using a time-stamped benchmark derived from the Human Phenotype Ontology, which provides a setting close to prospective validation. With this benchmark, using 3854 novel gene-phenotype associations, we observe a performance of 82%. Altogether, our results indicate that this extended version of Endeavour efficiently prioritizes candidate genes. The Endeavour web server is freely available at https://endeavour.esat.kuleuven.be/.

SUBMITTER: Tranchevent LC 

PROVIDER: S-EPMC4987917 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2447805 | biostudies-other
| S-EPMC2194797 | biostudies-other
| S-EPMC2383918 | biostudies-literature
| S-EPMC1274252 | biostudies-literature
| S-EPMC2657789 | biostudies-literature
| S-EPMC2935433 | biostudies-literature
| S-EPMC10580627 | biostudies-literature
| S-EPMC5501438 | biostudies-literature
| S-EPMC1929163 | biostudies-literature
| S-EPMC4528628 | biostudies-literature