Unknown

Dataset Information

0

Preclinical validation of therapeutic targets predicted by tensor factorization on heterogeneous graphs.


ABSTRACT: Incorrect drug target identification is a major obstacle in drug discovery. Only 15% of drugs advance from Phase II to approval, with ineffective targets accounting for over 50% of these failures1-3. Advances in data fusion and computational modeling have independently progressed towards addressing this issue. Here, we capitalize on both these approaches with Rosalind, a comprehensive gene prioritization method that combines heterogeneous knowledge graph construction with relational inference via tensor factorization to accurately predict disease-gene links. Rosalind demonstrates an increase in performance of 18%-50% over five comparable state-of-the-art algorithms. On historical data, Rosalind prospectively identifies 1 in 4 therapeutic relationships eventually proven true. Beyond efficacy, Rosalind is able to accurately predict clinical trial successes (75% recall at rank 200) and distinguish likely failures (74% recall at rank 200). Lastly, Rosalind predictions were experimentally tested in a patient-derived in-vitro assay for Rheumatoid arthritis (RA), which yielded 5 promising genes, one of which is unexplored in RA.

SUBMITTER: Paliwal S 

PROVIDER: S-EPMC7589557 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Preclinical validation of therapeutic targets predicted by tensor factorization on heterogeneous graphs.

Paliwal Saee S   de Giorgio Alex A   Neil Daniel D   Michel Jean-Baptiste JB   Lacoste Alix Mb AM  

Scientific reports 20201026 1


Incorrect drug target identification is a major obstacle in drug discovery. Only 15% of drugs advance from Phase II to approval, with ineffective targets accounting for over 50% of these failures<sup>1-3</sup>. Advances in data fusion and computational modeling have independently progressed towards addressing this issue. Here, we capitalize on both these approaches with Rosalind, a comprehensive gene prioritization method that combines heterogeneous knowledge graph construction with relational i  ...[more]

Similar Datasets

| S-EPMC6368709 | biostudies-literature
| S-EPMC5330508 | biostudies-literature
| S-EPMC5479529 | biostudies-literature
| S-EPMC5430728 | biostudies-literature
| S-EPMC6563906 | biostudies-literature
| S-EPMC5342366 | biostudies-literature
| S-EPMC6022648 | biostudies-literature
| S-EPMC4112560 | biostudies-literature
| S-EPMC3984787 | biostudies-other
| S-EPMC6660346 | biostudies-literature