Unknown

Dataset Information

0

Evaluating the evaluation of cancer driver genes.


ABSTRACT: Sequencing has identified millions of somatic mutations in human cancers, but distinguishing cancer driver genes remains a major challenge. Numerous methods have been developed to identify driver genes, but evaluation of the performance of these methods is hindered by the lack of a gold standard, that is, bona fide driver gene mutations. Here, we establish an evaluation framework that can be applied to driver gene prediction methods. We used this framework to compare the performance of eight such methods. One of these methods, described here, incorporated a machine-learning-based ratiometric approach. We show that the driver genes predicted by each of the eight methods vary widely. Moreover, the P values reported by several of the methods were inconsistent with the uniform values expected, thus calling into question the assumptions that were used to generate them. Finally, we evaluated the potential effects of unexplained variability in mutation rates on false-positive driver gene predictions. Our analysis points to the strengths and weaknesses of each of the currently available methods and offers guidance for improving them in the future.

SUBMITTER: Tokheim CJ 

PROVIDER: S-EPMC5167163 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evaluating the evaluation of cancer driver genes.

Tokheim Collin J CJ   Papadopoulos Nickolas N   Kinzler Kenneth W KW   Vogelstein Bert B   Karchin Rachel R  

Proceedings of the National Academy of Sciences of the United States of America 20161122 50


Sequencing has identified millions of somatic mutations in human cancers, but distinguishing cancer driver genes remains a major challenge. Numerous methods have been developed to identify driver genes, but evaluation of the performance of these methods is hindered by the lack of a gold standard, that is, bona fide driver gene mutations. Here, we establish an evaluation framework that can be applied to driver gene prediction methods. We used this framework to compare the performance of eight suc  ...[more]

Similar Datasets

| S-EPMC8192921 | biostudies-literature
| S-EPMC4148527 | biostudies-other
| S-EPMC6936061 | biostudies-literature
| S-EPMC3018819 | biostudies-other
| S-EPMC6943052 | biostudies-literature
| S-EPMC6874647 | biostudies-literature
| S-EPMC8142487 | biostudies-literature
| S-EPMC6029450 | biostudies-literature
| S-EPMC6598669 | biostudies-literature
| S-EPMC7807764 | biostudies-literature