Ontology highlight
ABSTRACT: Motivation
Comprehensive catalogue of genes that drive tumor initiation and progression in cancer is key to advancing diagnostics, therapeutics and treatment. Given the complexity of cancer, the catalogue is far from complete yet. Increasing evidence shows that driver genes exhibit consistent aberration patterns across multiple-omics in tumors. In this study, we aim to leverage complementary information encoded in each of the omics data to identify novel driver genes through an integrative framework. Specifically, we integrated mutations, gene expression, DNA copy numbers, DNA methylation and protein abundance, all available in The Cancer Genome Atlas (TCGA) and developed iDriver, a non-parametric Bayesian framework based on multivariate statistical modeling to identify driver genes in an unsupervised fashion. iDriver captures the inherent clusters of gene aberrations and constructs the background distribution that is used to assess and calibrate the confidence of driver genes identified through multi-dimensional genomic data.Results
We applied the method to 4 cancer types in TCGA and identified candidate driver genes that are highly enriched with known drivers. (e.g.: P Availability and implementationThe C?++?source code is freely available at https://medschool.vanderbilt.edu/cgg/ .Contacts
hai.yang@vanderbilt.edu or bingshan.li@Vanderbilt.Edu.Supplementary information
Supplementary data are available at Bioinformatics online.
SUBMITTER: Yang H
PROVIDER: S-EPMC6075201 | biostudies-literature | 2017 Feb
REPOSITORIES: biostudies-literature
Bioinformatics (Oxford, England) 20170201 4
<h4>Motivation</h4>Comprehensive catalogue of genes that drive tumor initiation and progression in cancer is key to advancing diagnostics, therapeutics and treatment. Given the complexity of cancer, the catalogue is far from complete yet. Increasing evidence shows that driver genes exhibit consistent aberration patterns across multiple-omics in tumors. In this study, we aim to leverage complementary information encoded in each of the omics data to identify novel driver genes through an integrati ...[more]