Dataset Information

Pan-cancer identification of clinically relevant genomic subtypes using outcome-weighted integrative clustering.


ABSTRACT:

Background

Comprehensive molecular profiling has revealed somatic variations in cancer at genomic, epigenomic, transcriptomic, and proteomic levels. The accumulating data clearly show that the molecular phenotypes of cancer are complex and shaped by many factors. Conventional unsupervised clustering applied to a large patient population is inevitably driven by the dominant variation from major factors such as cell-of-origin or histology. Translating these data into clinical relevance requires more effective extraction of the information directly associated with patient outcome.

Methods

Drawing from ideas in supervised text classification, we developed survClust, an outcome-weighted clustering algorithm for integrative molecular stratification focusing on patient survival. We applied survClust to 18 cancer types from The Cancer Genome Atlas (TCGA) study, across multiple data modalities including somatic mutation, DNA copy number, DNA methylation, and mRNA, miRNA, and protein expression, to identify novel prognostic subtypes.
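The general idea of outcome-weighted clustering can be illustrated with a small toy sketch. This is not the survClust implementation: the weighting used here (absolute Pearson correlation between each feature and log survival time, ignoring censoring) is a simplifying assumption chosen for brevity, and the function names `outcome_weights`, `kmeans`, and `agreement` as well as the synthetic data are hypothetical. The sketch only shows how down-weighting outcome-unrelated features can keep a dominant nuisance feature from driving the clusters.

```python
import numpy as np

def outcome_weights(X, time):
    """Hypothetical feature weights: |Pearson r| with log survival time.

    Assumption for illustration only; censoring is ignored here.
    """
    y = np.log(time)
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    corr = np.divide(Xc.T @ yc, denom, out=np.zeros(X.shape[1]), where=denom > 0)
    return np.abs(corr)

def kmeans(Z, k, n_iter=100):
    """Plain Lloyd's k-means with deterministic farthest-point initialization."""
    centers = [Z[0]]
    for _ in range(1, k):
        # Next center: the point farthest from all centers chosen so far.
        d = np.min([((Z - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(Z[d.argmax()])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        d = ((Z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        new = np.array([Z[labels == j].mean(axis=0) if np.any(labels == j)
                        else centers[j] for j in range(k)])
        if np.allclose(new, centers):
            break
        centers = new
    return labels

def agreement(a, b):
    """Fraction of matching labels for two 2-cluster labelings, up to label swap."""
    return max(np.mean(a == b), np.mean(a != b))

# Synthetic toy data: feature 0 tracks the survival group,
# feature 1 is a high-variance nuisance factor unrelated to outcome.
i = np.arange(40)
group = np.repeat([0, 1], 20)
X = np.column_stack([2.0 * group + 0.1 * np.sin(i), np.tile([-10.0, 10.0], 20)])
time = np.where(group == 0, 12.0, 60.0) * (1 + 0.05 * np.cos(i))

w = outcome_weights(X, time)
labels_unweighted = kmeans(X, 2)
# Scaling by sqrt(w) makes Euclidean distance equal the weighted distance.
labels_weighted = kmeans(X * np.sqrt(w), 2)

print("unweighted agreement with survival group:", agreement(labels_unweighted, group))
print("weighted agreement with survival group:  ", agreement(labels_weighted, group))
```

In this toy setting the unweighted clustering follows the nuisance feature, while the weighted clustering follows the feature associated with survival. A realistic analysis would need a censoring-aware measure of association and some form of cross-validation to avoid overfitting the weights to the outcome.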

Results

Our analysis identified the prognostic role of high tumor mutation burden with concurrently high CD8 T cell immune marker expression, and the aggressive clinical behavior associated with CDKN2A deletion, across cancer types. Visualization of somatic alterations using circomap, both at a genome-wide scale (total mutation burden, mutational signature, fraction genome altered) and at the individual gene level, further revealed indolent versus aggressive subgroups in a pan-cancer setting.

Conclusions

Our analysis has revealed prognostic molecular subtypes not previously identified by unsupervised clustering. The algorithm and tools we developed are directly useful for patient stratification based on tumor genomics to inform clinical decision-making. The survClust software tool is available at https://github.com/arorarshi/survClust.

SUBMITTER: Arora A 

PROVIDER: S-EPMC7716509 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

Publications

Pan-cancer identification of clinically relevant genomic subtypes using outcome-weighted integrative clustering.

Arora A, Olshen AB, Seshan VE, Shen R

Genome Medicine, 2020-12-03, issue 1


Similar Datasets

| S-EPMC5959300 | biostudies-literature
| S-EPMC10984038 | biostudies-literature
| S-EPMC4339277 | biostudies-literature
| S-EPMC5851245 | biostudies-literature
| S-EPMC6823358 | biostudies-literature
| S-EPMC3702647 | biostudies-literature
| S-EPMC4526352 | biostudies-literature
2015-03-04 | E-GEOD-40774 | biostudies-arrayexpress
2015-03-04 | GSE40774 | GEO
| S-EPMC7854517 | biostudies-literature