Unknown

Dataset Information

0

PolyClustR: defining communities of reconciled cancer subtypes with biological and prognostic significance.


ABSTRACT: BACKGROUND:To ensure cancer patients are stratified towards treatments that are optimally beneficial, it is a priority to define robust molecular subtypes using clustering methods applied to high-dimensional biological data. If each of these methods produces different numbers of clusters for the same data, it is difficult to achieve an optimal solution. Here, we introduce "polyClustR", a tool that reconciles clusters identified by different methods into subtype "communities" using a hypergeometric test or a measure of relative proportion of common samples. RESULTS:The polyClustR pipeline was initially tested using a breast cancer dataset to demonstrate how results are compatible with and add to the understanding of this well-characterised cancer. Two uveal melanoma datasets were then utilised to identify and validate novel subtype communities with significant metastasis-free prognostic differences and associations with known chromosomal aberrations. CONCLUSION:We demonstrate the value of the polyClustR approach of applying multiple consensus clustering algorithms and systematically reconciling the results in identifying novel subtype communities of two cancer types, which nevertheless are compatible with established understanding of these diseases. An R implementation of the pipeline is available at: https://github.com/syspremed/polyClustR.

SUBMITTER: Eason K 

PROVIDER: S-EPMC5970540 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

polyClustR: defining communities of reconciled cancer subtypes with biological and prognostic significance.

Eason Katherine K   Nyamundanda Gift G   Sadanandam Anguraj A  

BMC bioinformatics 20180525 1


<h4>Background</h4>To ensure cancer patients are stratified towards treatments that are optimally beneficial, it is a priority to define robust molecular subtypes using clustering methods applied to high-dimensional biological data. If each of these methods produces different numbers of clusters for the same data, it is difficult to achieve an optimal solution. Here, we introduce "polyClustR", a tool that reconciles clusters identified by different methods into subtype "communities" using a hype  ...[more]

Similar Datasets

2020-09-01 | GSE147384 | GEO
| S-EPMC4244930 | biostudies-literature
| S-EPMC3247293 | biostudies-literature
2019-05-01 | GSE115010 | GEO
| S-EPMC4741767 | biostudies-literature
| S-EPMC7320923 | biostudies-literature
| S-EPMC10199194 | biostudies-literature
| S-EPMC9760436 | biostudies-literature
| S-EPMC6558990 | biostudies-literature
2020-09-22 | GSE158309 | GEO