Unknown

Dataset Information

0

Semi-supervised identification of cancer subgroups using survival outcomes and overlapping grouping information.


ABSTRACT: Identification of cancer patient subgroups using high throughput genomic data is of critical importance to clinicians and scientists because it can offer opportunities for more personalized treatment and overlapping treatments of cancers. In spite of tremendous efforts, this problem still remains challenging because of low reproducibility and instability of identified cancer subgroups and molecular features. In order to address this challenge, we developed Integrative Genomics Robust iDentification of cancer subgroups (InGRiD), a statistical approach that integrates information from biological pathway databases with high-throughput genomic data to improve the robustness for identification and interpretation of molecularly-defined subgroups of cancer patients. We applied InGRiD to the gene expression data of high-grade serous ovarian cancer from The Cancer Genome Atlas and the Australian Ovarian Cancer Study. The results indicate clear benefits of the pathway-level approaches over the gene-level approaches. In addition, using the proposed InGRiD framework, we also investigate and address the issue of gene sharing among pathways, which often occurs in practice, to further facilitate biological interpretation of key molecular features associated with cancer progression. The R package "InGRiD" implementing the proposed approach is currently available in our research group GitHub webpage ( https://dongjunchung.github.io/INGRID/ ).

SUBMITTER: Wei W 

PROVIDER: S-EPMC6922004 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Semi-supervised identification of cancer subgroups using survival outcomes and overlapping grouping information.

Wei Wei W   Sun Zequn Z   da Silveira Willian A WA   Yu Zhenning Z   Lawson Andrew A   Hardiman Gary G   Kelemen Linda E LE   Chung Dongjun D  

Statistical methods in medical research 20180116 7


Identification of cancer patient subgroups using high throughput genomic data is of critical importance to clinicians and scientists because it can offer opportunities for more personalized treatment and overlapping treatments of cancers. In spite of tremendous efforts, this problem still remains challenging because of low reproducibility and instability of identified cancer subgroups and molecular features. In order to address this challenge, we developed Integrative Genomics Robust iDentificat  ...[more]

Similar Datasets

2019-11-13 | GSE140262 | GEO
| S-EPMC6540576 | biostudies-literature
| S-EPMC7703937 | biostudies-literature
| PRJNA589061 | ENA
| S-EPMC7096458 | biostudies-literature
| S-EPMC387275 | biostudies-literature
| S-EPMC7148067 | biostudies-literature
| S-EPMC8444075 | biostudies-literature
| S-EPMC6455938 | biostudies-literature
| S-EPMC2666814 | biostudies-literature