Dataset Information

Comprehensive Cross-Population Analysis of High-Grade Serous Ovarian Cancer Supports No More Than Three Subtypes.


ABSTRACT: Four gene expression subtypes of high-grade serous ovarian cancer (HGSC) have been previously described. In these early studies, a fraction of samples that did not fit well into the four subtype classifications were excluded. Therefore, we sought to systematically determine the concordance of transcriptomic HGSC subtypes across populations without removing any samples. We created a bioinformatics pipeline to independently cluster the five largest mRNA expression datasets using k-means and nonnegative matrix factorization (NMF). We summarized differential expression patterns to compare clusters across studies. While previous studies reported four subtypes, our cross-population comparison does not support four. Because these results contrast with previous reports, we attempted to reproduce analyses performed in those studies. Our results suggest that early results favoring four subtypes may have been driven by the inclusion of serous borderline tumors. In summary, our analysis suggests that either two or three, but not four, gene expression subtypes are most consistent across datasets.
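The clustering step described in the abstract can be illustrated with a minimal sketch. It covers only the k-means half (NMF is not shown), and it is not the authors' pipeline: the toy expression matrix, the function names `farthest_point_init` and `kmeans`, and the deterministic initialization are all assumptions chosen so the example runs with the Python standard library alone. It clusters the same samples at k = 2, 3, and 4 so that the resulting assignments can be compared.

```python
from math import dist  # Euclidean distance, Python 3.8+


def farthest_point_init(rows, k):
    """Hypothetical deterministic init: start from the first sample, then
    repeatedly add the sample farthest from the centroids chosen so far."""
    centroids = [list(rows[0])]
    while len(centroids) < k:
        nxt = max(rows, key=lambda r: min(dist(r, c) for c in centroids))
        centroids.append(list(nxt))
    return centroids


def kmeans(rows, k, n_iter=100):
    """Plain Lloyd's algorithm: assign each sample to its nearest centroid,
    recompute centroids as cluster means, stop when assignments are stable."""
    centroids = farthest_point_init(rows, k)
    labels = None
    for _ in range(n_iter):
        new_labels = [
            min(range(k), key=lambda j: dist(r, centroids[j])) for r in rows
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties out
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Made-up toy data (NOT from the study): 6 samples (rows) x 4 genes (columns).
toy_expression = [
    [9.0, 8.5, 1.0, 1.2],
    [8.8, 9.1, 0.9, 1.0],
    [1.1, 1.0, 9.2, 8.9],
    [0.9, 1.3, 8.7, 9.0],
    [5.0, 5.1, 5.0, 4.9],
    [5.2, 4.9, 5.1, 5.0],
]

# Cluster the same samples at each candidate number of subtypes.
assignments = {k: kmeans(toy_expression, k)[0] for k in (2, 3, 4)}
for k, labels in assignments.items():
    print(f"k={k}: {labels}")
```

In a real analysis each dataset would be clustered independently in this way, and the per-cluster expression patterns would then be compared across datasets, as the abstract describes. A maintained library implementation would normally replace this hand-written loop.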

SUBMITTER: Way GP 

PROVIDER: S-EPMC5144978 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

Publications

Comprehensive Cross-Population Analysis of High-Grade Serous Ovarian Cancer Supports No More Than Three Subtypes.

Gregory P Way, James Rudd, Chen Wang, Habib Hamidi, Brooke L Fridley, Gottfried E Konecny, Ellen L Goode, Casey S Greene, Jennifer A Doherty

G3 (Bethesda, Md.), 2016 Dec 7; issue 12


Similar Datasets

| S-EPMC6207081 | biostudies-literature
2022-05-31 | GSE204748 | GEO
| S-EPMC4271115 | biostudies-literature
| S-EPMC8036744 | biostudies-literature
| S-EPMC7537352 | biostudies-literature
| S-EPMC6873993 | biostudies-literature
| S-EPMC6195427 | biostudies-literature
| S-EPMC10635053 | biostudies-literature
| S-EPMC7298555 | biostudies-literature
| PRJNA842074 | ENA