Unknown

Dataset Information

0

Learning a Latent Space of Highly Multidimensional Cancer Data.


ABSTRACT: We introduce a Unified Disentanglement Network (UFDN) trained on The Cancer Genome Atlas (TCGA), which we refer to as UFDN-TCGA. We demonstrate that UFDN-TCGA learns a biologically relevant, low-dimensional latent space of high-dimensional gene expression data by applying our network to two classification tasks of cancer status and cancer type. UFDN-TCGA performs comparably to random forest methods. The UFDN allows for continuous, partial interpolation between distinct cancer types. Furthermore, we perform an analysis of differentially expressed genes between skin cutaneous melanoma (SKCM) samples and the same samples interpolated into glioblastoma (GBM). We demonstrate that our interpolations consist of relevant metagenes that recapitulate known glioblastoma mechanisms.

SUBMITTER: Kompa B 

PROVIDER: S-EPMC6934353 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5927604 | biostudies-other
| S-EPMC6311689 | biostudies-literature
| S-EPMC9044254 | biostudies-literature
| S-EPMC3706894 | biostudies-other
| S-EPMC7136840 | biostudies-literature
| S-EPMC7561362 | biostudies-literature
| S-EPMC8162036 | biostudies-literature
| S-EPMC5031941 | biostudies-literature
| S-EPMC7212571 | biostudies-literature
| S-EPMC6478574 | biostudies-literature