Unknown

Dataset Information

0

Large-scale RNA-Seq Transcriptome Analysis of 4043 Cancers and 548 Normal Tissue Controls across 12 TCGA Cancer Types.


ABSTRACT: The Cancer Genome Atlas (TCGA) has accrued RNA-Seq-based transcriptome data for more than 4000 cancer tissue samples across 12 cancer types, translating these data into biological insights remains a major challenge. We analyzed and compared the transcriptomes of 4043 cancer and 548 normal tissue samples from 21 TCGA cancer types, and created a comprehensive catalog of gene expression alterations for each cancer type. By clustering genes into co-regulated gene sets, we identified seven cross-cancer gene signatures altered across a diverse panel of primary human cancer samples. A 14-gene signature extracted from these seven cross-cancer gene signatures precisely differentiated between cancerous and normal samples, the predictive accuracy of leave-one-out cross-validation (LOOCV) were 92.04%, 96.23%, 91.76%, 90.05%, 88.17%, 94.29%, and 99.10% for BLCA, BRCA, COAD, HNSC, LIHC, LUAD, and LUSC, respectively. A lung cancer-specific gene signature, containing SFTPA1 and SFTPA2 genes, accurately distinguished lung cancer from other cancer samples, the predictive accuracy of LOOCV for TCGA and GSE5364 data were 95.68% and 100%, respectively. These gene signatures provide rich insights into the transcriptional programs that trigger tumorigenesis and metastasis, and many genes in the signature gene panels may be of significant value to the diagnosis and treatment of cancer.

SUBMITTER: Peng L 

PROVIDER: S-EPMC4544034 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Large-scale RNA-Seq Transcriptome Analysis of 4043 Cancers and 548 Normal Tissue Controls across 12 TCGA Cancer Types.

Peng Li L   Bian Xiu Wu XW   Li Di Kang DK   Xu Chuan C   Wang Guang Ming GM   Xia Qing You QY   Xiong Qing Q  

Scientific reports 20150821


The Cancer Genome Atlas (TCGA) has accrued RNA-Seq-based transcriptome data for more than 4000 cancer tissue samples across 12 cancer types, translating these data into biological insights remains a major challenge. We analyzed and compared the transcriptomes of 4043 cancer and 548 normal tissue samples from 21 TCGA cancer types, and created a comprehensive catalog of gene expression alterations for each cancer type. By clustering genes into co-regulated gene sets, we identified seven cross-canc  ...[more]

Similar Datasets

| S-EPMC4862326 | biostudies-literature
| S-EPMC7109368 | biostudies-literature
| S-EPMC7470976 | biostudies-literature
| S-EPMC3080369 | biostudies-literature
| S-EPMC7671411 | biostudies-literature
| PRJNA912837 | ENA
| PRJNA912838 | ENA
| S-EPMC1525166 | biostudies-literature
| S-EPMC5355310 | biostudies-literature
| S-EPMC9045663 | biostudies-literature