Unknown

Dataset Information

0

A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer.


ABSTRACT:

Purpose

There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer.

Materials and methods

Transcriptomic data of 1878 non-mucinous and 82 mucinous cancer specimens, with 7 sites of origin, namely, the uterine cervix (CESC), colon (COAD), pancreas (PAAD), stomach (STAD), uterine endometrium (UCEC), uterine carcinosarcoma (UCS), and ovary (OV), obtained from The Cancer Genome Atlas, were used as the training and validation sets, respectively. Transcriptomic data of 14 mucinous cancer specimens from a tissue archive were used as the test set. For identifying the site of origin, a set of 100 differentially expressed genes for each site of origin was selected. After removing multiple iterations of the same gene, 427 genes were chosen, and their RNA expression profiles, at each site of origin, were used to train the deep neural network classifier. The performance of the classifier was estimated using the training, validation, and test sets.

Results

The accuracy of the model in the training set was 0.998, while that in the validation set was 0.939 (77/82). In the test set which is newly sequenced from a tissue archive, the model showed an accuracy of 0.857 (12/14). t-SNE analysis revealed that samples in the test set were part of the clusters obtained for the training set.

Conclusion

Although limited by small sample size, we showed that a transcriptome-based classifier could correctly identify the site of origin of mucinous cancer.

SUBMITTER: Ahn T 

PROVIDER: S-EPMC9669684 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer.

Ahn Taejin T   Kim Kidong K   Kim Hyojin H   Kim Sarah S   Park Sangick S   Lee Kyoungbun K  

Cancer informatics 20221115


<h4>Purpose</h4>There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer.<h4>Materials and methods</h4>Transcriptomic data of 1878 non-mucinous and 82 mucinous cancer specimens, with 7 sites of origin, namely, the uterine cervix (CESC), colon (COAD), pancreas (PAAD), stomach (STAD), uterine endometrium (UCEC), uterine carcinosarcoma (UCS),  ...[more]

Similar Datasets

2020-12-15 | GSE163126 | GEO
| PRJNA685029 | ENA
| S-EPMC6853695 | biostudies-literature
| S-EPMC10439717 | biostudies-literature
| S-EPMC4977478 | biostudies-literature
| S-EPMC8776667 | biostudies-literature
| S-EPMC10714470 | biostudies-literature
| S-EPMC7067889 | biostudies-literature
| S-EPMC8658957 | biostudies-literature
| S-EPMC7510298 | biostudies-literature