Unknown

Dataset Information

0

Enhancing identification of cancer types via lowly-expressed microRNAs.


ABSTRACT: The primary function of microRNAs (miRNAs) is to maintain cell homeostasis. In cancerous tissues miRNAs' expression undergo drastic alterations. In this study, we use miRNA expression profiles from The Cancer Genome Atlas of 24 cancer types and 3 healthy tissues, collected from >8500 samples. We seek to classify the cancer's origin and tissue identification using the expression from 1046 reported miRNAs. Despite an apparent uniform appearance of miRNAs among cancerous samples, we recover indispensable information from lowly expressed miRNAs regarding the cancer/tissue types. Multiclass support vector machine classification yields an average recall of 58% in identifying the correct tissue and tumor types. Data discretization had led to substantial improvement, reaching an average recall of 91% (95% median). We propose a straightforward protocol as a crucial step in classifying tumors of unknown primary origin. Our counter-intuitive conclusion is that in almost all cancer types, highly expressing miRNAs mask the significant signal that lower expressed miRNAs provide.

SUBMITTER: Rasnic R 

PROVIDER: S-EPMC5435932 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2727378 | biostudies-literature
| S-EPMC6279243 | biostudies-literature
2014-08-14 | E-GEOD-59037 | biostudies-arrayexpress
| S-EPMC5961134 | biostudies-literature
| S-EPMC2905361 | biostudies-literature
| S-EPMC6212843 | biostudies-literature
| S-EPMC4664671 | biostudies-literature
| S-EPMC7579798 | biostudies-literature
| S-EPMC4291075 | biostudies-literature
| S-EPMC3840150 | biostudies-literature