
Dataset Information

Knowledge about the presence or absence of miRNA isoforms (isomiRs) can successfully discriminate amongst 32 TCGA cancer types.


ABSTRACT: Isoforms of human miRNAs (isomiRs) are constitutively expressed with tissue- and disease-subtype-dependencies. We studied 10 271 tumor datasets from The Cancer Genome Atlas (TCGA) to evaluate whether isomiRs can distinguish amongst 32 TCGA cancers. Unlike previous approaches, we built a classifier that relied solely on 'binarized' isomiR profiles: each isomiR is simply labeled as 'present' or 'absent'. The resulting classifier successfully labeled tumor datasets with an average sensitivity of 90% and a false discovery rate (FDR) of 3%, surpassing the performance of expression-based classification. The classifier maintained its power even after a 15× reduction in the number of isomiRs that were used for training. Notably, the classifier could correctly predict the cancer type in non-TCGA datasets from diverse platforms. Our analysis revealed that the most discriminatory isomiRs happen to also be differentially expressed between normal tissue and cancer. Even so, we find that these highly discriminating isomiRs have not been attracting the most research attention in the literature. Given their ability to successfully classify datasets from 32 cancers, isomiRs and our resulting 'Pan-cancer Atlas' of isomiR expression could serve as a suitable framework to explore novel cancer biomarkers.
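The two ideas in the abstract that lend themselves to a short illustration are the 'binarized' profile (each isomiR is either present or absent) and the two reported metrics (sensitivity and FDR). The Python sketch below is only an illustration under stated assumptions, not the authors' pipeline: the presence threshold, the function names `binarize` and `sensitivity_and_fdr`, and the toy labels are all hypothetical, and the record does not say how presence was called or which classifier was trained.

```python
from collections import Counter


def binarize(profile, threshold=0.0):
    """Return the set of isomiRs called 'present' in one dataset.

    `profile` maps an isomiR identifier to an expression value.
    ASSUMPTION: 'present' means expression strictly above `threshold`;
    the source does not state the actual rule.
    """
    return frozenset(name for name, value in profile.items() if value > threshold)


def sensitivity_and_fdr(true_labels, predicted_labels):
    """Per-class sensitivity TP/(TP+FN) and false discovery rate FP/(TP+FP)."""
    tp, fn, fp = Counter(), Counter(), Counter()
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == pred:
            tp[truth] += 1
        else:
            fn[truth] += 1  # a dataset of class `truth` was missed
            fp[pred] += 1   # and wrongly assigned to class `pred`
    metrics = {}
    for label in set(true_labels) | set(predicted_labels):
        positives = tp[label] + fn[label]
        calls = tp[label] + fp[label]
        metrics[label] = {
            "sensitivity": tp[label] / positives if positives else None,
            "fdr": fp[label] / calls if calls else None,
        }
    return metrics


# Toy example with made-up isomiR names and cancer-type labels.
present = binarize({"isomiR_A": 5.0, "isomiR_B": 0.0, "isomiR_C": 1.2})
scores = sensitivity_and_fdr(
    ["BRCA", "BRCA", "LUAD", "LUAD"],
    ["BRCA", "LUAD", "LUAD", "LUAD"],
)
```

Averaging the per-class values over all cancer types would give summary figures of the kind quoted in the abstract, although the exact averaging scheme used there is not stated in this record.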

SUBMITTER: Telonis AG 

PROVIDER: S-EPMC5389567 | biostudies-literature | 2017 Apr

REPOSITORIES: biostudies-literature

Publications

Knowledge about the presence or absence of miRNA isoforms (isomiRs) can successfully discriminate amongst 32 TCGA cancer types.

Aristeidis G. Telonis, Rogan Magee, Phillipe Loher, Inna Chervoneva, Eric Londin, Isidore Rigoutsos

Nucleic Acids Research, 2017-04-01, issue 6



Similar Datasets

S-EPMC7280754 | biostudies-literature
S-EPMC6776719 | biostudies-literature
S-EPMC8865264 | biostudies-literature
S-EPMC8070023 | biostudies-literature
S-EPMC5008382 | biostudies-literature
S-EPMC3759948 | biostudies-literature
S-EPMC8509996 | biostudies-literature
S-EPMC2727387 | biostudies-literature
S-EPMC6770970 | biostudies-literature
S-EPMC8119692 | biostudies-literature