Unknown

Dataset Information

0

Differential distribution improves gene selection stability and has competitive classification performance for patient survival.


ABSTRACT: A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expression variability (DV) of target genes is frequently observed. This suggests that assessing the importance of genes by either differential expression or variability alone potentially misses sets of important biomarkers that could lead to improved predictions and treatments. Here, we describe a new approach for assessing the importance of genes based on differential distribution (DD), which combines information from differential expression and differential variability into a unified metric. We show that feature ranking and selection stability based on DD can perform two to three times better than DE or DV alone, and that DD yields equivalent error rates to DE and DV. Finally, assessing genes via differential distribution produces a complementary set of selected genes to DE and DV, potentially opening up new categories of biomarkers.

SUBMITTER: Strbenac D 

PROVIDER: S-EPMC5291264 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Differential distribution improves gene selection stability and has competitive classification performance for patient survival.

Strbenac Dario D   Mann Graham J GJ   Yang Jean Y H JY   Ormerod John T JT  

Nucleic acids research 20160517 13


A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expression variability (DV) of target genes is frequently observed. This suggests that assessing the importance of genes by either differential expression or variability alone potentially misses sets of i  ...[more]

Similar Datasets

| S-EPMC2527336 | biostudies-literature
| S-EPMC9727124 | biostudies-literature
| S-EPMC7597697 | biostudies-literature
| S-EPMC4226630 | biostudies-literature
| S-EPMC8976470 | biostudies-literature
| S-EPMC10234400 | biostudies-literature
| S-EPMC9199075 | biostudies-literature
| S-EPMC6530857 | biostudies-literature
| S-EPMC5538675 | biostudies-literature
| S-EPMC3137682 | biostudies-literature