
Dataset Information


Cancer outlier analysis based on mixture modeling of gene expression data.


ABSTRACT: Molecular heterogeneity of cancer, partially caused by various chromosomal aberrations or gene mutations, can yield substantial heterogeneity in the gene expression profiles of cancer samples. To detect cancer-related genes that are active only in a subset of cancer samples (cancer outliers), several methods have been proposed in the context of multiple testing. Such cancer outlier analyses generally suffer from a serious lack of power compared with the standard multiple testing setting, where common activation of genes across all cancer samples is assumed. In this paper, we consider information sharing across genes and cancer samples via parametric normal mixture modeling of the gene expression levels of cancer samples across genes, after standardization using the reference (normal sample) data. A gene-based statistic for gene selection is developed on the basis of the posterior probability of being a cancer outlier for each cancer sample. Our method showed some improvement in efficiency, even under settings with misspecified, heavy-tailed t-distributions. An application to a real dataset from hematologic malignancies is provided.
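The pipeline the abstract outlines (standardize against normal samples, fit a normal mixture shared across genes and samples, score each gene from per-sample outlier posteriors) can be illustrated with a minimal sketch. The abstract does not give the model details, so everything specific here is an assumption for illustration only: a two-component mixture with a fixed N(0, 1) null component, a single outlier component fitted by EM, the starting values, and a gene score defined as the sum of posterior probabilities. All function names are hypothetical and this is not the authors' implementation.

```python
import numpy as np


def standardize(cancer, normal):
    """Standardize cancer expression (genes x samples) gene by gene,
    using the mean and SD of the reference normal samples."""
    mu = normal.mean(axis=1, keepdims=True)
    sd = normal.std(axis=1, ddof=1, keepdims=True)
    return (cancer - mu) / sd


def norm_pdf(x, mean, sd):
    """Density of N(mean, sd^2) evaluated at x."""
    return np.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))


def outlier_posterior(z, pi1, mu1, sd1):
    """Posterior probability that each standardized value is an outlier.
    Assumed model: (1 - pi1) * N(0, 1) + pi1 * N(mu1, sd1^2)."""
    f0 = (1.0 - pi1) * norm_pdf(z, 0.0, 1.0)
    f1 = pi1 * norm_pdf(z, mu1, sd1)
    return f1 / (f0 + f1)


def fit_outlier_mixture(z, n_iter=200, min_sd=1e-3):
    """EM fit of the assumed mixture, pooling all genes and samples.
    The null component is held fixed at N(0, 1)."""
    x = z.ravel()
    pi1, mu1, sd1 = 0.1, 2.0, 1.0  # arbitrary starting values (assumption)
    for _ in range(n_iter):
        # E-step: outlier responsibilities under the current parameters
        post = outlier_posterior(x, pi1, mu1, sd1)
        # M-step: update only the outlier component and its weight
        weight = post.sum()
        pi1 = post.mean()
        mu1 = (post * x).sum() / weight
        sd1 = max(np.sqrt((post * (x - mu1) ** 2).sum() / weight), min_sd)
    return pi1, mu1, sd1


def gene_scores(post):
    """One possible gene-level statistic: the sum of outlier posteriors
    over cancer samples (an assumption, not the paper's statistic)."""
    return post.sum(axis=1)
```

Given expression matrices `normal` and `cancer` with genes in rows, one would call `standardize`, then `fit_outlier_mixture`, then `outlier_posterior` and `gene_scores`, and rank genes by score. Pooling all genes into a single fit is the simplest way to express the "information sharing across genes" idea; a real analysis would also need a principled threshold, for example one based on false discovery rate control.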

SUBMITTER: Mori K 

PROVIDER: S-EPMC3649281 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature


Publications

Cancer outlier analysis based on mixture modeling of gene expression data.

Mori Keita, Oura Tomonori, Noma Hisashi, Matsui Shigeyuki

Computational and Mathematical Methods in Medicine, 2013-04-10



Similar Datasets

| S-EPMC2367561 | biostudies-literature
| S-EPMC2800353 | biostudies-literature
| S-EPMC3394389 | biostudies-literature
| S-EPMC3637832 | biostudies-literature
| S-EPMC5428745 | biostudies-literature
| S-EPMC7176284 | biostudies-literature
| S-EPMC5936001 | biostudies-literature
| S-EPMC4838162 | biostudies-literature
| S-EPMC5451954 | biostudies-literature
| S-EPMC8881390 | biostudies-literature