
Dataset Information


A Bayesian model for cross-study differential gene expression.


ABSTRACT: In this paper we define a hierarchical Bayesian model for microarray expression data collected from several studies and use it to identify genes that show differential expression between two conditions. Key features include shrinkage across both genes and studies, and flexible modeling that allows for interactions between platforms and the estimated effect, as well as concordant and discordant differential expression across studies. We evaluated the performance of our model in a comprehensive fashion, using both artificial data and a "split-study" validation approach that provides an agnostic assessment of the model's behavior not only under the null hypothesis, but also under a realistic alternative. The simulation results from the artificial data demonstrate the advantages of the Bayesian model. The 1 - AUC values for the Bayesian model are roughly half of the corresponding values for a direct combination of t- and SAM-statistics. Furthermore, the simulations provide guidelines for when the Bayesian model is most likely to be useful. Most notably, in small studies the Bayesian model generally outperforms other methods when evaluated by AUC, FDR, and MDR across a range of simulation parameters, and this difference diminishes for larger sample sizes in the individual studies. The split-study validation illustrates appropriate shrinkage of the Bayesian model in the absence of platform, sample, and annotation differences that otherwise complicate experimental data analyses. Finally, we fit our model to four breast cancer studies employing different technologies (cDNA and Affymetrix) to estimate differential expression in estrogen receptor-positive versus estrogen receptor-negative tumors. Software and data for reproducing our analysis are publicly available.
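The evaluation described in the abstract can be illustrated with a small, hedged sketch. It is not the authors' software: it scores only the baseline the abstract mentions (a direct combination of per-study t-statistics) on artificial data, then reports 1 - AUC and FDR. The combination rule (sum of Welch t-statistics divided by the square root of the number of studies), the simulation settings, and every function name below are assumptions made for illustration. MDR is left out because the abstract does not define it.

```python
import math
import random
import statistics


def welch_t(x, y):
    """Welch two-sample t-statistic for two lists of expression values."""
    vx, vy = statistics.variance(x), statistics.variance(y)
    se = math.sqrt(vx / len(x) + vy / len(y))
    return (statistics.mean(x) - statistics.mean(y)) / se


def combine_t(t_per_study):
    """Direct combination (assumed rule: sum scaled by sqrt of study count)."""
    return sum(t_per_study) / math.sqrt(len(t_per_study))


def auc(scores, labels):
    """Rank-based AUC: chance that a true positive outscores a true negative."""
    pos = [s for s, is_de in zip(scores, labels) if is_de]
    neg = [s for s, is_de in zip(scores, labels) if not is_de]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def fdr(scores, labels, n_called):
    """Fraction of the n_called top-scoring genes that are true nulls."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    called = order[:n_called]
    return sum(1 for i in called if not labels[i]) / n_called


def simulate(n_genes=200, n_studies=3, n_per_group=4,
             frac_de=0.1, effect=1.5, seed=1):
    """Artificial data: the first frac_de of genes are shifted in every study."""
    rng = random.Random(seed)
    labels = [g < int(frac_de * n_genes) for g in range(n_genes)]
    scores = []
    for is_de in labels:
        shift = effect if is_de else 0.0
        ts = []
        for _ in range(n_studies):
            a = [rng.gauss(shift, 1.0) for _ in range(n_per_group)]
            b = [rng.gauss(0.0, 1.0) for _ in range(n_per_group)]
            ts.append(welch_t(a, b))
        # Absolute value ranks genes by evidence of a concordant shift.
        scores.append(abs(combine_t(ts)))
    return scores, labels


if __name__ == "__main__":
    scores, labels = simulate()
    print("1 - AUC:", 1 - auc(scores, labels))
    print("FDR among top 20 genes:", fdr(scores, labels, 20))
```

Lowering `n_per_group` in `simulate` mimics the small-study setting in which, according to the abstract, the Bayesian model shows its largest advantage over such direct combinations.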

SUBMITTER: Scharpf RB 

PROVIDER: S-EPMC2994029 | biostudies-literature | 2009

REPOSITORIES: biostudies-literature


Publications

A Bayesian model for cross-study differential gene expression.

Scharpf RB, Tjelmeland H, Parmigiani G, Nobel AB

Journal of the American Statistical Association, 2009-01-01, 488



Similar Datasets

Date | Accession | Repository
| S-EPMC2259410 | biostudies-literature
| PRJNA146911 | ENA
| S-EPMC5159802 | biostudies-literature
| S-EPMC2876132 | biostudies-literature
| S-EPMC4965098 | biostudies-literature
| S-EPMC9071439 | biostudies-literature
| S-EPMC3128064 | biostudies-literature
| 2431957 | ecrin-mdr-crc
2011-12-31 | GSE32679 | GEO
| S-EPMC5039701 | biostudies-other