Unknown

Dataset Information

0

Discovering transcriptional modules by Bayesian data integration.


ABSTRACT:

Motivation

We present a method for directly inferring transcriptional modules (TMs) by integrating gene expression and transcription factor binding (ChIP-chip) data. Our model extends a hierarchical Dirichlet process mixture model to allow data fusion on a gene-by-gene basis. This encodes the intuition that co-expression and co-regulation are not necessarily equivalent and hence we do not expect all genes to group similarly in both datasets. In particular, it allows us to identify the subset of genes that share the same structure of transcriptional modules in both datasets.

Results

We find that by working on a gene-by-gene basis, our model is able to extract clusters with greater functional coherence than existing methods. By combining gene expression and transcription factor binding (ChIP-chip) data in this way, we are better able to determine the groups of genes that are most likely to represent underlying TMs.

Availability

If interested in the code for the work presented in this article, please contact the authors.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Savage RS 

PROVIDER: S-EPMC2881394 | biostudies-literature | 2010 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Discovering transcriptional modules by Bayesian data integration.

Savage Richard S RS   Ghahramani Zoubin Z   Griffin Jim E JE   de la Cruz Bernard J BJ   Wild David L DL  

Bioinformatics (Oxford, England) 20100601 12


<h4>Motivation</h4>We present a method for directly inferring transcriptional modules (TMs) by integrating gene expression and transcription factor binding (ChIP-chip) data. Our model extends a hierarchical Dirichlet process mixture model to allow data fusion on a gene-by-gene basis. This encodes the intuition that co-expression and co-regulation are not necessarily equivalent and hence we do not expect all genes to group similarly in both datasets. In particular, it allows us to identify the su  ...[more]

Similar Datasets

| S-EPMC4079061 | biostudies-literature
| S-EPMC2764433 | biostudies-literature
| S-EPMC4658172 | biostudies-literature
| S-EPMC2887046 | biostudies-literature
| S-EPMC1779513 | biostudies-literature
| S-EPMC6821361 | biostudies-literature
| S-EPMC4561111 | biostudies-literature
| S-EPMC5940219 | biostudies-literature
| S-EPMC7331054 | biostudies-literature
| S-EPMC2858157 | biostudies-literature