Unknown

Dataset Information

0

An effective tri-clustering algorithm combining expression data with gene regulation information.


ABSTRACT:

Motivation

Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing.

Methods

By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV) as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS) is introduced to automatically determine the boundary threshold.

Results

Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.

SUBMITTER: Li A 

PROVIDER: S-EPMC2758278 | biostudies-literature | 2009 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

An effective tri-clustering algorithm combining expression data with gene regulation information.

Li Ao A   Tuck David D  

Gene regulation and systems biology 20090415


<h4>Motivation</h4>Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression a  ...[more]

Similar Datasets

| S-EPMC3563403 | biostudies-literature
| S-EPMC4290656 | biostudies-literature
| S-EPMC3376030 | biostudies-literature
| S-EPMC5986621 | biostudies-literature
| S-EPMC4474961 | biostudies-literature
| S-EPMC156590 | biostudies-literature
| S-EPMC1858704 | biostudies-literature
| S-EPMC5559859 | biostudies-other
| S-EPMC5135122 | biostudies-other
| S-EPMC1388097 | biostudies-literature