Unknown

Dataset Information

0

Dynamically weighted clustering with noise set.


ABSTRACT:

Motivation

Various clustering methods have been applied to microarray gene expression data for identifying genes with similar expression profiles. As the biological annotation data accumulated, more and more genes have been organized into functional categories. Functionally related genes may be regulated by common cellular signals, thus likely to be co-expressed. Consequently, utilizing the rapidly increasing functional annotation resources such as Gene Ontology (GO) to improve the performance of clustering methods is of great interest. On the opposite side of clustering, there are genes that have distinct expression profiles and do not co-express with other genes. Identification of these scattered genes could enhance the performance of clustering methods.

Results

We developed a new clustering algorithm, Dynamically Weighted Clustering with Noise set (DWCN), which makes use of gene annotation information and allows for a set of scattered genes, the noise set, to be left out of the main clusters. We tested the DWCN method and contrasted its results with those obtained using several common clustering techniques on a simulated dataset as well as on two public datasets: the Stanford yeast cell-cycle gene expression data, and a gene expression dataset for a group of genetically different yeast segregants.

Conclusion

Our method produces clusters with more consistent functional annotations and more coherent expression patterns than existing clustering techniques.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Shen Y 

PROVIDER: S-EPMC2815660 | biostudies-literature | 2010 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Dynamically weighted clustering with noise set.

Shen Yijing Y   Sun Wei W   Li Ker-Chau KC  

Bioinformatics (Oxford, England) 20091209 3


<h4>Motivation</h4>Various clustering methods have been applied to microarray gene expression data for identifying genes with similar expression profiles. As the biological annotation data accumulated, more and more genes have been organized into functional categories. Functionally related genes may be regulated by common cellular signals, thus likely to be co-expressed. Consequently, utilizing the rapidly increasing functional annotation resources such as Gene Ontology (GO) to improve the perfo  ...[more]

Similar Datasets

| S-EPMC6507172 | biostudies-literature
| S-EPMC8237321 | biostudies-literature
| S-EPMC7307887 | biostudies-literature
| S-EPMC6363982 | biostudies-literature
| S-EPMC4000392 | biostudies-other
| S-EPMC9537289 | biostudies-literature
| S-EPMC9805571 | biostudies-literature
| S-EPMC7029223 | biostudies-literature
| S-EPMC6966853 | biostudies-literature
| S-EPMC4407953 | biostudies-literature