Dataset Information

Independent component analysis: mining microarray data for fundamental human gene expression modules.


ABSTRACT: As public microarray repositories rapidly accumulate gene expression data, these resources contain increasingly valuable information about cellular processes in human biology. This presents a unique opportunity for intelligent data mining methods to extract information about the transcriptional modules underlying these biological processes. Modeling cellular gene expression as a combination of functional modules, we use independent component analysis (ICA) to derive 423 fundamental components of human biology from a 9395-array compendium of heterogeneous expression data. Annotation using the Gene Ontology (GO) suggests that while some of these components represent known biological modules, others may describe biology not well characterized by existing manually-curated ontologies. In order to understand the biological functions represented by these modules, we investigate the mechanism of the preclinical anti-cancer drug parthenolide (PTL) by analyzing the differential expression of our fundamental components. Our method correctly identifies known pathways and predicts that N-glycan biosynthesis and T-cell receptor signaling may contribute to PTL response. The fundamental gene modules we describe have the potential to provide pathway-level insight into new gene expression datasets.

SUBMITTER: Engreitz JM 

PROVIDER: S-EPMC2991480 | biostudies-literature | 2010 Dec

REPOSITORIES: biostudies-literature

Publications

Independent component analysis: mining microarray data for fundamental human gene expression modules.

Jesse M. Engreitz, Bernie J. Daigle, Jonathan J. Marshall, Russ B. Altman

Journal of Biomedical Informatics, 2010-07-07, issue 6


Similar Datasets

| S-EPMC2646728 | biostudies-literature
| S-EPMC6142401 | biostudies-literature
| S-EPMC1619899 | biostudies-other
| S-EPMC3333188 | biostudies-literature
| S-EPMC134484 | biostudies-literature
| S-EPMC2262871 | biostudies-literature
| S-EPMC2588448 | biostudies-literature
| S-EPMC2825599 | biostudies-literature
| S-EPMC2712752 | biostudies-literature
| S-EPMC3089881 | biostudies-literature