Unknown

Dataset Information

0

Detecting phenotype-specific interactions between biological processes from microarray data and annotations.


ABSTRACT: High throughput technologies enable researchers to measure expression levels on a genomic scale. However, the correct and efficient biological interpretation of such voluminous data remains a challenging problem. Many tools have been developed for the analysis of GO terms that are over- or under-represented in a list of differentially expressed genes. However, a previously unexplored aspect is the identification of changes in the way various biological processes interact in a given condition with respect to a reference. Here, we present a novel approach that aims at identifying such interactions between biological processes that are significantly different in a given phenotype with respect to normal. The proposed technique uses vector-space representation, SVD-based dimensionality reduction, differential weighting, and bootstrapping to asses the significance of the interactions under the multiple and complex dependencies expected between the biological processes. We illustrate our approach on two real data sets involving breast and lung cancer. More than 88 percent of the interactions found by our approach were deemed to be correct by an extensive manual review of literature. An interesting subset of such interactions is discussed in detail and shown to have the potential to open new avenues for research in lung and breast cancer.

SUBMITTER: Ansari NA 

PROVIDER: S-EPMC3748606 | biostudies-literature | 2012 Sep-Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting phenotype-specific interactions between biological processes from microarray data and annotations.

Ansari Nadeem A NA   Bao Riyue R   Voichiţa Călin C   Drăghici Sorin S  

IEEE/ACM transactions on computational biology and bioinformatics 20120901 5


High throughput technologies enable researchers to measure expression levels on a genomic scale. However, the correct and efficient biological interpretation of such voluminous data remains a challenging problem. Many tools have been developed for the analysis of GO terms that are over- or under-represented in a list of differentially expressed genes. However, a previously unexplored aspect is the identification of changes in the way various biological processes interact in a given condition wit  ...[more]

Similar Datasets

| S-EPMC1774576 | biostudies-literature
| S-EPMC545471 | biostudies-literature
| S-EPMC1933513 | biostudies-literature
| S-EPMC2493035 | biostudies-literature
| S-EPMC3125784 | biostudies-literature
| S-EPMC1800870 | biostudies-literature
| S-EPMC7750965 | biostudies-literature
| S-EPMC2930629 | biostudies-literature
| S-EPMC4006456 | biostudies-literature
2020-06-17 | GSE145695 | GEO