Dataset Information

Making sense out of massive data by going beyond differential expression.

ABSTRACT: With the rapid growth of publicly available high-throughput transcriptomic data, there is increasing recognition that large sets of such data can be mined to better understand disease states and mechanisms. Prior gene expression analyses, both large and small, have been dichotomous in nature, in which phenotypes are compared using clearly defined controls. Such approaches may require arbitrary decisions about what are considered "normal" phenotypes, and what each phenotype should be compared to. Instead, we adopt a holistic approach in which we characterize phenotypes in the context of a myriad of tissues and diseases. We introduce scalable methods that associate expression patterns to phenotypes in order both to assign phenotype labels to new expression samples and to select phenotypically meaningful gene signatures. By using a nonparametric statistical approach, we identify signatures that are more precise than those from existing approaches and accurately reveal biological processes that are hidden in case vs. control studies. Employing a comprehensive perspective on expression, we show how metastasized tumor samples localize in the vicinity of the primary site counterparts and are overenriched for those phenotype labels. We find that our approach provides insights into the biological processes that underlie differences between tissues and diseases beyond those identified by traditional differential expression analyses. Finally, we provide an online resource (http://concordia.csail.mit.edu) for mapping users' gene expression samples onto the expression landscape of tissue and disease.

SUBMITTER: Schmid PR

PROVIDER: S-EPMC3326474 | biostudies-literature | 2012 Apr

REPOSITORIES: biostudies-literature


Publications

Making sense out of massive data by going beyond differential expression.

Schmid PR, Palmer NP, Kohane IS, Berger B

Proceedings of the National Academy of Sciences of the United States of America, 2012 Mar 23, issue 15


PMID: 22447773

