Unknown

Dataset Information

0

Global meta-analysis of transcriptomics studies.


ABSTRACT: Transcriptomics meta-analysis aims at re-using existing data to derive novel biological hypotheses, and is motivated by the public availability of a large number of independent studies. Current methods are based on breaking down studies into multiple comparisons between phenotypes (e.g. disease vs. healthy), based on the studies' experimental designs, followed by computing the overlap between the resulting differential expression signatures. While useful, in this methodology each study yields multiple independent phenotype comparisons, and connections are established not between studies, but rather between subsets of the studies corresponding to phenotype comparisons. We propose a rank-based statistical meta-analysis framework that establishes global connections between transcriptomics studies without breaking down studies into sets of phenotype comparisons. By using a rank product method, our framework extracts global features from each study, corresponding to genes that are consistently among the most expressed or differentially expressed genes in that study. Those features are then statistically modelled via a term-frequency inverse-document frequency (TF-IDF) model, which is then used for connecting studies. Our framework is fast and parameter-free; when applied to large collections of Homo sapiens and Streptococcus pneumoniae transcriptomics studies, it performs better than similarity-based approaches in retrieving related studies, using a Medical Subject Headings gold standard. Finally, we highlight via case studies how the framework can be used to derive novel biological hypotheses regarding related studies and the genes that drive those connections. Our proposed statistical framework shows that it is possible to perform a meta-analysis of transcriptomics studies with arbitrary experimental designs by deriving global expression features rather than decomposing studies into multiple phenotype comparisons.

SUBMITTER: Caldas J 

PROVIDER: S-EPMC3935861 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Global meta-analysis of transcriptomics studies.

Caldas José J   Vinga Susana S  

PloS one 20140226 2


Transcriptomics meta-analysis aims at re-using existing data to derive novel biological hypotheses, and is motivated by the public availability of a large number of independent studies. Current methods are based on breaking down studies into multiple comparisons between phenotypes (e.g. disease vs. healthy), based on the studies' experimental designs, followed by computing the overlap between the resulting differential expression signatures. While useful, in this methodology each study yields mu  ...[more]

Similar Datasets

2012-02-03 | GSE35354 | GEO
2022-11-03 | GSE198904 | GEO
| S-EPMC7448304 | biostudies-literature
| PRJEB41312 | ENA
| S-EPMC6354025 | biostudies-literature
2011-02-03 | GSE20186 | GEO
2019-10-08 | GSE131793 | GEO
2011-02-03 | E-GEOD-20186 | biostudies-arrayexpress