Dataset Information

A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions.


ABSTRACT:

Background

We address the problem of integratively analyzing multiple gene expression microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of the system under study. However, the presence of systematic variation across different studies makes network reverse-engineering tasks particularly challenging. We contrast two approaches that have been frequently used in the literature for addressing systematic biases: meta-analysis methods, which first calculate appropriate statistics on each dataset and then summarize them, and data-merging methods, which directly analyze the pooled data after removing any biases. This comparative evaluation is performed on both synthetic and real data, the latter consisting of two manually curated microarray compendia comprising several E. coli and yeast studies, respectively. Furthermore, the reconstruction of the regulatory network of the transcription factor Ikaros in human Peripheral Blood Mononuclear Cells (PBMCs) is presented as a case study.
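The difference between the two approaches can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not name the specific methods evaluated. Everything here is an assumption made for illustration: the function names, the use of Pearson correlation, the Fisher z-transform as the meta-analysis summary, per-dataset mean-centering as a stand-in for bias removal, and the simulated data (an additive, study-specific offset on two otherwise independent genes).

```python
import numpy as np


def meta_analysis_correlation(datasets):
    """Meta-analysis sketch: correlate within each dataset, then summarize.

    datasets: list of arrays shaped (n_samples_i, n_genes), same gene order.
    Per-dataset Pearson correlations are combined with a weighted average
    on Fisher's z scale (weights n_i - 3). This is an illustrative choice.
    """
    z_sum = 0.0
    weight_sum = 0.0
    for data in datasets:
        corr = np.corrcoef(data, rowvar=False)
        # Clip so that arctanh stays finite on the diagonal (r = 1).
        z = np.arctanh(np.clip(corr, -0.999999, 0.999999))
        weight = data.shape[0] - 3
        z_sum = z_sum + weight * z
        weight_sum += weight
    return np.tanh(z_sum / weight_sum)


def merged_correlation(datasets, center=True):
    """Data-merging sketch: pool all samples, then correlate once.

    With center=True each gene is mean-centered within its own dataset
    before pooling (a deliberately simple stand-in for bias removal).
    With center=False the data are pooled naively.
    """
    if center:
        datasets = [d - d.mean(axis=0, keepdims=True) for d in datasets]
    pooled = np.vstack(datasets)
    return np.corrcoef(pooled, rowvar=False)


def simulate_studies(n_studies=3, n_samples=50, seed=0):
    """Hypothetical data: genes 0 and 1 interact, genes 2 and 3 do not.

    Genes 2 and 3 share an additive study-specific offset, which mimics
    a systematic bias between studies.
    """
    rng = np.random.default_rng(seed)
    studies = []
    for i in range(n_studies):
        g0 = rng.normal(size=n_samples)
        g1 = 0.8 * g0 + 0.6 * rng.normal(size=n_samples)
        g2 = rng.normal(size=n_samples)
        g3 = rng.normal(size=n_samples)
        data = np.column_stack([g0, g1, g2, g3])
        offset = np.array([0.0, 0.0, 5.0 * i, 5.0 * i])
        studies.append(data + offset)
    return studies


if __name__ == "__main__":
    studies = simulate_studies()
    print("naive pooling, genes 2-3:", merged_correlation(studies, center=False)[2, 3])
    print("centered pooling, genes 2-3:", merged_correlation(studies, center=True)[2, 3])
    print("meta-analysis, genes 2-3:", meta_analysis_correlation(studies)[2, 3])
```

In this toy setting, naive pooling reports a strong correlation between the two independent genes because the study offset moves them together, whereas both the centered pooling and the per-dataset summary leave that pair near zero and keep the true pair strongly correlated. Real batch effects are rarely purely additive, so mean-centering should be read only as a placeholder for a proper bias-removal step.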

Results

The meta-analysis and data-merging methods included in our experiments provided comparable performance on both synthetic and real data. Furthermore, both approaches outperformed (a) the naïve solution of merging data together while ignoring possible biases, and (b) the results that are expected when only one of the available datasets is analyzed in isolation. Using correlation statistics proved to be more effective than using p-values for correctly ranking candidate interactions. The results from the PBMC case study indicate that the findings of the present study generalize to different types of network reconstruction algorithms.

Conclusions

Ignoring the systematic variations that differentiate heterogeneous studies can produce results that are statistically indistinguishable from random guessing. Meta-analysis and data-merging methods have proved equally effective in addressing this issue, and thus researchers may safely select the approach that best suits their specific application.

SUBMITTER: Lagani V 

PROVIDER: S-EPMC4905611 | biostudies-literature | 2016 Jun

REPOSITORIES: biostudies-literature

Publications

A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions.

Vincenzo Lagani, Argyro D. Karozou, David Gomez-Cabrero, Gilad Silberberg, Ioannis Tsamardinos

BMC Bioinformatics, 2016-06-06


Similar Datasets

Date | Accession | Repository
 | S-EPMC2238724 | biostudies-literature
 | S-EPMC6719665 | biostudies-literature
 | S-EPMC4093844 | biostudies-literature
 | S-EPMC4265362 | biostudies-literature
2013-08-20 | GSE49712 | GEO
 | S-EPMC4054597 | biostudies-literature
 | S-EPMC4393058 | biostudies-literature
2013-08-20 | E-GEOD-49712 | biostudies-arrayexpress
 | S-EPMC3156805 | biostudies-literature
 | S-EPMC6696950 | biostudies-literature