Unknown

Dataset Information

0

Estimating genome-wide regulatory activity from multi-omics data sets using mathematical optimization.


ABSTRACT:

Background

Gene regulation is one of the most important cellular processes, indispensable for the adaptability of organisms and closely interlinked with several classes of pathogenesis and their progression. Elucidation of regulatory mechanisms can be approached by a multitude of experimental methods, yet integration of the resulting heterogeneous, large, and noisy data sets into comprehensive and tissue or disease-specific cellular models requires rigorous computational methods. Recently, several algorithms have been proposed which model genome-wide gene regulation as sets of (linear) equations over the activity and relationships of transcription factors, genes and other factors. Subsequent optimization finds those parameters that minimize the divergence of predicted and measured expression intensities. In various settings, these methods produced promising results in terms of estimating transcription factor activity and identifying key biomarkers for specific phenotypes. However, despite their common root in mathematical optimization, they vastly differ in the types of experimental data being integrated, the background knowledge necessary for their application, the granularity of their regulatory model, the concrete paradigm used for solving the optimization problem and the data sets used for evaluation.

Results

Here, we review five recent methods of this class in detail and compare them with respect to several key properties. Furthermore, we quantitatively compare the results of four of the presented methods based on publicly available data sets.

Conclusions

The results show that all methods seem to find biologically relevant information. However, we also observe that the mutual result overlaps are very low, which contradicts biological intuition. Our aim is to raise further awareness of the power of these methods, yet also to identify common shortcomings and necessary extensions enabling focused research on the critical points.

SUBMITTER: Trescher S 

PROVIDER: S-EPMC5369021 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Estimating genome-wide regulatory activity from multi-omics data sets using mathematical optimization.

Trescher Saskia S   Münchmeyer Jannes J   Leser Ulf U  

BMC systems biology 20170327 1


<h4>Background</h4>Gene regulation is one of the most important cellular processes, indispensable for the adaptability of organisms and closely interlinked with several classes of pathogenesis and their progression. Elucidation of regulatory mechanisms can be approached by a multitude of experimental methods, yet integration of the resulting heterogeneous, large, and noisy data sets into comprehensive and tissue or disease-specific cellular models requires rigorous computational methods. Recentl  ...[more]

Similar Datasets

| S-EPMC6010767 | biostudies-literature
| S-EPMC3630015 | biostudies-literature
| S-EPMC8504614 | biostudies-literature
| S-EPMC8056605 | biostudies-literature
| S-EPMC3569091 | biostudies-literature
| S-EPMC7487249 | biostudies-literature
| S-EPMC8280138 | biostudies-literature
| S-EPMC8262710 | biostudies-literature
| S-EPMC3583146 | biostudies-literature