Unknown

Dataset Information

0

PiMGM: incorporating multi-source priors in mixed graphical models for learning disease networks.


ABSTRACT:

Motivation

Learning probabilistic graphs over mixed data is an important way to combine gene expression and clinical disease data. Leveraging the existing, yet imperfect, information in pathway databases for mixed graphical model (MGM) learning is an understudied problem with tremendous potential applications in systems medicine, the problems of which often involve high-dimensional data.

Results

We present a new method, piMGM, which can learn with accuracy the structure of probabilistic graphs over mixed data by appropriately incorporating priors from multiple experts with different degrees of reliability. We show that piMGM accurately scores the reliability of prior information from a given expert even at low sample sizes. The reliability scores can be used to determine active pathways in healthy and disease samples. We tested piMGM on both simulated and real data from TCGA, and we found that its performance is not affected by unreliable priors. We demonstrate the applicability of piMGM by successfully using prior information to identify pathway components that are important in breast cancer and improve cancer subtype classification.

Availability and implementation

http://www.benoslab.pitt.edu/manatakisECCB2018.html.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Manatakis DV 

PROVIDER: S-EPMC6129280 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

piMGM: incorporating multi-source priors in mixed graphical models for learning disease networks.

Manatakis Dimitris V DV   Raghu Vineet K VK   Benos Panayiotis V PV  

Bioinformatics (Oxford, England) 20180901 17


<h4>Motivation</h4>Learning probabilistic graphs over mixed data is an important way to combine gene expression and clinical disease data. Leveraging the existing, yet imperfect, information in pathway databases for mixed graphical model (MGM) learning is an understudied problem with tremendous potential applications in systems medicine, the problems of which often involve high-dimensional data.<h4>Results</h4>We present a new method, piMGM, which can learn with accuracy the structure of probabi  ...[more]

Similar Datasets

| S-EPMC4465824 | biostudies-literature
| S-EPMC7166149 | biostudies-literature
| S-EPMC5018402 | biostudies-literature
| S-EPMC6223372 | biostudies-literature
| S-EPMC3346750 | biostudies-literature
| S-EPMC5638204 | biostudies-literature
| S-EPMC4146587 | biostudies-literature
| S-EPMC6449754 | biostudies-literature
| S-EPMC5809157 | biostudies-literature
| S-EPMC5830184 | biostudies-literature