Unknown

Dataset Information

0

Part mutual information for quantifying direct associations in networks.


ABSTRACT: Quantitatively identifying direct dependencies between variables is an important task in data analysis, in particular for reconstructing various types of networks and causal relations in science and engineering. One of the most widely used criteria is partial correlation, but it can only measure linearly direct association and miss nonlinear associations. However, based on conditional independence, conditional mutual information (CMI) is able to quantify nonlinearly direct relationships among variables from the observed data, superior to linear measures, but suffers from a serious problem of underestimation, in particular for those variables with tight associations in a network, which severely limits its applications. In this work, we propose a new concept, "partial independence," with a new measure, "part mutual information" (PMI), which not only can overcome the problem of CMI but also retains the quantification properties of both mutual information (MI) and CMI. Specifically, we first defined PMI to measure nonlinearly direct dependencies between variables and then derived its relations with MI and CMI. Finally, we used a number of simulated data as benchmark examples to numerically demonstrate PMI features and further real gene expression data from Escherichia coli and yeast to reconstruct gene regulatory networks, which all validated the advantages of PMI for accurately quantifying nonlinearly direct associations in networks.

SUBMITTER: Zhao J 

PROVIDER: S-EPMC4983806 | biostudies-other | 2016 May

REPOSITORIES: biostudies-other

altmetric image

Publications

Part mutual information for quantifying direct associations in networks.

Zhao Juan J   Zhou Yiwei Y   Zhang Xiujun X   Chen Luonan L  

Proceedings of the National Academy of Sciences of the United States of America 20160418 18


Quantitatively identifying direct dependencies between variables is an important task in data analysis, in particular for reconstructing various types of networks and causal relations in science and engineering. One of the most widely used criteria is partial correlation, but it can only measure linearly direct association and miss nonlinear associations. However, based on conditional independence, conditional mutual information (CMI) is able to quantify nonlinearly direct relationships among va  ...[more]

Similar Datasets

| S-EPMC7517323 | biostudies-literature
| S-EPMC4357691 | biostudies-literature
| S-EPMC7192899 | biostudies-literature
| S-EPMC2648797 | biostudies-literature
| S-EPMC7593831 | biostudies-literature
| S-EPMC5905993 | biostudies-literature
| S-EPMC9470649 | biostudies-literature
| S-EPMC3632132 | biostudies-literature
| S-EPMC4615624 | biostudies-literature
| S-EPMC6381749 | biostudies-literature