Ontology highlight
ABSTRACT: Background
Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds-namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hypotheses testing correction, guidelines for r-value threshold settings have not been defined.Results
Here, we introduce a method that allows determining the r-value threshold based on an iterative approach, where different networks are constructed and their network topology is monitored. Once the network topology changes significantly, the threshold is set to the corresponding correlation coefficient value. The approach was exemplified on: (i) a metabolite and morphological trait dataset from a potato association panel, which was grown under normal irrigation and water recovery conditions; and validated (ii) on a metabolite dataset of hearts of fed and fasted mice. For the potato normal irrigation correlation network a threshold of Pearson's |r|??0.23 was suggested, while for the water recovery correlation network a threshold of Pearson's |r|??0.41 was estimated. For both mice networks the threshold was calculated with Pearson's |r|??0.84.Conclusions
Our analysis corrected the previously stated Pearson's correlation coefficient threshold from 0.4 to 0.41 in the water recovery network and from 0.4 to 0.23 for the normal irrigation network. Furthermore, the proposed method suggested a correlation threshold of 0.84 for both mice networks rather than a threshold of 0.7 as applied earlier. We demonstrate that the proposed approach is a valuable tool for constructing biological meaningful networks.
SUBMITTER: Toubiana D
PROVIDER: S-EPMC7945624 | biostudies-literature | 2021 Mar
REPOSITORIES: biostudies-literature
Toubiana David D Maruenda Helena H
BMC bioinformatics 20210310 1
<h4>Background</h4>Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds-namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hypotheses testing correction, guidelines for r-value threshold settings have not been defined.<h4>Results</h4>Here, we introduce a method that allows determining the r-value threshold based on an ...[more]