
Dataset Information


Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.


ABSTRACT:

Background

Developing powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks of the post-genomic era. Phylogenetic profiling allows protein-protein interactions to be predicted at the whole-genome level in both prokaryotes and eukaryotes, and for this reason it is considered one of the most promising methods.

Results

Here, we propose an improvement of phylogenetic profiling that enables the handling of large genomic datasets and the inference of global protein-protein interactions. The method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation, which makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms previously described phylogenetic profiling methods that use mutual information or Pearson's correlation as the measure of profile similarity.
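
To make the similarity measure concrete, the following is a minimal sketch of the standard sample distance correlation between two phylogenetic profiles. It is not the authors' Phylo-dCor code (their R scripts are not reproduced here) and it is not parallelized. The function names and the example profiles are hypothetical, and each profile is assumed to be a plain list of numbers, one value per reference genome.

```python
import math


def centered_distances(profile):
    """Pairwise absolute distances within one profile, double-centered."""
    n = len(profile)
    d = [[abs(profile[i] - profile[j]) for j in range(n)] for i in range(n)]
    row_means = [sum(row) / n for row in d]
    grand_mean = sum(row_means) / n
    # d is symmetric, so the column means equal the row means.
    return [
        [d[i][j] - row_means[i] - row_means[j] + grand_mean for j in range(n)]
        for i in range(n)
    ]


def mean_product(a, b):
    """Mean of the element-wise product of two n x n matrices."""
    n = len(a)
    return sum(a[i][j] * b[i][j] for i in range(n) for j in range(n)) / (n * n)


def distance_correlation(x, y):
    """Sample distance correlation of two equal-length profiles, in [0, 1]."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    a = centered_distances(x)
    b = centered_distances(y)
    dcov2 = mean_product(a, b)  # squared sample distance covariance
    denom = math.sqrt(mean_product(a, a) * mean_product(b, b))
    if denom == 0.0:
        # A constant profile carries no information; return 0 by convention.
        return 0.0
    return math.sqrt(max(dcov2, 0.0) / denom)


# Hypothetical presence/absence profiles across eight reference genomes.
protein_a = [1, 0, 1, 1, 0, 0, 1, 0]
protein_b = [1, 0, 1, 1, 0, 1, 1, 0]
print(distance_correlation(protein_a, protein_b))
```

Because the measure is built from pairwise distances rather than raw values, two binary profiles that are exact complements of each other also score as maximally dependent. Scoring every pair of proteins in a genome repeats this computation independently for each pair, which is presumably what makes a parallelized implementation attractive for large datasets.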

Conclusions

In this work, we constructed and assessed robust reference sets and proposed the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.

SUBMITTER: Sferra G 

PROVIDER: S-EPMC5584357 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature


Publications

Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.

Gabriella Sferra, Federica Fratini, Marta Ponzi, Elisabetta Pizzi

BMC Bioinformatics, 2017-09-05, issue 1



Similar Datasets

| S-EPMC2718672 | biostudies-literature
| S-EPMC8058397 | biostudies-literature
| S-EPMC8664166 | biostudies-literature
| S-EPMC5790134 | biostudies-literature
| S-EPMC2538145 | biostudies-other
| S-EPMC3806810 | biostudies-literature
| S-EPMC5456046 | biostudies-literature
| S-EPMC9931116 | biostudies-literature
| S-EPMC9191842 | biostudies-literature
| S-EPMC8557847 | biostudies-literature