Dataset Information

A binary matrix factorization algorithm for protein complex prediction.


ABSTRACT:

Background

Identifying biologically relevant protein complexes from a large protein-protein interaction (PPI) network is essential to understanding the organization of biological systems. However, the high-throughput experimental techniques that produce large numbers of PPIs are known to yield non-negligible rates of false positives and false negatives, which makes protein complexes difficult to identify.

Results

We propose a binary matrix factorization (BMF) algorithm under Bayesian Ying-Yang (BYY) harmony learning. It detects protein complexes by factorizing the binary adjacency matrix of a PPI network, thereby clustering proteins that share similar interactions. The proposed BYY-BMF algorithm determines the number of clusters automatically, whereas most existing BMF algorithms require this number to be given in advance. In addition, BYY-BMF's clustering results do not depend on any parameters or thresholds, unlike the Markov Cluster Algorithm (MCL), which relies on a so-called inflation parameter. On synthetic PPI networks, predictions evaluated against the known annotated complexes indicate that BYY-BMF is more robust than MCL in most cases. On real PPI networks from the MIPS and DIP databases, BYY-BMF obtains better-balanced prediction accuracies than MCL and a spectral analysis method, while MCL has its own advantages, e.g., good separation values.
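To make the idea of factorizing a binary adjacency matrix concrete, the following minimal sketch scores how well a given binary membership matrix reproduces a toy PPI network. It is an illustration only and not the BYY-BMF algorithm: the function names, the toy edge list, the membership matrix `Z`, and the choice of a Boolean product `Z Z^T` as the reconstruction are all assumptions made here, and no learning of `Z` or of the cluster number is attempted.

```python
import numpy as np

def adjacency_from_edges(n, edges):
    """Build a symmetric binary adjacency matrix from an undirected edge list."""
    a = np.zeros((n, n), dtype=int)
    for i, j in edges:
        a[i, j] = 1
        a[j, i] = 1
    return a

def reconstruct(z):
    """Boolean product Z Z^T with a zero diagonal: two proteins are linked
    in the reconstruction if they share at least one cluster."""
    r = (z @ z.T > 0).astype(int)
    np.fill_diagonal(r, 0)
    return r

def mismatches(a, z):
    """Count unordered protein pairs where the reconstruction disagrees with A."""
    diff = np.abs(a - reconstruct(z))
    return int(np.triu(diff, k=1).sum())

# Hypothetical toy network: two triangles {0,1,2} and {3,4,5},
# joined by one extra edge 2-3 standing in for a false-positive interaction.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
A = adjacency_from_edges(6, edges)

# Hypothetical membership matrix: rows are proteins, columns are clusters.
Z = np.array([[1, 0], [1, 0], [1, 0], [0, 1], [0, 1], [0, 1]])

print(mismatches(A, Z))  # 1: only the 2-3 edge is left unexplained
```

Under these assumptions, a two-cluster membership explains every interaction except the single cross-triangle edge, whereas placing all six proteins in one cluster would predict many interactions that are absent from the network.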

SUBMITTER: Tu S 

PROVIDER: S-EPMC3724484 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

Publications

A binary matrix factorization algorithm for protein complex prediction.

Tu Shikui, Chen Runsheng, Xu Lei

Proteome Science, 2011 Oct 14


Similar Datasets

| S-EPMC8675762 | biostudies-literature
| S-EPMC5798492 | biostudies-literature
| S-EPMC6287781 | biostudies-literature
| S-EPMC5732780 | biostudies-literature
| S-EPMC6880730 | biostudies-literature
| S-EPMC3642239 | biostudies-literature
| S-EPMC7908717 | biostudies-literature
| S-EPMC6040094 | biostudies-literature
| S-EPMC4752318 | biostudies-literature
| S-EPMC8253547 | biostudies-literature