Dataset Information

Identification of protein complexes by integrating multiple alignment of protein interaction networks.

ABSTRACT:

Motivation

Protein complexes are one of the keys to studying the behavior of a cell system. Many biological functions are carried out by protein complexes. During the past decade, the main strategy used to identify protein complexes from high-throughput network data has been to extract near-cliques or highly dense subgraphs from a single protein-protein interaction (PPI) network. Although experimental PPI data have increased significantly over recent years, most PPI networks still have many false positive interactions and false negative edge loss due to the limitations of high-throughput experiments. In particular, the false negative errors restrict the search space of such conventional protein complex identification approaches. Thus, it has become one of the most challenging tasks in systems biology to automatically identify protein complexes.

Results

In this study, we propose a new algorithm, NEOComplex (NECC- and Ortholog-based Complex identification by multiple network alignment), which integrates functional orthology information that can be obtained from different types of multiple network alignment (MNA) approaches to expand the search space of protein complex detection. As part of our approach, we also define a new edge clustering coefficient (NECC) to assign weights to interaction edges in PPI networks so that protein complexes can be identified more accurately. The NECC is based on the intuition that there is functional information captured in the common neighbors of the common neighbors as well. Our results show that our algorithm outperforms well-known protein complex identification tools in a balance between precision and recall on three eukaryotic species: human, yeast, and fly. As a result of MNAs of the species, the proposed approach can tolerate edge loss in PPI networks and even discover sparse protein complexes which have traditionally been a challenge to predict.
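
For readers unfamiliar with edge clustering coefficients, the sketch below shows the classic form (shared neighbors of an edge's endpoints, normalized by the smaller endpoint degree minus one) used to weight edges in a PPI network. This is not the NECC defined in the paper, whose formula is not given in this record; the function names and the toy network are assumptions made for illustration only.

```python
def build_adjacency(edges):
    """Build an undirected adjacency map {protein: set(neighbors)} from edge pairs."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-interactions in this sketch
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def edge_clustering_coefficient(adj, u, v):
    """Classic edge clustering coefficient (NOT the paper's NECC).

    Number of common neighbors of u and v, divided by the largest number
    of common neighbors the edge could have: min(deg(u) - 1, deg(v) - 1).
    Returns 0.0 when that bound is zero.
    """
    neighbors_u = adj.get(u, set())
    neighbors_v = adj.get(v, set())
    common = len(neighbors_u & neighbors_v)
    bound = min(len(neighbors_u) - 1, len(neighbors_v) - 1)
    return common / bound if bound > 0 else 0.0


def weight_edges(edges):
    """Assign a classic-ECC weight to every edge of the network."""
    adj = build_adjacency(edges)
    return {(u, v): edge_clustering_coefficient(adj, u, v) for u, v in edges}


if __name__ == "__main__":
    # Hypothetical toy network: a triangle A-B-C with a pendant protein D.
    toy_edges = [("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")]
    for edge, weight in weight_edges(toy_edges).items():
        print(edge, weight)
```

Edges inside the triangle receive the maximum weight, while the pendant edge receives zero. According to the abstract, the NECC extends this kind of measure by also taking the common neighbors of the common neighbors into account.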

Availability and implementation

http://acolab.ie.nthu.edu.tw/bionetwork/NEOComplex.

Contact

bab@csail.mit.edu or csliao@ie.nthu.edu.tw.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Ma CY

PROVIDER: S-EPMC5860626 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

Publications

Identification of protein complexes by integrating multiple alignment of protein interaction networks.

Cheng-Yu Ma, Yi-Ping Phoebe Chen, Bonnie Berger, Chung-Shou Liao

Bioinformatics (Oxford, England), 2017 Jun 1, issue 11

PMID: 28130237

Similar Datasets

Unified Alignment of Protein-Protein Interaction Networks. | S-EPMC5430463 | biostudies-literature

Protein complex identification by integrating protein-protein interaction evidence from multiple sources. | S-EPMC3873956 | biostudies-literature

Global alignment of multiple protein interaction networks with application to functional orthology detection. | S-EPMC2522262 | biostudies-literature

Integrating Multiple Interaction Networks for Gene Function Inference. | S-EPMC6337127 | biostudies-literature

Optimizing a global alignment of protein interaction networks. | S-EPMC3799479 | biostudies-literature

Identification of Protein Complexes by Integrating Protein Abundance and Interaction Features Using a Deep Learning Strategy. | S-EPMC10178578 | biostudies-literature

Integrating multiple networks for protein function prediction. | S-EPMC4331678 | biostudies-literature

Detecting overlapping protein complexes in protein-protein interaction networks. | S-EPMC3543700 | biostudies-literature

Alignment of biological networks by integer linear programming: virus-host protein-protein interaction networks. | S-EPMC7671827 | biostudies-literature

ClusterM: a scalable algorithm for computational prediction of conserved protein complexes across multiple protein interaction networks. | S-EPMC7677834 | biostudies-literature