Unknown

Dataset Information

0

Structure-based prediction of protein-protein interactions on a genome-wide scale.


ABSTRACT: The genome-wide identification of pairs of interacting proteins is an important step in the elucidation of cell regulatory mechanisms. Much of our present knowledge derives from high-throughput techniques such as the yeast two-hybrid assay and affinity purification, as well as from manual curation of experiments on individual systems. A variety of computational approaches based, for example, on sequence homology, gene co-expression and phylogenetic profiles, have also been developed for the genome-wide inference of protein-protein interactions (PPIs). Yet comparative studies suggest that the development of accurate and complete repertoires of PPIs is still in its early stages. Here we show that three-dimensional structural information can be used to predict PPIs with an accuracy and coverage that are superior to predictions based on non-structural evidence. Moreover, an algorithm, termed PrePPI, which combines structural information with other functional clues, is comparable in accuracy to high-throughput experiments, yielding over 30,000 high-confidence interactions for yeast and over 300,000 for human. Experimental tests of a number of predictions demonstrate the ability of the PrePPI algorithm to identify unexpected PPIs of considerable biological interest. The surprising effectiveness of three-dimensional structural information can be attributed to the use of homology models combined with the exploitation of both close and remote geometric relationships between proteins.

SUBMITTER: Zhang QC 

PROVIDER: S-EPMC3482288 | biostudies-literature | 2012 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications


The genome-wide identification of pairs of interacting proteins is an important step in the elucidation of cell regulatory mechanisms. Much of our present knowledge derives from high-throughput techniques such as the yeast two-hybrid assay and affinity purification, as well as from manual curation of experiments on individual systems. A variety of computational approaches based, for example, on sequence homology, gene co-expression and phylogenetic profiles, have also been developed for the geno  ...[more]

Similar Datasets

| S-EPMC5748165 | biostudies-literature
| S-EPMC8586911 | biostudies-literature
| S-EPMC5940160 | biostudies-literature
| S-EPMC2845582 | biostudies-literature
| S-EPMC4418708 | biostudies-literature
| S-EPMC3667494 | biostudies-literature
| S-EPMC4272565 | biostudies-literature
| S-EPMC122890 | biostudies-literature
| S-EPMC6423278 | biostudies-literature
| S-EPMC6720845 | biostudies-literature