Unknown

Dataset Information

0

Detecting remotely related proteins by their interactions and sequence similarity.


ABSTRACT: The function of an uncharacterized protein is usually inferred either from its homology to, or its interactions with, characterized proteins. Here, we use both sequence similarity and protein interactions to identify relationships between remotely related protein sequences. We rely on the fact that homologous sequences share similar interactions, and, therefore, the set of interacting partners of the partners of a given protein is enriched by its homologs. The approach was bench-marked by assigning the fold and functional family to test sequences of known structure. Specifically, we relied on 1,434 proteins with known folds, as defined in the Structural Classification of Proteins (SCOP) database, and with known interacting partners, as defined in the Database of Interacting Proteins (DIP). For this subset, the specificity of fold assignment was increased from 54% for position-specific iterative BLAST to 75% for our approach, with a concomitant increase in sensitivity for a few percentage points. Similarly, the specificity of family assignment at the e-value threshold of 10(-8) was increased from 70% to 87%. The proposed method would be a useful tool for large-scale automated discovery of remote relationships between protein sequences, given its unique reliance on sequence similarity and protein-protein interactions.

SUBMITTER: Espadaler J 

PROVIDER: S-EPMC1129109 | biostudies-literature | 2005 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting remotely related proteins by their interactions and sequence similarity.

Espadaler Jordi J   Aragüés Ramón R   Eswar Narayanan N   Marti-Renom Marc A MA   Querol Enrique E   Avilés Francesc X FX   Sali Andrej A   Oliva Baldomero B  

Proceedings of the National Academy of Sciences of the United States of America 20050509 20


The function of an uncharacterized protein is usually inferred either from its homology to, or its interactions with, characterized proteins. Here, we use both sequence similarity and protein interactions to identify relationships between remotely related protein sequences. We rely on the fact that homologous sequences share similar interactions, and, therefore, the set of interacting partners of the partners of a given protein is enriched by its homologs. The approach was bench-marked by assign  ...[more]

Similar Datasets

| S-EPMC2447781 | biostudies-literature
| S-EPMC9312204 | biostudies-literature
| S-EPMC6675052 | biostudies-literature
| S-EPMC2377100 | biostudies-literature
| S-EPMC8062480 | biostudies-literature
| S-EPMC3294165 | biostudies-literature
| 2014357 | ecrin-mdr-crc
| S-EPMC2430716 | biostudies-literature
| S-EPMC6077820 | biostudies-literature
| S-EPMC4120521 | biostudies-literature