Unknown

Dataset Information

0

Sequence specificity between interacting and non-interacting homologs identifies interface residues--a homodimer and monomer use case.


ABSTRACT: Protein families participating in protein-protein interactions may contain sub-families that have different binding characteristics, ranging from right binding to showing no interaction at all. Composition differences at the sequence level in these sub-families are often decisive to their differential functional interaction. Methods to predict interface sites from protein sequences typically exploit conservation as a signal. Here, instead, we provide proof of concept that the sequence specificity between interacting versus non-interacting groups can be exploited to recognise interaction sites.We collected homodimeric and monomeric proteins and formed homologous groups, each having an interacting (homodimer) subgroup and a non-interacting (monomer) subgroup. We then compiled multiple sequence alignments of the proteins in the homologous groups and identified compositional differences between the homodimeric and monomeric subgroups for each of the alignment positions. Our results show that this specificity signal distinguishes interface and other surface residues with 40.9% recall and up to 25.1% precision.To our best knowledge, this is the first large scale study that exploits sequence specificity between interacting and non-interacting homologs to predict interaction sites from sequence information only. The performance obtained indicates that this signal contains valuable information to identify protein-protein interaction sites.

SUBMITTER: Hou Q 

PROVIDER: S-EPMC4599308 | biostudies-literature | 2015 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sequence specificity between interacting and non-interacting homologs identifies interface residues--a homodimer and monomer use case.

Hou Qingzhen Q   Dutilh Bas E BE   Huynen Martijn A MA   Heringa Jaap J   Feenstra K Anton KA  

BMC bioinformatics 20151008


<h4>Background</h4>Protein families participating in protein-protein interactions may contain sub-families that have different binding characteristics, ranging from right binding to showing no interaction at all. Composition differences at the sequence level in these sub-families are often decisive to their differential functional interaction. Methods to predict interface sites from protein sequences typically exploit conservation as a signal. Here, instead, we provide proof of concept that the  ...[more]

Similar Datasets

2019-01-31 | GSE111856 | GEO
| S-EPMC9714365 | biostudies-literature
2019-01-31 | GSE111855 | GEO
2019-01-31 | GSE111854 | GEO
| S-EPMC4518712 | biostudies-literature
| S-EPMC9967328 | biostudies-literature
| S-EPMC4329725 | biostudies-literature
| S-EPMC9070395 | biostudies-literature
| S-EPMC1878507 | biostudies-literature
| S-EPMC3237601 | biostudies-literature