Unknown

Dataset Information

0

A new, structurally nonredundant, diverse data set of protein-protein interfaces and its implications.


ABSTRACT: Here, we present a diverse, structurally nonredundant data set of two-chain protein-protein interfaces derived from the PDB. Using a sequence order-independent structural comparison algorithm and hierarchical clustering, 3799 interface clusters are obtained. These yield 103 clusters with at least five nonhomologous members. We divide the clusters into three types. In Type I clusters, the global structures of the chains from which the interfaces are derived are also similar. This cluster type is expected because, in general, related proteins associate in similar ways. In Type II, the interfaces are similar; however, remarkably, the overall structures and functions of the chains are different. The functional spectrum is broad, from enzymes/inhibitors to immunoglobulins and toxins. The fact that structurally different monomers associate in similar ways, suggests "good" binding architectures. This observation extends a paradigm in protein science: It has been well known that proteins with similar structures may have different functions. Here, we show that it extends to interfaces. In Type III clusters, only one side of the interface is similar across the cluster. This structurally nonredundant data set provides rich data for studies of protein-protein interactions and recognition, cellular networks and drug design. In particular, it may be useful in addressing the difficult question of what are the favorable ways for proteins to interact. (The data set is available at http://protein3d.ncifcrf.gov/~keskino/ and http://home.ku.edu.tr/~okeskin/INTERFACE/INTERFACES.html.)

SUBMITTER: Keskin O 

PROVIDER: S-EPMC2280042 | biostudies-literature | 2004 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

A new, structurally nonredundant, diverse data set of protein-protein interfaces and its implications.

Keskin Ozlem O   Tsai Chung-Jung CJ   Wolfson Haim H   Nussinov Ruth R  

Protein science : a publication of the Protein Society 20040401 4


Here, we present a diverse, structurally nonredundant data set of two-chain protein-protein interfaces derived from the PDB. Using a sequence order-independent structural comparison algorithm and hierarchical clustering, 3799 interface clusters are obtained. These yield 103 clusters with at least five nonhomologous members. We divide the clusters into three types. In Type I clusters, the global structures of the chains from which the interfaces are derived are also similar. This cluster type is  ...[more]

Similar Datasets

| S-EPMC7447090 | biostudies-literature
| S-EPMC4624513 | biostudies-literature
| S-EPMC6599994 | biostudies-literature
| S-EPMC6343515 | biostudies-literature
| S-EPMC7778930 | biostudies-literature
| S-EPMC7182663 | biostudies-literature