
Dataset Information


Persistently conserved positions in structurally similar, sequence dissimilar proteins: roles in preserving protein fold and function.


ABSTRACT: Many protein pairs that share the same fold do not have any detectable sequence similarity, providing a valuable source of information for studying sequence-structure relationship. In this study, we use a stringent data set of structurally similar, sequence-dissimilar protein pairs to characterize residues that may play a role in the determination of protein structure and/or function. For each protein in the database, we identify amino-acid positions that show residue conservation within both close and distant family members. These positions are termed "persistently conserved". We then proceed to determine the "mutually" persistently conserved (MPC) positions: those structurally aligned positions in a protein pair that are persistently conserved in both pair mates. Because of their intra- and interfamily conservation, these positions are good candidates for determining protein fold and function. We find that 45% of the persistently conserved positions are mutually conserved. A significant fraction of them are located in critical positions for secondary structure determination, they are mostly buried, and many of them form spatial clusters within their protein structures. A substitution matrix based on the subset of MPC positions shows two distinct characteristics: (i) it is different from other available matrices, even those that are derived from structural alignments; (ii) its relative entropy is high, emphasizing the special residue restrictions imposed on these positions. Such a substitution matrix should be valuable for protein design experiments.
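The two computations named in the abstract, selecting mutually persistently conserved (MPC) positions and measuring the relative entropy of a substitution matrix, can be illustrated with a minimal Python sketch. This is not the authors' code. The function names, the toy alignment, and the two-letter alphabet are hypothetical, and the sketch assumes that the persistently conserved positions of each protein have already been identified. The relative entropy uses the standard definition H = sum over residue pairs of q_ij * log2(q_ij / (p_i * p_j)), where q_ij is the observed frequency of the aligned pair and p_i, p_j are background frequencies.

```python
from math import log2


def mutually_conserved(aligned_pairs, conserved_a, conserved_b):
    """Keep structurally aligned position pairs (i, j) in which position i is
    persistently conserved in protein A and position j in protein B."""
    return [(i, j) for i, j in aligned_pairs
            if i in conserved_a and j in conserved_b]


def relative_entropy(pair_freqs, background):
    """Relative entropy in bits of a substitution matrix, computed from
    ordered pair frequencies q_ij and background frequencies p_i."""
    return sum(q * log2(q / (background[a] * background[b]))
               for (a, b), q in pair_freqs.items() if q > 0)


# Hypothetical structural alignment: (position in A, position in B).
aligned_pairs = [(1, 4), (2, 5), (3, 6), (7, 9)]
conserved_a = {1, 3, 7}   # assumed persistently conserved positions in A
conserved_b = {4, 5, 9}   # assumed persistently conserved positions in B

mpc = mutually_conserved(aligned_pairs, conserved_a, conserved_b)

# Toy two-letter alphabet in which aligned residues are always identical.
background = {"A": 0.5, "B": 0.5}
strict = {("A", "A"): 0.5, ("B", "B"): 0.5}
# Pair frequencies equal to the product of backgrounds carry no information.
unrestricted = {(a, b): 0.25 for a in "AB" for b in "AB"}

h_strict = relative_entropy(strict, background)
h_unrestricted = relative_entropy(unrestricted, background)
```

In this toy case the fully restrictive matrix has a relative entropy of 1 bit and the unrestricted one has 0 bits, which is the sense in which a high relative entropy reflects strong residue restrictions at the MPC positions.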

SUBMITTER: Friedberg I 

PROVIDER: S-EPMC2373454 | biostudies-literature | 2002 Feb

REPOSITORIES: biostudies-literature


Publications

Persistently conserved positions in structurally similar, sequence dissimilar proteins: roles in preserving protein fold and function.

Iddo Friedberg, Hanah Margalit

Protein science : a publication of the Protein Society, 2002-02-01, issue 2



Similar Datasets

| S-EPMC2673347 | biostudies-literature
| S-EPMC2072066 | biostudies-literature
| S-EPMC8404102 | biostudies-literature
| S-EPMC2740814 | biostudies-literature
| S-EPMC3116163 | biostudies-literature
| S-EPMC8080601 | biostudies-literature
| S-EPMC2813367 | biostudies-literature
| S-EPMC5648816 | biostudies-literature
| S-EPMC3549380 | biostudies-literature
| S-EPMC3609042 | biostudies-literature