Dataset Information

Protein homologous cores and loops: important clues to evolutionary relationships between structurally similar proteins.


ABSTRACT: To discover remote evolutionary relationships and functional similarities between proteins, biologists rely on comparative sequence analysis, and when structures are available, on structural alignments and various measures of structural similarity. The measures/scores that have most commonly been used for this purpose include: alignment length, percent sequence identity, superposition RMSD and their different combinations. More recently, we have introduced the "Homologous core structure overlap score" (HCS) and the "Loop Hausdorff Measure" (LHM). Along with these we also consider the "gapped structural alignment score" (GSAS), which was introduced earlier by other researchers. We analyze the performance of these and other conventional measures at the task of ranking structure neighbors by homology, and we show that the HCS, LHM, and GSAS scores display considerably improved performance over the conventional measures of sequence or structural similarity. The HCS, LHM, and GSAS scores are easily computable quantities that allow users of structure-neighbor databases to more easily identify interesting structural similarities between proteins.
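For readers unfamiliar with the conventional measures named in the abstract, the following is a minimal Python sketch of alignment length, percent sequence identity, and RMSD. It is not the authors' implementation and does not cover HCS, LHM, or GSAS, whose definitions are not given in this record. The function names are hypothetical, the identity denominator (ungapped aligned columns) is one of several conventions in use, and the RMSD function assumes the two coordinate sets have already been superposed.

```python
import math


def alignment_length(aligned_a, aligned_b):
    """Count alignment columns where both sequences have a residue (no gap)."""
    return sum(1 for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-")


def percent_identity(aligned_a, aligned_b):
    """Identical residues as a percentage of the ungapped aligned columns.

    Assumption: the denominator is the number of ungapped columns; other
    conventions (e.g. the shorter sequence length) are also common.
    """
    n = alignment_length(aligned_a, aligned_b)
    if n == 0:
        return 0.0
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / n


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation over paired 3D coordinates.

    Assumption: the structures are already superposed; no fitting is done here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = sum(
        (p - q) ** 2
        for a, b in zip(coords_a, coords_b)
        for p, q in zip(a, b)
    )
    return math.sqrt(total / len(coords_a))
```

As a usage illustration with made-up inputs, `percent_identity("ACDE", "ACGE")` compares four aligned columns of which three match, and `rmsd` takes two equal-length lists of `(x, y, z)` tuples for the aligned residues.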

SUBMITTER: Madej T 

PROVIDER: S-EPMC1852803 | biostudies-other | 2007 Apr

REPOSITORIES: biostudies-other

Publications

Protein homologous cores and loops: important clues to evolutionary relationships between structurally similar proteins.

Thomas Madej, Anna R. Panchenko, Jie Chen, Stephen H. Bryant

BMC Structural Biology, 2007-04-10


Similar Datasets

| S-EPMC3609042 | biostudies-literature
| S-EPMC27587 | biostudies-literature
| S-EPMC5282881 | biostudies-literature
| S-EPMC2740814 | biostudies-literature
| S-EPMC6988545 | biostudies-literature
| S-EPMC7028011 | biostudies-literature
| S-EPMC5321002 | biostudies-literature
| S-EPMC3358857 | biostudies-literature
| S-EPMC4249156 | biostudies-literature
| S-EPMC3705561 | biostudies-literature