Unknown

Dataset Information

0

Homolog detection using global sequence properties suggests an alternate view of structural encoding in protein sequences.


ABSTRACT: We show that a Fourier-based sequence distance function is able to identify structural homologs of target sequences with high accuracy. It is shown that Fourier distances correlate very strongly with independently determined structural distances between molecules, a property of the method that is not attainable using conventional representations. It is further shown that the ability of the Fourier approach to identify protein folds is statistically far in excess of random expectation. It is then shown that, in actual searches for structural homologs of selected target sequences, the Fourier approach gives excellent results. On the basis of these results, we suggest that the global information detected by the Fourier representation is an essential feature of structure encoding in protein sequences and a key to structural homology detection.

SUBMITTER: Scheraga HA 

PROVIDER: S-EPMC3986189 | biostudies-literature | 2014 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Homolog detection using global sequence properties suggests an alternate view of structural encoding in protein sequences.

Scheraga Harold A HA   Rackovsky S S  

Proceedings of the National Academy of Sciences of the United States of America 20140324 14


We show that a Fourier-based sequence distance function is able to identify structural homologs of target sequences with high accuracy. It is shown that Fourier distances correlate very strongly with independently determined structural distances between molecules, a property of the method that is not attainable using conventional representations. It is further shown that the ability of the Fourier approach to identify protein folds is statistically far in excess of random expectation. It is then  ...[more]

Similar Datasets

| S-EPMC9235477 | biostudies-literature
| S-EPMC4741009 | biostudies-literature
| S-EPMC3926946 | biostudies-other
| S-EPMC7838212 | biostudies-literature
| S-EPMC4682398 | biostudies-literature
| S-EPMC8496038 | biostudies-literature
| S-EPMC7490824 | biostudies-literature
| S-EPMC146671 | biostudies-other
| S-EPMC2732808 | biostudies-literature
| S-EPMC6707728 | biostudies-literature