Dataset Information

Using structure to explore the sequence alignment space of remote homologs.


ABSTRACT: Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is "optimal" in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are "suboptimal" in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for "modelability", we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended.
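The distinction the abstract draws between the DP-optimal alignment and a lower-scoring reference alignment can be illustrated with a minimal sketch. It uses a standard Needleman-Wunsch global alignment with a linear gap penalty, which is a generic instance of DP alignment and not the authors' fragment-based method. The function names, the toy sequences, and the scoring values (match, mismatch, gap) are assumptions chosen for illustration only.

```python
def nw_score(a, b, match=1, mismatch=-1, gap=-2):
    """Best global alignment score of a and b (Needleman-Wunsch, linear gaps).

    Scoring values are illustrative assumptions, not taken from the paper.
    """
    rows, cols = len(a) + 1, len(b) + 1
    dp = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against the empty string costs one gap per residue.
    for i in range(1, rows):
        dp[i][0] = i * gap
    for j in range(1, cols):
        dp[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            dp[i][j] = max(
                dp[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                dp[i - 1][j] + gap,      # gap in b
                dp[i][j - 1] + gap,      # gap in a
            )
    return dp[-1][-1]


def alignment_score(aligned_a, aligned_b, match=1, mismatch=-1, gap=-2):
    """Score one specific gapped alignment ('-' marks a gap) under the same scheme."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned rows must have equal length")
    score = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            score += gap
        else:
            score += match if x == y else mismatch
    return score


# Hypothetical example: an alternative placement of the gap, standing in for
# an alignment derived from some other source (e.g. structural superposition).
optimal = nw_score("ACGT", "AGT")
alternative = alignment_score("ACGT", "AG-T")
print(optimal, alternative)
```

Any specific alignment scores at most as high as the DP optimum under the same scoring scheme, which is why a structurally correct alignment can be missed when only the top-scoring alignment is kept.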

SUBMITTER: Kuziemko A 

PROVIDER: S-EPMC3188491 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

Publications

Using structure to explore the sequence alignment space of remote homologs.

Andrew Kuziemko, Barry Honig, Donald Petrey

PLoS Computational Biology, 2011-10-06, issue 10


Similar Datasets

| S-EPMC1579236 | biostudies-literature
| S-EPMC9985440 | biostudies-literature
| S-EPMC4086137 | biostudies-literature
| S-EPMC2896139 | biostudies-literature
| S-EPMC1955456 | biostudies-literature
| S-EPMC3125758 | biostudies-literature
| S-EPMC5714223 | biostudies-literature
| S-EPMC2677745 | biostudies-literature
2024-10-10 | PXD050548 | Pride
| S-EPMC6353097 | biostudies-literature