Dataset Information

Simultaneous alignment and folding of protein sequences.


ABSTRACT: Accurate comparative analysis tools for low-homology proteins remain a difficult challenge in computational biology, especially for sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm's complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Tested against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise and multiple sequence alignment tools in the most difficult case, that of low sequence homology. It also improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families (partiFold-Align is available at http://partifold.csail.mit.edu/).

SUBMITTER: Waldispuhl J 

PROVIDER: S-EPMC4082353 | biostudies-literature | 2014 Jul

REPOSITORIES: biostudies-literature

Publications

Simultaneous alignment and folding of protein sequences.

Waldispühl J, O'Donnell CW, Will S, Devadas S, Backofen R, Berger B

Journal of Computational Biology: a journal of computational molecular cell biology, 2014-04-25, issue 7


Similar Datasets

| S-EPMC4514930 | biostudies-literature
| S-EPMC2280052 | biostudies-literature
| S-EPMC3587630 | biostudies-literature
| S-EPMC11255384 | biostudies-literature
| S-EPMC7666477 | biostudies-literature
| S-EPMC196869 | biostudies-literature
| S-EPMC2770007 | biostudies-literature
| S-EPMC2447753 | biostudies-literature
| S-EPMC3360771 | biostudies-literature
| S-EPMC6391537 | biostudies-literature