Unknown

Dataset Information

0

A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction.


ABSTRACT: Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are benchmarked on 538 non-redundant proteins for protein fold-recognition on a uniform template library. Results demonstrate dominant advantage of profile-profile based methods, which generate models with average TM-score 26.5% higher than sequence-profile methods and 49.8% higher than sequence-sequence alignment methods. There is no obvious difference in results between methods with profiles generated from PSI-BLAST PSSM matrix and hidden Markov models. Accuracy of profile-profile alignments can be further improved by 9.6% or 21.4% when predicted or native structure features are incorporated. Nevertheless, TM-scores from profile-profile methods including experimental structural features are still 37.1% lower than that from TM-align, demonstrating that the fold-recognition problem cannot be solved solely by improving accuracy of structure feature predictions.

SUBMITTER: Yan R 

PROVIDER: S-EPMC3965362 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction.

Yan Renxiang R   Xu Dong D   Yang Jianyi J   Walker Sara S   Zhang Yang Y  

Scientific reports 20130101


Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are benchmarked on 538 non-redundant proteins for protein fold-recognition on a uniform template library. Results demonstrate dominant advantage of profile-profile based methods, which generate models with a  ...[more]

Similar Datasets

| S-EPMC4065584 | biostudies-literature
| S-EPMC5408826 | biostudies-other
| S-EPMC1579236 | biostudies-literature
| S-EPMC2039753 | biostudies-literature
| S-EPMC6927249 | biostudies-literature
| S-EPMC5897042 | biostudies-literature
| S-EPMC8100175 | biostudies-literature
| S-EPMC3076375 | biostudies-literature
| S-EPMC2803199 | biostudies-literature
| S-EPMC6311937 | biostudies-literature