Unknown

Dataset Information

0

Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.


ABSTRACT: Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly require template-free methods. However, template-free modeling methods are much less reliable and are usually applicable for smaller proteins, leaving much space for improvement. We present here a novel computational method that uses a library of supersecondary structure fragments, known as Smotifs, to model protein structures. The library of Smotifs has saturated over time, providing a theoretical foundation for efficient modeling. The method relies on weak sequence signals from remotely related protein structures to create a library of Smotif fragments specific to the target protein sequence. This Smotif library is exploited in a fragment assembly protocol to sample decoys, which are assessed by a composite scoring function. Since the Smotif fragments are larger in size compared to the ones used in other fragment-based methods, the proposed modeling algorithm, SmotifTF, can employ an exhaustive sampling during decoy assembly. SmotifTF successfully predicts the overall fold of the target proteins in about 50% of the test cases and performs competitively when compared to other state of the art prediction methods, especially when sequence signal to remote homologs is diminishing. Smotif-based modeling is complementary to current prediction methods and provides a promising direction in addressing the structure prediction problem, especially when targeting larger proteins for modeling.

SUBMITTER: Vallat B 

PROVIDER: S-EPMC4529212 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.

Vallat Brinda B   Madrid-Aliste Carlos C   Fiser Andras A  

PLoS computational biology 20150807 8


Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly require template-free methods. However, template-free modeling methods are much less reliable and are usually applicable for smaller proteins, leaving much space for improvement. We prese  ...[more]

Similar Datasets

| S-EPMC4619893 | biostudies-literature
| S-EPMC3203516 | biostudies-literature
| S-EPMC2785131 | biostudies-literature
| S-EPMC7007176 | biostudies-literature
| S-EPMC3259144 | biostudies-literature
| S-EPMC3984454 | biostudies-literature
| S-EPMC6017496 | biostudies-literature
| S-EPMC5035060 | biostudies-literature
| S-EPMC2373934 | biostudies-literature
| S-EPMC8141709 | biostudies-literature