Unknown

Dataset Information

0

FoldRec-C2C: protein fold recognition by combining cluster-to-cluster model and protein similarity network.


ABSTRACT: As a key for studying the protein structures, protein fold recognition is playing an important role in predicting the protein structures associated with COVID-19 and other important structures. However, the existing computational predictors only focus on the protein pairwise similarity or the similarity between two groups of proteins from 2-folds. However, the homology relationship among proteins is in a hierarchical structure. The global protein similarity network will contribute to the performance improvement. In this study, we proposed a predictor called FoldRec-C2C to globally incorporate the interactions among proteins into the prediction. For the FoldRec-C2C predictor, protein fold recognition problem is treated as an information retrieval task in nature language processing. The initial ranking results were generated by a surprised ranking algorithm Learning to Rank, and then three re-ranking algorithms were performed on the ranking lists to adjust the results globally based on the protein similarity network, including seq-to-seq model, seq-to-cluster model and cluster-to-cluster model (C2C). When tested on a widely used and rigorous benchmark dataset LINDAHL dataset, FoldRec-C2C outperforms other 34 state-of-the-art methods in this field. The source code and data of FoldRec-C2C can be downloaded from http://bliulab.net/FoldRec-C2C/download.

SUBMITTER: Shao J 

PROVIDER: S-EPMC7454262 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

FoldRec-C2C: protein fold recognition by combining cluster-to-cluster model and protein similarity network.

Shao Jiangyi J   Yan Ke K   Liu Bin B  

Briefings in bioinformatics 20210501 3


As a key for studying the protein structures, protein fold recognition is playing an important role in predicting the protein structures associated with COVID-19 and other important structures. However, the existing computational predictors only focus on the protein pairwise similarity or the similarity between two groups of proteins from 2-folds. However, the homology relationship among proteins is in a hierarchical structure. The global protein similarity network will contribute to the perform  ...[more]

Similar Datasets

| S-EPMC4481851 | biostudies-literature
| S-EPMC8768454 | biostudies-literature
| S-EPMC4669437 | biostudies-literature
| S-EPMC8507389 | biostudies-literature
| S-EPMC4071197 | biostudies-literature
| S-EPMC6044633 | biostudies-literature
| S-EPMC24434 | biostudies-literature
| S-EPMC5331664 | biostudies-literature
| S-EPMC2803855 | biostudies-other
| S-EPMC2430716 | biostudies-literature