Unknown

Dataset Information

0

RDb2C2: an improved method to identify the residue-residue pairing in ? strands.


ABSTRACT: BACKGROUND:Despite the great advance of protein structure prediction, accurate prediction of the structures of mainly ? proteins is still highly challenging, but could be assisted by the knowledge of residue-residue pairing in ? strands. Previously, we proposed a ridge-detection-based algorithm RDb2C that adopted a multi-stage random forest framework to predict the ?-? pairing given the amino acid sequence of a protein. RESULTS:In this work, we developed a second version of this algorithm, RDb2C2, by employing the residual neural network to further enhance the prediction accuracy. In the benchmark test, this new algorithm improves the F1-score by >?10 percentage points, reaching impressively high values of ~?72% and ~?73% in the BetaSheet916 and BetaSheet1452 sets, respectively. CONCLUSION:Our new method promotes the prediction accuracy of ?-? pairing to a new level and the prediction results could better assist the structure modeling of mainly ? proteins. We prepared an online server of RDb2C2 at http://structpred.life.tsinghua.edu.cn/rdb2c2.html.

SUBMITTER: Shao D 

PROVIDER: S-EPMC7126467 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

RDb<sub>2</sub>C2: an improved method to identify the residue-residue pairing in β strands.

Shao Di D   Mao Wenzhi W   Xing Yaoguang Y   Gong Haipeng H  

BMC bioinformatics 20200403 1


<h4>Background</h4>Despite the great advance of protein structure prediction, accurate prediction of the structures of mainly β proteins is still highly challenging, but could be assisted by the knowledge of residue-residue pairing in β strands. Previously, we proposed a ridge-detection-based algorithm RDb<sub>2</sub>C that adopted a multi-stage random forest framework to predict the β-β pairing given the amino acid sequence of a protein.<h4>Results</h4>In this work, we developed a second versio  ...[more]

Similar Datasets

| S-EPMC5907701 | biostudies-literature
| S-EPMC6247404 | biostudies-literature
| S-EPMC1761434 | biostudies-literature
| S-EPMC2882391 | biostudies-literature
| S-EPMC3828260 | biostudies-literature
| S-EPMC8486102 | biostudies-literature
| S-EPMC7450584 | biostudies-literature
2016-07-05 | E-GEOD-83992 | biostudies-arrayexpress
2016-07-05 | GSE83992 | GEO
| S-EPMC5581475 | biostudies-literature