Unknown

Dataset Information

0

Improved protein structure prediction by deep learning irrespective of co-evolution information.


ABSTRACT: Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analyzing the efficacy of network size and co-evolution data and its performance on both natural and designed proteins. We show that a large ResNet (convolutional residual neural networks) can predict structures of correct folds for 26 out of 32 CASP13 free-modeling (FM) targets and L/5 long-range contacts with precision over 80%. When co-evolution is not used ResNet still can predict structures of correct folds for 18 CASP13 FM targets, greatly exceeding previous methods that do not use co-evolution either. Even with only primary sequence ResNet can predict structures of correct folds for all tested human-designed proteins. In addition, ResNet may fare better for the designed proteins when trained without co-evolution than with co-evolution. These results suggest that ResNet does not simply denoise co-evolution signals, but instead may learn important protein sequence-structure relationship. This has important implications on protein design and engineering especially when co-evolutionary data is unavailable.

SUBMITTER: Xu J 

PROVIDER: S-EPMC8340610 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8199773 | biostudies-literature
| S-EPMC8100175 | biostudies-literature
| S-EPMC5443516 | biostudies-literature
| S-EPMC7212484 | biostudies-literature
2020-12-31 | GSE158699 | GEO
| S-EPMC5793808 | biostudies-literature
| S-EPMC7910447 | biostudies-literature
| S-EPMC8240957 | biostudies-literature
| S-EPMC6851476 | biostudies-literature
| S-EPMC8671168 | biostudies-literature