Dataset Information

Improved protein structure prediction by deep learning irrespective of co-evolution information.

ABSTRACT: Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analyzing the efficacy of network size and co-evolution data and its performance on both natural and designed proteins. We show that a large ResNet (convolutional residual neural networks) can predict structures of correct folds for 26 out of 32 CASP13 free-modeling (FM) targets and L/5 long-range contacts with precision over 80%. When co-evolution is not used ResNet still can predict structures of correct folds for 18 CASP13 FM targets, greatly exceeding previous methods that do not use co-evolution either. Even with only primary sequence ResNet can predict structures of correct folds for all tested human-designed proteins. In addition, ResNet may fare better for the designed proteins when trained without co-evolution than with co-evolution. These results suggest that ResNet does not simply denoise co-evolution signals, but instead may learn important protein sequence-structure relationship. This has important implications on protein design and engineering especially when co-evolutionary data is unavailable.

SUBMITTER: Xu J

PROVIDER: S-EPMC8340610 | biostudies-literature | 2021 Jul

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Improved protein structure prediction by deep learning irrespective of co-evolution information.

Xu Jinbo J Mcpartlon Matthew M Li Jin J

Nature machine intelligence 20210520

Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analyzing the efficacy of network size and co-evolution data and its performance on both natural and designed proteins. We show that a large ResNet (convolutional residual neural networks) can predict structures of correct folds for 26 out of 32 CASP13 free-modeling ( ...[more]

PMID: 34368623

Dataset Information

Improved protein structure prediction by deep learning irrespective of co-evolution information.

Publications

Improved protein structure prediction by deep learning irrespective of co-evolution information.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Recent Applications of Deep Learning Methods on Evolution- and Contact-Based Protein Structure Prediction.
| S-EPMC8199773 | biostudies-literature

CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction.
| S-EPMC8100175 | biostudies-literature

Membrane protein contact and structure prediction using co-evolution in conjunction with machine learning.
| S-EPMC5443516 | biostudies-literature

SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning.
| S-EPMC7212484 | biostudies-literature

COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information.
| S-EPMC5793808 | biostudies-literature

Improved Prediction of Smoking Status via Isoform-Aware RNA-seq Deep Learning Models
2020-12-31 | GSE158699 | GEO

Improved protein structure refinement guided by deep learning based accuracy estimation.
| S-EPMC7910447 | biostudies-literature

Protein Secondary Structure Prediction With a Reductive Deep Learning Method.
| S-EPMC8240957 | biostudies-literature

Prediction of CD44 Structure by Deep Learning-Based Protein Modeling.
| S-EPMC10376988 | biostudies-literature

Protein structure prediction with energy minimization and deep learning approaches.
| S-EPMC10165305 | biostudies-literature