Dataset Information

FilterDCA: Interpretable supervised contact prediction using inter-domain coevolution.

ABSTRACT: Predicting three-dimensional protein structure and assembling protein complexes using sequence information belongs to the most prominent tasks in computational biology. Recently substantial progress has been obtained in the case of single proteins using a combination of unsupervised coevolutionary sequence analysis with structurally supervised deep learning. While reaching impressive accuracies in predicting residue-residue contacts, deep learning has a number of disadvantages. The need for large structural training sets limits the applicability to multi-protein complexes; and their deep architecture makes the interpretability of the convolutional neural networks intrinsically hard. Here we introduce FilterDCA, a simpler supervised predictor for inter-domain and inter-protein contacts. It is based on the fact that contact maps of proteins show typical contact patterns, which results from secondary structure and are reflected by patterns in coevolutionary analysis. We explicitly integrate averaged contacts patterns with coevolutionary scores derived by Direct Coupling Analysis, improving performance over standard coevolutionary analysis, while remaining fully transparent and interpretable. The FilterDCA code is available at http://gitlab.lcqb.upmc.fr/muscat/FilterDCA.

SUBMITTER: Muscat M

PROVIDER: S-EPMC7577475 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

FilterDCA: Interpretable supervised contact prediction using inter-domain coevolution.

Muscat Maureen M Croce Giancarlo G Sarti Edoardo E Weigt Martin M

PLoS computational biology 20201009 10

Predicting three-dimensional protein structure and assembling protein complexes using sequence information belongs to the most prominent tasks in computational biology. Recently substantial progress has been obtained in the case of single proteins using a combination of unsupervised coevolutionary sequence analysis with structurally supervised deep learning. While reaching impressive accuracies in predicting residue-residue contacts, deep learning has a number of disadvantages. The need for larg ...[more]

PMID: 33035205

Dataset Information

FilterDCA: Interpretable supervised contact prediction using inter-domain coevolution.

Publications

FilterDCA: Interpretable supervised contact prediction using inter-domain coevolution.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Interpretable adenylation domain specificity prediction using protein language models.
| S-EPMC11761653 | biostudies-literature

Protein inter-residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14.
| S-EPMC8616805 | biostudies-literature

Multi-domain and complex protein structure prediction using inter-domain interactions from deep learning.
| S-EPMC10692239 | biostudies-literature

Prediction of inter-residue contact clusters from hydrophobic cores.
| S-EPMC2929137 | biostudies-literature

Enhanced Inter-helical Residue Contact Prediction in Transmembrane Proteins.
| S-EPMC3164537 | biostudies-literature

Enhancing coevolution-based contact prediction by imposing structural self-consistency of the contacts.
| S-EPMC6057941 | biostudies-literature

Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.
| S-EPMC5820155 | biostudies-literature

On the effect of phylogenetic correlations in coevolution-based contact prediction in proteins.
| S-EPMC8177639 | biostudies-literature

Protein inter-domain linker prediction using Random Forest and amino acid physiochemical properties.
| S-EPMC4290662 | biostudies-literature

DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure.
| S-EPMC6324825 | biostudies-literature