Dataset Information

A Computational Framework for Predicting Direct Contacts and Substructures within Protein Complexes.

ABSTRACT: Understanding the physical arrangement of subunits within protein complexes potentially provides valuable clues about how the subunits work together and how the complexes function. The majority of recent research focuses on identifying protein complexes as a whole and seldom studies the inner structures within complexes. In this study, we propose a computational framework to predict direct contacts and substructures within protein complexes. In this framework, we first train a supervised learning model of l2-regularized logistic regression to learn the patterns of direct and indirect interactions within complexes, from where physical subunit interaction networks are predicted. Then, to infer substructures within complexes, we apply a graph clustering method (i.e., maximum modularity clustering (MMC)) and a gene ontology (GO) semantic similarity based functional clustering on partially- and fully-connected networks, respectively. Computational results show that the proposed framework achieves fairly good performance of cross validation and independent test in terms of detecting direct contacts between subunits. Functional analyses further demonstrate the rationality of partitioning the subunits into substructures via the MMC algorithm and functional clustering.

SUBMITTER: Mei S

PROVIDER: S-EPMC6921016 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A Computational Framework for Predicting Direct Contacts and Substructures within Protein Complexes.

Mei Suyu S Zhang Kun K

Biomolecules 20191025 11

Understanding the physical arrangement of subunits within protein complexes potentially provides valuable clues about how the subunits work together and how the complexes function. The majority of recent research focuses on identifying protein complexes as a whole and seldom studies the inner structures within complexes. In this study, we propose a computational framework to predict direct contacts and substructures within protein complexes. In this framework, we first train a supervised learnin ...[more]

PMID: 31717703

Dataset Information

A Computational Framework for Predicting Direct Contacts and Substructures within Protein Complexes.

Publications

A Computational Framework for Predicting Direct Contacts and Substructures within Protein Complexes.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

RegSNPs-intron: a computational framework for predicting pathogenic impact of intronic single nucleotide variants
2019-09-30 | GSE138130 | GEO

A computational framework for distinguishing direct versus indirect interactions in human functional protein-protein interaction networks.
| S-EPMC7238765 | biostudies-literature

DEEP: a general computational framework for predicting enhancers.
| S-EPMC4288148 | biostudies-literature

Probabilistic prediction of contacts in protein-ligand complexes.
| S-EPMC3498326 | biostudies-literature

Computational model explains high activity and rapid cycling of Rho GTPases within protein complexes.
| S-EPMC1676031 | biostudies-literature

Contacts-based prediction of binding affinity in protein-protein complexes.
| S-EPMC4523921 | biostudies-literature

Computational prediction of protein-protein complexes.
| S-EPMC3599296 | biostudies-literature

Computational frameworks for predicting protein interactions via single-cell proximity sequencing
2022-02-07 | GSE196130 | GEO

DNCON2_Inter: predicting interchain contacts for homodimeric and homomultimeric protein complexes using multiple sequence alignments of monomers and deep learning.
| S-EPMC8192766 | biostudies-literature

The protein-DNA contacts in RutR•carAB operator complexes.
| S-EPMC2952853 | biostudies-literature