Unknown

Dataset Information

0

Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns.


ABSTRACT: Inferring protein functions from structures is a challenging task, as a large number of orphan protein structures from structural genomics project are now solved without their biochemical functions characterized. For proteins binding to similar substrates or ligands and carrying out similar functions, their binding surfaces are under similar physicochemical constraints, and hence the sets of allowed and forbidden residue substitutions are similar. However, it is difficult to isolate such selection pressure due to protein function from selection pressure due to protein folding, and evolutionary relationship reflected by global sequence and structure similarities between proteins is often unreliable for inferring protein function. We have developed a method, called pevoSOAR (pocket-based evolutionary search of amino acid residues), for predicting protein functions by solving the problem of uncovering amino acids residue substitution pattern due to protein function and separating it from amino acids substitution pattern due to protein folding. We incorporate evolutionary information specific to an individual binding region and match local surfaces on a large scale with millions of precomputed protein surfaces to identify those with similar functions. Our pevoSOAR method also generates a probablistic model called the computed binding a profile that characterizes protein-binding activities that may involve multiple substrates or ligands. We show that our method can be used to predict enzyme functions with accuracy. Our method can also assess enzyme binding specificity and promiscuity. In an objective large-scale test of 100 enzyme families with thousands of structures, our predictions are found to be sensitive and specific: At the stringent specificity level of 99.98%, we can correctly predict enzyme functions for 80.55% of the proteins. The overall area under the receiver operating characteristic curve measuring the performance of our prediction is 0.955, close to the perfect value of 1.00. The best Matthews coefficient is 86.6%. Our method also works well in predicting the biochemical functions of orphan proteins from structural genomics projects.

SUBMITTER: Tseng YY 

PROVIDER: S-EPMC2670802 | biostudies-literature | 2009 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns.

Tseng Yan Yuan YY   Dundas Joseph J   Liang Jie J  

Journal of molecular biology 20090106 2


Inferring protein functions from structures is a challenging task, as a large number of orphan protein structures from structural genomics project are now solved without their biochemical functions characterized. For proteins binding to similar substrates or ligands and carrying out similar functions, their binding surfaces are under similar physicochemical constraints, and hence the sets of allowed and forbidden residue substitutions are similar. However, it is difficult to isolate such selecti  ...[more]

Similar Datasets

| S-EPMC2882714 | biostudies-literature
| S-EPMC3398928 | biostudies-literature
| S-EPMC3100846 | biostudies-literature
| S-EPMC3949165 | biostudies-literature
| S-EPMC4979896 | biostudies-literature
| S-EPMC3386866 | biostudies-literature
| S-EPMC3289072 | biostudies-literature
| S-EPMC2626596 | biostudies-literature
2024-07-03 | GSE270411 | GEO
| S-EPMC3585919 | biostudies-literature