Unknown

Dataset Information

0

Evolutionary approach to predicting the binding site residues of a protein from its primary sequence.


ABSTRACT: Protein binding site residues, especially catalytic residues, play a central role in protein function. Because more than 99% of the ? 12 million protein sequences in the nonredundant protein database have no structural information, it is desirable to develop methods to predict the binding site residues of a protein from its primary sequence. This task is highly challenging, because the binding site residues constitute only a small portion of a protein. However, the binding site residues of a protein are clustered in its functional pocket(s), and their spatial patterns tend to be conserved in evolution. To take advantage of these evolutionary and structural principles, we constructed a database of ? 50,000 templates (called the pocket-containing segment database), each of which includes not only a sequence segment that contains a functional pocket but also the structural attributes of the pocket. To use this database, we designed a template-matching technique, termed residue-matching profiling, and established a criterion for selecting templates for a query sequence. Finally, we developed a probabilistic model for assigning spatial scores to matched residues between the template and query sequence in local alignments using a set of selected scoring matrices and for computing the binding likelihood of each matched residue in the query sequence. From the likelihoods, one can predict the binding site residues in the query sequence. An automated computational pipeline was developed for our method. A performance evaluation shows that our method achieves a 70% precision in predicting binding site residues at 60% sensitivity.

SUBMITTER: Tseng YY 

PROVIDER: S-EPMC3069214 | biostudies-literature | 2011 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evolutionary approach to predicting the binding site residues of a protein from its primary sequence.

Tseng Yan Yuan YY   Li Wen-Hsiung WH   Li Wen-Hsiung WH  

Proceedings of the National Academy of Sciences of the United States of America 20110314 13


Protein binding site residues, especially catalytic residues, play a central role in protein function. Because more than 99% of the ∼ 12 million protein sequences in the nonredundant protein database have no structural information, it is desirable to develop methods to predict the binding site residues of a protein from its primary sequence. This task is highly challenging, because the binding site residues constitute only a small portion of a protein. However, the binding site residues of a pro  ...[more]

Similar Datasets

| S-EPMC2803200 | biostudies-literature
| S-EPMC2777313 | biostudies-literature
| S-EPMC6933864 | biostudies-literature
| S-EPMC5773889 | biostudies-literature
| S-EPMC7002541 | biostudies-literature
| S-EPMC5570179 | biostudies-literature
| S-EPMC3577447 | biostudies-literature
| S-EPMC4574261 | biostudies-literature
| S-EPMC3380730 | biostudies-literature
| S-EPMC2709252 | biostudies-literature