
Dataset Information


Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.


ABSTRACT: Compared to the available protein sequences of different organisms, the number of known protein-protein interactions (PPIs) is still very limited, so many computational methods have been developed to facilitate the identification of novel PPIs. Methods that use only protein sequence information are more universally applicable than those that depend on additional information or predictions about the proteins. In this article, a sequence-based method is proposed that combines a new feature representation using auto covariance (AC) with a support vector machine (SVM). AC accounts for the interactions between residues a certain distance apart in the sequence, so the method takes neighbouring effects into account. When applied to the PPI data of the yeast Saccharomyces cerevisiae, the method achieved a very promising prediction result. An independent data set of 11,474 yeast PPIs was used to evaluate the prediction model, and the prediction accuracy was 88.09%. The performance of this method is superior to that of existing sequence-based methods, so it can be a useful supplementary tool for future proteomics studies. The prediction software and all data sets used in this article are freely available at http://www.scucic.cn/Predict_PPI/index.htm.
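To make the auto covariance idea concrete, here is a minimal sketch of how a variable-length sequence might be turned into a fixed-length feature vector. It is an illustration under assumptions, not the authors' released software: the function names (auto_covariance, encode_protein, encode_pair) are hypothetical, the Kyte-Doolittle hydropathy scale stands in for whichever residue descriptors the article uses, and the maximum lag is a free parameter. Concatenating the two proteins' vectors is likewise one plausible way to represent a pair.

```python
from statistics import fmean

# Kyte-Doolittle hydropathy values; an illustrative property scale only
# (assumption: the article's own descriptors are not listed on this page).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def auto_covariance(values, max_lag):
    """Return [AC(1), ..., AC(max_lag)] for one numeric property profile.

    AC(lag) is the mean product of mean-centred values that lie
    `lag` positions apart along the sequence.
    """
    n = len(values)
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    mean = fmean(values)
    centered = [v - mean for v in values]
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def encode_protein(sequence, scales, max_lag):
    """Concatenate the AC values of every property scale for one sequence."""
    features = []
    for scale in scales:
        profile = [scale[residue] for residue in sequence]
        features.extend(auto_covariance(profile, max_lag))
    return features


def encode_pair(seq_a, seq_b, scales, max_lag):
    """Represent a candidate interaction by joining both proteins' vectors."""
    return encode_protein(seq_a, scales, max_lag) + encode_protein(seq_b, scales, max_lag)
```

Each protein yields len(scales) * max_lag values regardless of its length, so a pair always maps to a vector of the same size. Vectors of this kind, labelled as interacting or non-interacting, are what a classifier such as an SVM would then be trained on.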

SUBMITTER: Guo Y 

PROVIDER: S-EPMC2396404 | biostudies-literature | 2008 May

REPOSITORIES: biostudies-literature


Publications

Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.

Guo Yanzhi, Yu Lezheng, Wen Zhining, Li Menglong

Nucleic Acids Research, 2008-04-04, issue 9



Similar Datasets

S-EPMC3292016 | biostudies-literature
S-EPMC6016146 | biostudies-literature
S-EPMC2742725 | biostudies-literature
S-EPMC5056084 | biostudies-other
S-EPMC2967561 | biostudies-literature
S-EPMC5662531 | biostudies-other
S-EPMC7412107 | biostudies-literature
S-EPMC2627892 | biostudies-other
S-EPMC5409512 | biostudies-literature
S-EPMC9056223 | biostudies-literature