Unknown

Dataset Information

0

A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences.


ABSTRACT: Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of these methods is particularly time consuming. To solve this issue, we have proposed a novel matrix sequence coding method. Based on deep neural network (DNN) and a novel matrix protein sequence descriptor, we constructed a protein interaction prediction model for predicting PPIs. When performed on human PPIs data, the method achieved an accuracy of 94.34%, a recall of 98.28%, an area under the curve (AUC) of 97.79% and a loss of 23.25%. A non-redundant dataset was used to evaluate this prediction model, and the prediction accuracy is 88.29%. These results indicate that the matrix of sequence (MOS) descriptor can enhance the predictive power of PPIs and reduce training time, which can be a useful complement for future proteomics research. The experimental code and experimental results can be found at https://github.com/smalltalkman/hppi-tensorflow.

SUBMITTER: Wang X 

PROVIDER: S-EPMC6555512 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences.

Wang Xue X   Wu Yuejin Y   Wang Rujing R   Wei Yuanyuan Y   Gui Yuanmiao Y  

PloS one 20190607 6


Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of these methods is particularly time consuming. To solve this issue, we have proposed a novel matrix sequence coding method. Based on deep neural network (DNN) and a novel matrix protein sequence descr  ...[more]

Similar Datasets

| S-EPMC5133789 | biostudies-literature
| S-EPMC2775920 | biostudies-literature
| S-EPMC5549711 | biostudies-other
| S-EPMC4047675 | biostudies-literature
| S-EPMC2441995 | biostudies-literature
| S-EPMC1847686 | biostudies-literature
| S-EPMC4682391 | biostudies-literature
| S-EPMC1624854 | biostudies-literature
| S-EPMC1838603 | biostudies-literature
| S-EPMC8253547 | biostudies-literature