Unknown

Dataset Information

0

Large-scale identification of human protein function using topological features of interaction network.


ABSTRACT: The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and their functions. Therefore, it is indispensable to develop a computational method for the annotation of protein function. Herein, a novel method is proposed to identify protein function based on the weighted human protein-protein interaction network and graph theory. The network topology features with local and global information are presented to characterise proteins. The minimum redundancy maximum relevance algorithm is used to select 227 optimized feature subsets and support vector machine technique is utilized to build the prediction models. The performance of current method is assessed through 10-fold cross-validation test, and the range of accuracies is from 67.63% to 100%. Comparing with other annotation methods, the proposed way possesses a 50% improvement in the predictive accuracy. Generally, such network topology features provide insights into the relationship between protein functions and network architectures. The source code of Matlab is freely available on request from the authors.

SUBMITTER: Li Z 

PROVIDER: S-EPMC5111120 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Large-scale identification of human protein function using topological features of interaction network.

Li Zhanchao Z   Liu Zhiqing Z   Zhong Wenqian W   Huang Menghua M   Wu Na N   Xie Yun Y   Dai Zong Z   Zou Xiaoyong X  

Scientific reports 20161116


The annotation of protein function is a vital step to elucidate the essence of life at a molecular level, and it is also meritorious in biomedical and pharmaceutical industry. Developments of sequencing technology result in constant expansion of the gap between the number of the known sequences and their functions. Therefore, it is indispensable to develop a computational method for the annotation of protein function. Herein, a novel method is proposed to identify protein function based on the w  ...[more]

Similar Datasets

| S-EPMC5341041 | biostudies-literature
| S-EPMC6221071 | biostudies-literature
| S-EPMC5107936 | biostudies-literature
| S-EPMC1457052 | biostudies-literature
| S-EPMC8294856 | biostudies-literature
| S-EPMC6602452 | biostudies-literature
| S-EPMC6535044 | biostudies-literature
| S-EPMC4304735 | biostudies-literature
| S-EPMC3794914 | biostudies-literature
| S-EPMC7229196 | biostudies-literature