Unknown

Dataset Information

0

Reverse Nearest Neighbor Search on a Protein-Protein Interaction Network to Infer Protein-Disease Associations.


ABSTRACT: The associations between proteins and diseases are crucial information for investigating pathological mechanisms. However, the number of known and reliable protein-disease associations is quite small. In this study, an analysis framework to infer associations between proteins and diseases was developed based on a large data set of a human protein-protein interaction network integrating an effective network search, namely, the reverse k-nearest neighbor (RkNN) search. The RkNN search was used to identify an impact of a protein on other proteins. Then, associations between proteins and diseases were inferred statistically. The method using the RkNN search yielded a much higher precision than a random selection, standard nearest neighbor search, or when applying the method to a random protein-protein interaction network. All protein-disease pair candidates were verified by a literature search. Supporting evidence for 596 pairs was identified. In addition, cluster analysis of these candidates revealed 10 promising groups of diseases to be further investigated experimentally. This method can be used to identify novel associations to better understand complex relationships between proteins and diseases.

SUBMITTER: Suratanee A 

PROVIDER: S-EPMC5513527 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reverse Nearest Neighbor Search on a Protein-Protein Interaction Network to Infer Protein-Disease Associations.

Suratanee Apichat A   Plaimas Kitiporn K  

Bioinformatics and biology insights 20170713


The associations between proteins and diseases are crucial information for investigating pathological mechanisms. However, the number of known and reliable protein-disease associations is quite small. In this study, an analysis framework to infer associations between proteins and diseases was developed based on a large data set of a human protein-protein interaction network integrating an effective network search, namely, the reverse <i>k</i>-nearest neighbor (R<i>k</i>NN) search. The R<i>k</i>N  ...[more]

Similar Datasets

| S-EPMC9035839 | biostudies-literature
| S-EPMC3476332 | biostudies-literature
| S-EPMC3098085 | biostudies-literature
| S-EPMC2951634 | biostudies-literature
| S-EPMC2967780 | biostudies-literature
| S-EPMC1383574 | biostudies-literature
| S-EPMC9755128 | biostudies-literature
| S-EPMC6895745 | biostudies-literature
| S-EPMC6753955 | biostudies-literature
| S-EPMC4968729 | biostudies-literature