Unknown

Dataset Information

0

PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning.


ABSTRACT: Proteome-wide identification of protein-protein interactions is a formidable task which has yet to be sufficiently addressed by experimental methodologies. Many computational methods have been developed to predict proteome-wide interaction networks, but few leverage both the sensitivity of structural information and the wide availability of sequence data. We present PEPPI, a pipeline which integrates structural similarity, sequence similarity, functional association data, and machine learning-based classification through a naïve Bayesian classifier model to accurately predict protein-protein interactions at a proteomic scale. Through benchmarking against a set of 798 ground truth interactions and an equal number of non-interactions, we have found that PEPPI attains 4.5% higher AUROC than the best of other state-of-the-art methods. As a proteomic-scale application, PEPPI was applied to model the interactions which occur between SARS-CoV-2 and human host cells during coronavirus infection, where 403 high-confidence interactions were identified with predictions covering 73% of a gold standard dataset from PSICQUIC and demonstrating significant complementarity with the most recent high-throughput experiments. PEPPI is available both as a webserver and in a standalone version and should be a powerful and generally applicable tool for computational screening of protein-protein interactions.

SUBMITTER: Bell EW 

PROVIDER: S-EPMC8897833 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4985167 | biostudies-literature
| S-EPMC9403031 | biostudies-literature
| S-EPMC10635286 | biostudies-literature
| S-EPMC4489281 | biostudies-literature
| S-EPMC5871981 | biostudies-literature
| S-EPMC9754964 | biostudies-literature
| S-EPMC7815773 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC5445391 | biostudies-literature
| S-EPMC9995192 | biostudies-literature