Supervised learning and prediction of physical interactions between human and HIV proteins.
Ontology highlight
ABSTRACT: Infectious diseases result in millions of deaths each year. Physical interactions between pathogen and host proteins often form the basis of such infections. While a number of methods have been proposed for predicting protein-protein interactions (PPIs), they have primarily focused on intra-species protein-protein interactions.We present an application of a supervised learning method for predicting physical interactions between host and pathogen proteins, using the human-HIV system. Using a Support Vector Machine with a linear kernel, we explore the use of a number of features including domain profiles, protein sequence k-mers, and properties of human proteins in a human PPI network. We achieve the best cross-validation performance when we use a combination of all three of these features. At a precision value of 70% we obtain recall values greater than 40%, depending on the ratio of positive examples to negative examples used during training. We use a classifier trained using these features to predict new PPIs between human and HIV proteins. We focus our discussion on those predicted interactions that involve human proteins known to be critical for HIV replication and propagation. Examples of predicted interactions with support in the literature include those necessary for viral attachment to the host membrane and subsequent invasion of the host cell.Unlike intra-species PPIs, host-pathogen PPIs have not yet been experimentally detected on a large scale, though they are likely to play important roles in pathogenesis and disease outcomes. Computational methods that can robustly and accurately predict host-pathogen PPIs hold the promise of guiding future experiments and gaining insights into potential mechanisms of pathogenesis.
SUBMITTER: Dyer MD
PROVIDER: S-EPMC3134873 | biostudies-literature | 2011 Jul
REPOSITORIES: biostudies-literature
ACCESS DATA