Dataset Information

Network-based auto-probit modeling for protein function prediction.

ABSTRACT: Predicting the functional roles of proteins based on various genome-wide data, such as protein-protein association networks, has become a canonical problem in computational biology. Approaching this task as a binary classification problem, we develop a network-based extension of the spatial auto-probit model. In particular, we develop a hierarchical Bayesian probit-based framework for modeling binary network-indexed processes, with a latent multivariate conditional autoregressive Gaussian process. The latter allows for the easy incorporation of protein-protein association network topologies-either binary or weighted-in modeling protein functional similarity. We use this framework to predict protein functions, for functions defined as terms in the Gene Ontology (GO) database, a popular rigorous vocabulary for biological functionality. Furthermore, we show how a natural extension of this framework can be used to model and correct for the high percentage of false negative labels in training data derived from GO, a serious shortcoming endemic to biological databases of this type. Our method performance is evaluated and compared with standard algorithms on weighted yeast protein-protein association networks, extracted from a recently developed integrative database called Search Tool for the Retrieval of INteracting Genes/proteins (STRING). Results show that our basic method is competitive with these other methods, and that the extended method-incorporating the uncertainty in negative labels among the training data-can yield nontrivial improvements in predictive accuracy.

SUBMITTER: Jiang X

PROVIDER: S-EPMC3116961 | biostudies-literature | 2011 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Network-based auto-probit modeling for protein function prediction.

Jiang Xiaoyu X Gold David D Kolaczyk Eric D ED

Biometrics 20101206 3

Predicting the functional roles of proteins based on various genome-wide data, such as protein-protein association networks, has become a canonical problem in computational biology. Approaching this task as a binary classification problem, we develop a network-based extension of the spatial auto-probit model. In particular, we develop a hierarchical Bayesian probit-based framework for modeling binary network-indexed processes, with a latent multivariate conditional autoregressive Gaussian proces ...[more]

PMID: 21133881

Dataset Information

Network-based auto-probit modeling for protein function prediction.

Publications

Network-based auto-probit modeling for protein function prediction.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Network capacity with probit-based stochastic user equilibrium problem.
| S-EPMC5298322 | biostudies-literature

NetQuilt: Deep Multispecies Network-based Protein Function Prediction using Homology-informed Network Similarity.
| S-EPMC8388039 | biostudies-literature

Protein Function Prediction Based on PPI Networks: Network Reconstruction vs Edge Enrichment.
| S-EPMC8712557 | biostudies-literature

NPF:network propagation for protein function prediction.
| S-EPMC7430911 | biostudies-literature

Bayesian Markov Random Field analysis for protein function prediction based on network data.
| S-EPMC2827541 | biostudies-literature

deepNF: deep network fusion for protein function prediction.
| S-EPMC6223364 | biostudies-literature

Colorectal Cancer Prediction Based on Weighted Gene Co-Expression Network Analysis and Variational Auto-Encoder.
| S-EPMC7563725 | biostudies-literature

Network-based prediction of protein interactions.
| S-EPMC6423278 | biostudies-literature

FunPred 3.0: improved protein function prediction using protein interaction network.
| S-EPMC6535044 | biostudies-literature

ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network.
| S-EPMC6151571 | biostudies-literature