Unknown

Dataset Information

0

Prediction of kinase-specific phosphorylation sites using conditional random fields.


ABSTRACT:

Motivation

Phosphorylation is a crucial post-translational protein modification mechanism with important regulatory functions in biological systems. It is catalyzed by a group of enzymes called kinases, each of which recognizes certain target sites in its substrate proteins. Several authors have built computational models trained from sets of experimentally validated phosphorylation sites to predict these target sites for each given kinase. All of these models suffer from certain limitations, such as the fact that they do not take into account the dependencies between amino acid motifs within protein sequences in a global fashion.

Results

We propose a novel approach to predict phosphorylation sites from the protein sequence. The method uses a positive dataset to train a conditional random field (CRF) model. The negative training dataset is used to specify the decision threshold corresponding to a desired false positive rate. Application of the method on experimentally verified benchmark phosphorylation data (Phospho.ELM) shows that it performs well compared to existing methods for most kinases. This is to our knowledge that the first report of the use of CRFs to predict post-translational modification sites in protein sequences.

Availability

The source code of the implementation, called CRPhos, is available from http://www.ptools.ua.ac.be/CRPhos/

SUBMITTER: Dang TH 

PROVIDER: S-EPMC2639296 | biostudies-literature | 2008 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prediction of kinase-specific phosphorylation sites using conditional random fields.

Dang Thanh Hai TH   Van Leemput Koenraad K   Verschoren Alain A   Laukens Kris K  

Bioinformatics (Oxford, England) 20081020 24


<h4>Motivation</h4>Phosphorylation is a crucial post-translational protein modification mechanism with important regulatory functions in biological systems. It is catalyzed by a group of enzymes called kinases, each of which recognizes certain target sites in its substrate proteins. Several authors have built computational models trained from sets of experimentally validated phosphorylation sites to predict these target sites for each given kinase. All of these models suffer from certain limitat  ...[more]

Similar Datasets

| S-EPMC6935449 | biostudies-literature
| S-EPMC3412434 | biostudies-other
| S-EPMC2386138 | biostudies-literature
| S-EPMC3009482 | biostudies-literature
| S-EPMC3243135 | biostudies-literature
| S-EPMC3341732 | biostudies-other
| S-EPMC2651179 | biostudies-literature
| S-EPMC4372350 | biostudies-literature
| S-EPMC3101956 | biostudies-literature
| S-EPMC2387219 | biostudies-literature