Unknown

Dataset Information

0

Discovery of phosphorylation motif mixtures in phosphoproteomics data.


ABSTRACT:

Motivation

Modification of proteins via phosphorylation is a primary mechanism for signal transduction in cells. Phosphorylation sites on proteins are determined in part through particular patterns, or motifs, present in the amino acid sequence.

Results

We describe an algorithm that simultaneously discovers multiple motifs in a set of peptides that were phosphorylated by several different kinases. Such sets of peptides are routinely produced in proteomics experiments.Our motif-finding algorithm uses the principle of minimum description length to determine a mixture of sequence motifs that distinguish a foreground set of phosphopeptides from a background set of unphosphorylated peptides. We show that our algorithm outperforms existing motif-finding algorithms on synthetic datasets consisting of mixtures of known phosphorylation sites. We also derive a motif specificity score that quantifies whether or not the phosphoproteins containing an instance of a motif have a significant number of known interactions. Application of our motif-finding algorithm to recently published human and mouse proteomic studies recovers several known phosphorylation motifs and reveals a number of novel motifs that are enriched for interactions with a particular kinase or phosphatase. Our tools provide a new approach for uncovering the sequence specificities of uncharacterized kinases or phosphatases.

SUBMITTER: Ritz A 

PROVIDER: S-EPMC2638929 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC3459425 | biostudies-literature
| S-EPMC7244511 | biostudies-literature
| S-EPMC3494138 | biostudies-literature
| S-EPMC3371830 | biostudies-literature
| S-EPMC3682869 | biostudies-other
| S-EPMC3102080 | biostudies-literature
| S-EPMC5468353 | biostudies-literature
| S-EPMC5996464 | biostudies-literature
| S-EPMC4176157 | biostudies-literature
| S-EPMC3834837 | biostudies-other