Unknown

Dataset Information

0

Method for Independent Estimation of the False Localization Rate for Phosphoproteomics.


ABSTRACT: Phosphoproteomic methods are commonly employed to identify and quantify phosphorylation sites on proteins. In recent years, various tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified or to estimate the global false localization rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic datasets, and their statistical reliability on real datasets is largely unknown, potentially leading to studies reporting incorrectly localized phosphosites, due to inadequate statistical control. In this work, we develop the concept of scoring modifications on a decoy amino acid, that is, one that cannot be modified, to allow for independent estimation of global FLR. We test a variety of amino acids, on both synthetic and real data sets, demonstrating that the selection can make a substantial difference to the estimated global FLR. We conclude that while several different amino acids might be appropriate, the most reliable FLR results were achieved using alanine and leucine as decoys. We propose the use of a decoy amino acid to control false reporting in the literature and in public databases that re-distribute the data. Data are available via ProteomeXchange with identifier PXD028840.

SUBMITTER: Ramsbottom KA 

PROVIDER: S-EPMC9251759 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Method for Independent Estimation of the False Localization Rate for Phosphoproteomics.

Ramsbottom Kerry A KA   Prakash Ananth A   Riverol Yasset Perez YP   Camacho Oscar Martin OM   Martin Maria-Jesus MJ   Vizcaíno Juan Antonio JA   Deutsch Eric W EW   Jones Andrew R AR  

Journal of proteome research 20220531 7


Phosphoproteomic methods are commonly employed to identify and quantify phosphorylation sites on proteins. In recent years, various tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified or to estimate the global false localization rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic datasets, and their statistical reliability on real datasets is largely u  ...[more]

Similar Datasets

2022-06-09 | PXD028840 | Pride
| S-EPMC10119288 | biostudies-literature
2023-04-04 | PXD037580 |
| S-EPMC3820951 | biostudies-literature
| S-EPMC5944926 | biostudies-literature
| S-EPMC4533616 | biostudies-literature
| S-EPMC9623924 | biostudies-literature
| S-EPMC6708216 | biostudies-literature
| S-EPMC3372940 | biostudies-literature
| S-EPMC8155551 | biostudies-literature