Proteomics

Dataset Information

0

A method for independent estimation of false localisation rate for phosphoproteomics


ABSTRACT: Phosphoproteomics methods are commonly employed in labs to identify and quantify the sites of phosphorylation on proteins. In recent years, various software tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified, or to estimate the global false localisation rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic data sets, and their statistical reliability on real datasets is largely unknown. As a result, there is considerable problem in the field of reporting incorrectly localised phosphosites, due to inadequate statistical control. In this work, we develop the concept of using scoring and ranking modifications on a decoy amino acid, i.e. one that cannot be modified, to allow for independent estimation of global FLR. We test a variety of different amino acids to act as the decoy, on both synthetic and real data sets, demonstrating that the amino acid selection can make a substantial difference to the estimated global FLR. We conclude that while several different amino acids might be appropriate, the most reliable FLR results were achieved using alanine and leucine as decoys, although we have a preference for alanine due to the risk of potential confusion between leucine and isoleucine amino acids. We propose that the phosphoproteomics field should adopt the use of a decoy amino acid, so that there is better control of false reporting in the literature, and in public databases that re-distribute the data.

INSTRUMENT(S): Orbitrap Fusion ETD, Q Exactive

ORGANISM(S): Homo Sapiens (human) Arabidopsis Thaliana (mouse-ear Cress)

TISSUE(S): Plant Cell Line, Epithelial Cell, Cell Culture

SUBMITTER: Kerry Ramsbottom  

LAB HEAD: Prof. Andrew R Jones

PROVIDER: PXD028840 | Pride | 2022-06-09

REPOSITORIES: Pride

altmetric image

Publications

Method for Independent Estimation of the False Localization Rate for Phosphoproteomics.

Ramsbottom Kerry A KA   Prakash Ananth A   Riverol Yasset Perez YP   Camacho Oscar Martin OM   Martin Maria-Jesus MJ   Vizcaíno Juan Antonio JA   Deutsch Eric W EW   Jones Andrew R AR  

Journal of proteome research 20220531 7


Phosphoproteomic methods are commonly employed to identify and quantify phosphorylation sites on proteins. In recent years, various tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified or to estimate the global false localization rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic datasets, and their statistical reliability on real datasets is largely u  ...[more]

Similar Datasets

2012-04-01 | E-GEOD-32512 | biostudies-arrayexpress
2016-04-15 | E-MTAB-4615 | biostudies-arrayexpress
2012-04-02 | GSE32512 | GEO
2018-04-04 | GSE112643 | GEO
2014-08-22 | GSE60588 | GEO
2013-04-06 | GSE45556 | GEO
2013-05-06 | GSE46651 | GEO
2022-04-06 | GSE199975 | GEO
2018-07-19 | GSE113751 | GEO
2013-04-06 | E-GEOD-45556 | biostudies-arrayexpress