A method for independent estimation of false localisation rate for phosphoproteomics
Ontology highlight
ABSTRACT: Phosphoproteomics methods are commonly employed in labs to identify and quantify the sites of phosphorylation on proteins. In recent years, various software tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified, or to estimate the global false localisation rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic data sets, and their statistical reliability on real datasets is largely unknown. As a result, there is considerable problem in the field of reporting incorrectly localised phosphosites, due to inadequate statistical control. In this work, we develop the concept of using scoring and ranking modifications on a decoy amino acid, i.e. one that cannot be modified, to allow for independent estimation of global FLR. We test a variety of different amino acids to act as the decoy, on both synthetic and real data sets, demonstrating that the amino acid selection can make a substantial difference to the estimated global FLR. We conclude that while several different amino acids might be appropriate, the most reliable FLR results were achieved using alanine and leucine as decoys, although we have a preference for alanine due to the risk of potential confusion between leucine and isoleucine amino acids. We propose that the phosphoproteomics field should adopt the use of a decoy amino acid, so that there is better control of false reporting in the literature, and in public databases that re-distribute the data.
INSTRUMENT(S): Orbitrap Fusion ETD, Q Exactive
ORGANISM(S): Homo Sapiens (human) Arabidopsis Thaliana (mouse-ear Cress)
TISSUE(S): Plant Cell Line, Epithelial Cell, Cell Culture
SUBMITTER: Kerry Ramsbottom
LAB HEAD: Prof. Andrew R Jones
PROVIDER: PXD028840 | Pride | 2022-06-09
REPOSITORIES: Pride
ACCESS DATA