Proteomics

Dataset Information

0

EBprot: Bayesian Analysis of Labelling-based Quantitative Proteomics Data


ABSTRACT: Labelling-based proteomics is a powerful method for detection of differentially expressed proteins (DEPs) between biological samples. The current data analysis platform relies on protein-level ratios, where peptide-level ratios are averaged to yield a single summary ratio for each protein. In shotgun proteomics, however, some proteins are quantified with more peptides than others, and this reproducibility information is incorporated into the differential expression (DE) analysis. Here we propose a novel probabilistic framework EBprot that directly models the peptide-to-protein hierarchy and rewards the proteins with reproducible quantification over multiple peptides. To evaluate its performance with known DE states, we first verified that the peptide-level analysis of EBprot provides more accurate estimation of the false discovery rates and better receiver-operating characteristic than other protein ratio analyses using simulation datasets, and confirmed the superior classification performance in a UPS1 mixture spike-in dataset. To illustrate the performance of EBprot in realistic applications, we applied EBprot to a SILAC dataset for lung cancer subtype analysis and an iTRAQ dataset for time course phosphoproteome analysis of EGF-stimulated HeLa cells, each featuring a different experimental design. Through these various examples, we show that the peptide-level analysis of EBprot provides a competitive advantage over alternative methods for the DE analysis of labelling-based quantitative datasets.

INSTRUMENT(S): Q Exactive

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Cell Culture

SUBMITTER: Rachel Li  

LAB HEAD: Hyung Won Choi

PROVIDER: PXD001426 | Pride | 2015-06-29

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
2014_01_HUMAN.fasta Fasta
MaxQuantOutput.tar.gz Other
QE_140221_01_UPS1_100fmolspiked.mgf Mgf
QE_140221_01_UPS1_100fmolspiked.raw Raw
QE_140221_02_UPS1_300fmolspiked.mgf Mgf
Items per page:
1 - 5 of 8
altmetric image

Publications

EBprot: Statistical analysis of labeling-based quantitative proteomics data.

Koh Hiromi W L HW   Swa Hannah L F HL   Fermin Damian D   Ler Siok Ghee SG   Gunaratne Jayantha J   Choi Hyungwon H  

Proteomics 20150528 15


Labeling-based proteomics is a powerful method for detection of differentially expressed proteins (DEPs). The current data analysis platform typically relies on protein-level ratios, which is obtained by summarizing peptide-level ratios for each protein. In shotgun proteomics, however, some proteins are quantified with more peptides than others, and this reproducibility information is not incorporated into the differential expression (DE) analysis. Here, we propose a novel probabilistic framewor  ...[more]

Similar Datasets

2021-07-30 | PXD024305 | Pride
2017-03-29 | MSV000080742 | MassIVE
2020-03-03 | PXD009815 | Pride
2017-08-04 | PXD007039 | Pride
2017-08-09 | PXD003472 | Pride
2015-12-01 | PXD001819 | Pride
2015-04-15 | PXD001546 | Pride
2021-06-17 | PXD022778 | Pride
2024-08-28 | PXD055277 | Pride
2022-07-31 | PXD027800 | Pride