Dataset Information

An assessment of false discovery rates and statistical significance in label-free quantitative proteomics with combined filters.

ABSTRACT:

Background

Many studies have provided algorithms or methods for assessing statistical significance in quantitative proteomics when multiple replicates of a protein sample and of the LC/MS analysis are available. However, confidence is still lacking in using datasets without protein sample replicates for biological interpretation. Although a fold-change is a conventional threshold that can be used when there are no sample replicates, it does not provide an assessment of statistical significance such as a false discovery rate (FDR), which is an important indicator of the reliability of identifying differentially expressed proteins. In this work, we investigate whether differentially expressed proteins can be detected with statistical significance from a pair of unlabeled protein samples without replicates and with only duplicate LC/MS injections per sample. An FDR is used to gauge the statistical significance of the differentially expressed proteins.

Results

We experimented with several parameters to control the FDR, including a fold-change, a statistical test, and a minimum number of permuted significant pairings. Although none of these parameters alone gives satisfactory control of the FDR, we find that a combination of these parameters provides a very effective means of controlling the FDR without compromising sensitivity. The results suggest that it is possible to perform a significance analysis without protein sample replicates; only duplicate LC/MS injections per sample are needed. We illustrate that differentially expressed proteins can be detected with an FDR between 0 and 15% at a positive rate of 4-16%. The method is evaluated for its sensitivity and specificity by ROC analysis, and is further validated with a [15N]-labeled internal-standard protein sample and additional unlabeled protein sample replicates.
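The idea of combining filters can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how pairings are formed or how the FDR is computed, so the sketch assumes that duplicate injections of two samples give four cross-sample pairings, that a pairing counts as significant when it meets a fold-change cutoff, and that an empirical FDR is the number of calls in a same-sample (null) comparison divided by the number of calls in the real comparison. The statistical-test filter is omitted, and all function names and intensity values are hypothetical.

```python
from itertools import product
from math import log2


def count_significant_pairings(a_inj, b_inj, fold=2.0):
    """Count cross-sample injection pairings that meet the fold-change cutoff.

    Assumes strictly positive intensities; with two injections per sample
    there are four pairings (A1-B1, A1-B2, A2-B1, A2-B2).
    """
    cutoff = log2(fold)
    return sum(1 for a, b in product(a_inj, b_inj) if abs(log2(b / a)) >= cutoff)


def call_proteins(sample_a, sample_b, fold=2.0, min_pairings=4):
    """Return proteins passing the cutoff in at least min_pairings pairings."""
    return {
        protein
        for protein in sample_a
        if protein in sample_b
        and count_significant_pairings(sample_a[protein], sample_b[protein], fold)
        >= min_pairings
    }


def empirical_fdr(n_null_calls, n_observed_calls):
    """Assumed estimator: calls in a null comparison over calls in the real one."""
    if n_observed_calls == 0:
        return 0.0
    return min(1.0, n_null_calls / n_observed_calls)


# Hypothetical intensities: protein -> (injection 1, injection 2)
sample_a = {"P1": (100.0, 110.0), "P2": (200.0, 190.0), "P3": (50.0, 55.0)}
sample_b = {"P1": (400.0, 420.0), "P2": (210.0, 205.0), "P3": (20.0, 60.0)}

strict = call_proteins(sample_a, sample_b, fold=2.0, min_pairings=4)
loose = call_proteins(sample_a, sample_b, fold=2.0, min_pairings=2)
```

In this toy data, P3 exceeds the fold-change cutoff in only two of its four pairings, so requiring all four pairings excludes it while the looser setting admits it. Raising `min_pairings` or `fold` trades sensitivity for fewer false calls, which is the kind of trade-off the combined filters are meant to tune.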

Conclusion

We demonstrate that statistical significance can be inferred without protein sample replicates in label-free quantitative proteomics. The approach described in this study would be useful in many exploratory experiments where sample amount or instrument time is limited. Naturally, this method is also suitable for proteomics experiments where multiple sample replicates are available. It is simple and complements more sophisticated algorithms that are not designed for dealing with a small number of sample replicates.

SUBMITTER: Li Q

PROVIDER: S-EPMC2645366 | biostudies-literature | 2009 Feb

REPOSITORIES: biostudies-literature

Publications

An assessment of false discovery rates and statistical significance in label-free quantitative proteomics with combined filters.

Li Qingbo, Roxas Bryan AP

BMC Bioinformatics, 2009 Feb 2

PMID: 19187558

Similar Datasets

A Bayesian Null Interval Hypothesis Test Controls False Discovery Rates and Improves Sensitivity in Label-Free Quantitative Proteomics.
| S-EPMC7294927 | biostudies-literature

Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics.
| S-EPMC6442365 | biostudies-literature

Improved results in proteomics by use of local and peptide-class specific false discovery rates.
| S-EPMC2709624 | biostudies-literature

False discovery rates: a new deal.
| S-EPMC5379932 | biostudies-literature

Protein identification false discovery rates for very large proteomics data sets generated by tandem mass spectrometry.
| S-EPMC2773710 | biostudies-literature

Statistical significance of quantitative PCR.
| S-EPMC1868764 | biostudies-literature

Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.
| S-EPMC4533616 | biostudies-literature

Assigning significance in label-free quantitative proteomics to include single-peptide-hit proteins with low replicates.
| S-EPMC2997754 | biostudies-literature

Comparison of false-discovery rates of various decoy databases.
| S-EPMC8449453 | biostudies-literature

False discovery rates for rare variants from sequenced data.
| S-EPMC4711769 | biostudies-literature