Dataset Information

Improved quality control processing of peptide-centric LC-MS proteomics data.


ABSTRACT:

Motivation

In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor-quality peptide abundance data can bias downstream statistical analyses, and hence the biological interpretation, of an otherwise high-quality dataset. Although considerable effort has been devoted to assuring the quality of peptide identification with respect to spectral processing, quality assessment of the subsequent peptide abundance data matrix has to date been limited to subjective visual inspection of run-by-run correlation or of individual peptide components. Identifying statistical outliers is a critical step in processing proteomics data because many downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely on accurate estimates of sample variance, and their results are influenced by extreme values.

Results

We describe a novel multivariate statistical strategy for identifying LC-MS runs with extreme peptide abundance distributions. Comparison with the current method (run-by-run correlation) demonstrates a significantly better rate of identification of outlier runs by the multivariate strategy. Simulation studies also suggest that this strategy significantly outperforms correlation alone in identifying statistically extreme liquid chromatography-mass spectrometry (LC-MS) runs.
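
To make the contrast between run-by-run correlation and a multivariate score concrete, here is a minimal sketch on simulated data. It is not the authors' procedure: the two per-run features (mean correlation with the other runs, fraction of missing values), the use of a classical Mahalanobis distance, the toy data generator and all function names are assumptions chosen for illustration. With real data, a robust covariance estimate may be preferable, since extreme runs inflate the classical estimate.

```python
import math
import random
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def run_features(runs):
    """Summarise each run as (mean correlation with other runs, fraction missing).

    `runs` is a list of runs; each run is a list of log abundances, one per
    peptide, with None marking a missing value. Both features are
    illustrative assumptions, not taken from the paper.
    """
    feats = []
    for i, r in enumerate(runs):
        cors = []
        for j, s in enumerate(runs):
            if i == j:
                continue
            # Use only peptides observed in both runs.
            pairs = [(a, b) for a, b in zip(r, s)
                     if a is not None and b is not None]
            xs, ys = zip(*pairs)
            cors.append(pearson(xs, ys))
        frac_missing = sum(v is None for v in r) / len(r)
        feats.append((mean(cors), frac_missing))
    return feats


def mahalanobis_sq(feats):
    """Squared Mahalanobis distance of each 2-feature point from the centroid."""
    n = len(feats)
    mx = mean(f[0] for f in feats)
    my = mean(f[1] for f in feats)
    sxx = sum((f[0] - mx) ** 2 for f in feats) / (n - 1)
    syy = sum((f[1] - my) ** 2 for f in feats) / (n - 1)
    sxy = sum((f[0] - mx) * (f[1] - my) for f in feats) / (n - 1)
    det = sxx * syy - sxy ** 2  # determinant of the 2x2 sample covariance
    out = []
    for x, y in feats:
        dx, dy = x - mx, y - my
        # (dx, dy) S^{-1} (dx, dy)^T with the 2x2 inverse written out
        out.append((syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det)
    return out


def simulate_runs(seed=0, n_runs=8, n_peptides=200):
    """Toy data: the last run is noisier and has more missing values."""
    rng = random.Random(seed)
    base = [rng.gauss(20.0, 2.0) for _ in range(n_peptides)]
    runs = []
    for k in range(n_runs):
        bad = k == n_runs - 1
        sd, p_miss = (3.0, 0.4) if bad else (0.3, 0.1)
        runs.append([None if rng.random() < p_miss else b + rng.gauss(0.0, sd)
                     for b in base])
    return runs


runs = simulate_runs()
feats = run_features(runs)
d2 = mahalanobis_sq(feats)
for k, ((cor, miss), d) in enumerate(zip(feats, d2)):
    print(f"run {k}: mean_cor={cor:.3f} frac_missing={miss:.2f} d2={d:.2f}")
```

Ranking runs by `d2` combines both features into a single score, whereas the correlation column alone ignores missingness. Any cut-off for calling a run an outlier would have to be chosen separately and is deliberately left out of this sketch.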

Availability

https://www.biopilot.org/docs/Software/RMD.php

Contact

bj@pnl.gov

Supplementary information

Supplementary material is available at Bioinformatics online.

SUBMITTER: Matzke MM 

PROVIDER: S-EPMC3187650 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

Publications

Improved quality control processing of peptide-centric LC-MS proteomics data.

Matzke MM, Waters KM, Metz TO, Jacobs JM, Sims AC, Baric RS, Pounds JG, Webb-Robertson BJ

Bioinformatics (Oxford, England), 2011-08-18, issue 20


Similar Datasets

| S-EPMC7114614 | biostudies-literature
| S-EPMC4098950 | biostudies-literature
| S-EPMC3721451 | biostudies-literature
| S-BSST1067 | biostudies-other
| S-EPMC7341532 | biostudies-literature
| S-EPMC4894284 | biostudies-literature
| S-EPMC1906842 | biostudies-literature
| S-EPMC3101957 | biostudies-literature
| S-EPMC4777305 | biostudies-literature
| S-EPMC4524413 | biostudies-literature