Dataset Information

The effects of nonignorable missing data on label-free mass spectrometry proteomics experiments.

ABSTRACT: An idealized version of a label-free discovery mass spectrometry proteomics experiment would provide absolute abundance measurements for a whole proteome across varying conditions. Unfortunately, this ideal is not realized. Measurements are made on peptides, so an inferential step is required to obtain protein-level estimates. The inference is complicated by experimental factors that necessitate relative abundance estimation and result in widespread nonignorable missing data. Relative abundance on the log scale takes the form of parameter contrasts. In a complete-case analysis, contrast estimates may be biased by missing data, and a substantial amount of useful information will often go unused. To avoid problems with missing data, many analysts have turned to single-imputation solutions. Unfortunately, these methods often create further difficulties by hiding inestimable contrasts, preventing the recovery of interblock information, and failing to account for imputation uncertainty. To mitigate many of the problems caused by missing values, we propose the use of a Bayesian selection model. Our model is tested on simulated data, on real data with simulated missing values, and on a ground-truth dilution experiment where all of the true relative changes are known. The analysis suggests that our model, compared with various imputation strategies and complete-case analyses, can increase accuracy and provide substantial improvements to interval coverage.
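As a rough illustration of the abstract's point about complete-case bias, the sketch below simulates log2 intensities for two conditions and drops values with a probability that rises as intensity falls. This is not the authors' Bayesian selection model. The logistic missingness curve, the intensity scale, the sample size, and the function name `simulate_contrasts` are all assumptions chosen for illustration.

```python
import math
import random


def simulate_contrasts(n=20000, true_log2_fc=1.0, seed=0):
    """Return (full-data contrast, complete-case contrast) on the log2 scale.

    All numeric settings here are illustrative assumptions, not values
    taken from the paper.
    """
    rng = random.Random(seed)
    mu_a = 20.0                 # assumed mean log2 intensity, condition A
    mu_b = mu_a + true_log2_fc  # condition B differs by the true contrast
    sd = 1.0

    def p_observed(y):
        # Assumed logistic detection curve: lower intensities are more
        # likely to be missing, so missingness depends on the unseen value.
        return 1.0 / (1.0 + math.exp(-2.0 * (y - 20.0)))

    full_a = [rng.gauss(mu_a, sd) for _ in range(n)]
    full_b = [rng.gauss(mu_b, sd) for _ in range(n)]

    # Complete-case data: keep only the values that were "detected".
    obs_a = [y for y in full_a if rng.random() < p_observed(y)]
    obs_b = [y for y in full_b if rng.random() < p_observed(y)]

    def mean(values):
        return sum(values) / len(values)

    full_contrast = mean(full_b) - mean(full_a)
    complete_case_contrast = mean(obs_b) - mean(obs_a)
    return full_contrast, complete_case_contrast


if __name__ == "__main__":
    full, cc = simulate_contrasts()
    print(f"full-data contrast:     {full:.3f}")
    print(f"complete-case contrast: {cc:.3f}")
```

Under these assumptions the lower-abundance condition loses more of its low values, so its observed mean is inflated more than the other condition's and the complete-case contrast is pulled toward zero.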

SUBMITTER: O'Brien JJ

PROVIDER: S-EPMC6249692 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

Publications

The effects of nonignorable missing data on label-free mass spectrometry proteomics experiments.

O'Brien Jonathon J, Gunawardena Harsha P, Paulo Joao A, Chen Xian, Ibrahim Joseph G, Gygi Steven P, Qaqish Bahjat F

The Annals of Applied Statistics, 2018-11-13, issue 4

PMID: 30473739

Similar Datasets

MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry.
S-EPMC10368900 | biostudies-literature

Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning.
S-EPMC11208500 | biostudies-literature

SAINT-MS1: protein-protein interaction scoring using label-free intensity data in affinity purification-mass spectrometry experiments.
S-EPMC3744231 | biostudies-literature

Mass spectrometry-based, label-free quantitative proteomics of round spermatids in mice.
S-EPMC4148364 | biostudies-literature

Normalization approaches for removing systematic biases associated with mass spectrometry and label-free proteomics.
S-EPMC1992440 | biostudies-literature

Comparative Proteomics Analysis of Pig Muscle Exudate through Label-Free Liquid Chromatography-Mass Spectrometry.
S-EPMC10177093 | biostudies-literature

AMPK phosphosite profiling by label-free mass spectrometry reveals a multitude of mTORC1-regulated substrates
GSE272077 | GEO | 2025-02-27

Nonstandard conditionally specified models for nonignorable missing data.
S-EPMC7430986 | biostudies-literature

Label-free, normalized quantification of complex mass spectrometry data for proteomic analysis.
S-EPMC2805705 | biostudies-literature

Automated label-free quantification of metabolites from liquid chromatography-mass spectrometry data.
S-EPMC3879626 | biostudies-literature