Unknown

Dataset Information

0

Computational approaches to protein inference in shotgun proteomics.


ABSTRACT: Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in a sample has been recognized as an important computational task. This task generally consists of (1) assigning experimental tandem mass spectra to peptides derived from a protein database, and (2) mapping assigned peptides to proteins and quantifying the confidence of identified proteins. Protein identification is fundamentally a statistical inference problem with a number of methods proposed to address its challenges. In this review we categorize current approaches into rule-based, combinatorial optimization and probabilistic inference techniques, and present them using integer programming and Bayesian inference frameworks. We also discuss the main challenges of protein identification and propose potential solutions with the goal of spurring innovative research in this area.

SUBMITTER: Li YF 

PROVIDER: S-EPMC3489551 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Computational approaches to protein inference in shotgun proteomics.

Li Yong Fuga YF   Radivojac Predrag P  

BMC bioinformatics 20121105


Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in  ...[more]

Similar Datasets

| S-EPMC3548767 | biostudies-literature
| S-EPMC2352161 | biostudies-other
| S-EPMC2711935 | biostudies-literature
| S-EPMC2736651 | biostudies-literature
| S-EPMC3018820 | biostudies-other
| S-EPMC3012695 | biostudies-literature
| S-EPMC2682515 | biostudies-literature
| S-EPMC3631354 | biostudies-literature
| S-EPMC3065711 | biostudies-literature
| S-EPMC4261935 | biostudies-other