Dataset Information

MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.


ABSTRACT: There is a need to better understand and handle the 'dark matter' of proteomics: the vast diversity of post-translational and chemical modifications that are unaccounted for in a typical mass spectrometry-based analysis and thus remain unidentified. We present a fragment-ion indexing method, and its implementation in the peptide identification tool MSFragger, that enables a more than 100-fold improvement in speed over most existing proteome database search tools. Using several large proteomic data sets, we demonstrate how MSFragger empowers the open database search concept for comprehensive identification of peptides and all their modified forms, uncovering dramatic differences in modification rates across experimental samples and conditions. We further illustrate its utility using protein-RNA cross-linked peptide data and using affinity purification experiments where we observe, on average, a 300% increase in the number of identified spectra for enriched proteins. We also discuss the benefits of open searching for improved false discovery rate estimation in proteomics.

SUBMITTER: Kong AT 

PROVIDER: S-EPMC5409104 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

Publications

MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.

Andy T. Kong, Felipe V. Leprevost, Dmitry M. Avtonomov, Dattatreya Mellacheruvu, Alexey I. Nesvizhskii

Nature Methods, 10 April 2017, issue 5


Similar Datasets

| S-EPMC4748730 | biostudies-literature
| S-EPMC4587597 | biostudies-literature
| S-EPMC3697753 | biostudies-literature
| S-EPMC11373339 | biostudies-literature
| EGAC00001002236 | EGA
| S-EPMC7044164 | biostudies-literature
| S-EPMC2605478 | biostudies-literature
| S-EPMC6495254 | biostudies-literature
2018-12-14 | GSE118974 | GEO
| S-EPMC4049470 | biostudies-literature