Proteomics

Dataset Information

0

Bolt: A new age peptide search engine for comprehensive MS/MS sequencing through vast protein databases in minutes


ABSTRACT: The standard platform for proteomics experiments today is mass spectrometry, particularly for samples derived from complex matrices. Recent increases in mass spectrometry sequencing speed, sensitivity and resolution now permit comprehensive coverage of even the most precious and limited samples, particularly when coupled with improvements in protein extraction techniques and chromatographic separation. However, the results obtained from laborious sample extraction and expensive instrumentation are often hindered by a sub optimal data processing pipelines. One critical data processing piece is peptide sequencing which is most commonly done through database search engines. In almost all MS/MS search engines users must limit their search space due to time constraints and q-value considerations. In nearly all experiments, the search is limited to a canonical database that typically does not reflect the individual genetic variations of the organism being studied. Searching for posttranslational modifications can exponentially increase the search space thus careful consideration must be used during the selection process. In addition, engines will nearly always assume the presence of only fully tryptic peptides. Despite these stringent parameters, proteomic data searches may take hours or even days to complete and opening even one of these criteria to more realistic biological settings will lead to detrimental increases in search time on expensive and custom data processing towers. Even on high performance servers, these search engines are computationally expensive, and most users decide to dial back their search parameters. We present Bolt, a new search engine that can search more than nine hundred thousand protein sequences (canonical, isoform, mutations, and contaminants) with 31 post translation modifications and N-terminal and C-terminal partial tryptic search in a matter of minutes on a standard configuration laptop. Along with increases in speed, Bolt provides an additional benefit of improvement in high confidence identifications, as demonstrated by manual validation of unique peptides identified by Bolt that were missed with parallel searching using standard engines. When in disagreement, 67% of peptides identified by Bolt may be manually validated by strong fragmentation patterns, compared to 14% of peptides uniquely identified by SEQUEST. Bolt represents, to the best of our knowledge, the first fully scalable, cloud based quantitative proteomic solution that can be operated within a user-friendly GUI interface.

INSTRUMENT(S): LTQ Orbitrap Elite

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Hela Cell

SUBMITTER: Amol Prakash  

LAB HEAD: Amol Prakash

PROVIDER: PXD012700 | Pride | 2019-08-29

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
HeLa_Digest_1ug_130min-1.msf Msf
HeLa_Digest_1ug_130min.raw Raw
HeLa_Digest_1ug_60min-1.msf Msf
HeLa_Digest_1ug_60min.raw Raw
ResultsHela130min.csv Csv
Items per page:
1 - 5 of 8
altmetric image

Publications

Bolt: a New Age Peptide Search Engine for Comprehensive MS/MS Sequencing Through Vast Protein Databases in Minutes.

Prakash Amol A   Ahmad Shadab S   Majumder Swetaketu S   Jenkins Conor C   Orsburn Ben B  

Journal of the American Society for Mass Spectrometry 20190826 11


Recent increases in mass spectrometry speed, sensitivity, and resolution now permit comprehensive proteomics coverage. However, the results are often hindered by sub-optimal data processing pipelines. In almost all MS/MS peptide search engines, users must limit their search space to a canonical database due to time constraints and q value considerations, but this typically does not reflect the individual genetic variations of the organism being studied. In addition, engines will nearly always as  ...[more]

Similar Datasets

2021-05-17 | PXD025486 | Pride
2020-07-28 | PXD014337 | Pride
2014-09-01 | PXD001118 | Pride
2013-01-23 | PXD000021 | Pride
2019-10-29 | PXD007693 | Pride
2024-07-27 | PXD048016 | Pride
2013-02-22 | E-GEOD-44541 | biostudies-arrayexpress
2014-08-28 | PXD000874 | Pride
2014-09-16 | PXD000333 | Pride
2021-08-10 | PXD025655 | Pride