
Dataset Information


ProteoStorm: An Ultrafast Metaproteomics Database Search Framework.


ABSTRACT: Shotgun metaproteomics has the potential to reveal the functional landscape of microbial communities but lacks appropriate methods for complex samples with unknown compositions. In the absence of prior taxonomic information, tandem mass spectra would be searched against large pan-microbial databases, which requires heavy computational workload and reduces sensitivity. We present ProteoStorm, an efficient database search framework for large-scale metaproteomics studies, which identifies high-confidence peptide-spectrum matches (PSMs) while achieving a two-to-three orders-of-magnitude speedup over popular tools. A reanalysis of a urinary tract infection (UTI) dataset of 110 individuals revealed a complex pattern of polymicrobial expression, including sub-types of UTIs, cases of bacterial vaginosis, and evidence of no underlying disease. Importantly, compared to the initial UTI study that restricted the search database to a manually curated list of 20 genera, ProteoStorm identified additional genera that were previously unreported, including a case of infection with the rare pathogen Propionimicrobium.

SUBMITTER: Beyter D 

PROVIDER: S-EPMC6231400 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature


Publications

ProteoStorm: An Ultrafast Metaproteomics Database Search Framework.

Beyter D, Lin MS, Yu Y, Pieper R, Bafna V

Cell Systems, 2018-09-26, issue 4



Similar Datasets

| S-EPMC4986259 | biostudies-literature
| S-EPMC3892073 | biostudies-literature
| S-EPMC3633484 | biostudies-literature
2019-07-12 | PXD014582 |
| S-EPMC7767584 | biostudies-literature
| S-EPMC6192206 | biostudies-literature
2015-07-31 | GSE59956 | GEO
2015-07-31 | E-GEOD-59956 | biostudies-arrayexpress
| S-EPMC6420049 | biostudies-literature
2021-09-21 | PXD028558 | Pride