
Dataset Information


Data analysis strategy for maximizing high-confidence protein identifications in complex proteomes such as human tumor secretomes and human serum.


ABSTRACT: Detection of biologically interesting, low-abundance proteins in complex proteomes such as serum typically requires extensive fractionation and high-performance mass spectrometers. Processing of the resulting large data sets involves trade-offs between confidence of identification and depth of protein coverage; that is, higher stringency filters preferentially reduce the number of low-abundance proteins identified. In the current study, alternative database search and results filtering strategies were evaluated to maximize peptide and protein coverage, using test samples ranging from purified proteins to ovarian tumor secretomes and human serum. Full and partial tryptic searches were compared because substantial numbers of partial tryptic peptides were observed in all samples, and the proportion of partial tryptic peptides was particularly high for serum. When data filters that yielded similar false discovery rates (FDR) were used, full tryptic searches detected far fewer peptides than partial tryptic searches. In contrast to the common practice of using full tryptic specificity and a narrow precursor mass tolerance, more proteins and peptides could be confidently identified using a partial tryptic database search with a 100 ppm precursor mass tolerance, followed by filtering of the results using a 10 ppm mass error and full tryptic boundaries.
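
The post-search filtering step named in the abstract (a 10 ppm mass error cutoff plus full tryptic boundaries, applied to results from a wider 100 ppm partial tryptic search) can be illustrated with a minimal sketch. This is not the authors' pipeline: the `PSM` record, its field names, and the helper functions are hypothetical, and the tryptic rule is simplified to "cleavage after K or R" without the usual proline exception or any handling of missed cleavages.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PSM:
    """Hypothetical peptide-spectrum match record (field names are assumptions)."""
    peptide: str             # peptide sequence, one-letter amino acid codes
    prev_aa: str             # residue before the peptide in the protein, "-" at the protein N-terminus
    next_aa: str             # residue after the peptide in the protein, "-" at the protein C-terminus
    observed_mass: float     # measured precursor mass (Da)
    theoretical_mass: float  # calculated peptide mass (Da)


def ppm_error(observed: float, theoretical: float) -> float:
    """Signed precursor mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def is_full_tryptic(psm: PSM) -> bool:
    """Simplified check: both peptide ends sit on a K/R cleavage site or a protein terminus."""
    n_term_ok = psm.prev_aa in ("K", "R", "-")
    c_term_ok = psm.peptide[-1] in ("K", "R") or psm.next_aa == "-"
    return n_term_ok and c_term_ok


def filter_psms(psms: List[PSM], max_ppm: float = 10.0) -> List[PSM]:
    """Keep matches within max_ppm of the theoretical mass that also have full tryptic boundaries."""
    return [
        p for p in psms
        if abs(ppm_error(p.observed_mass, p.theoretical_mass)) <= max_ppm
        and is_full_tryptic(p)
    ]
```

In this sketch the wide-tolerance search itself is assumed to have been run already by a search engine; `filter_psms` only applies the narrower mass error window and the boundary check to its output. A match with a non-tryptic N-terminus, or one that is 50 ppm off, would be dropped even though the 100 ppm partial tryptic search returned it.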

SUBMITTER: Wang H 

PROVIDER: S-EPMC3221390 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature


Publications

Data analysis strategy for maximizing high-confidence protein identifications in complex proteomes such as human tumor secretomes and human serum.

Wang H, Tang HY, Tan GC, Speicher DW

Journal of Proteome Research, 2011-10-18, issue 11



Similar Datasets

| S-EPMC2757279 | biostudies-literature
| S-EPMC5939896 | biostudies-literature
| S-EPMC5493283 | biostudies-literature
2018-10-04 | GSE112623 | GEO
| S-EPMC3337972 | biostudies-literature
| S-EPMC3076744 | biostudies-literature
| S-EPMC7221165 | biostudies-literature
| S-EPMC6739480 | biostudies-literature
| S-EPMC4406858 | biostudies-literature
| S-EPMC6030475 | biostudies-literature