Unknown

Dataset Information

0

A nonparametric model for quality control of database search results in shotgun proteomics.


ABSTRACT: BACKGROUND: Analysis of complex samples with tandem mass spectrometry (MS/MS) has become routine in proteomic research. However, validation of database search results creates a bottleneck in MS/MS data processing. Recently, methods based on a randomized database have become popular for quality control of database search results. However, a consequent problem is the ignorance of how to combine different database search scores to improve the sensitivity of randomized database methods. RESULTS: In this paper, a multivariate nonlinear discriminate function (DF) based on the multivariate nonparametric density estimation technique was used to filter out false-positive database search results with a predictable false positive rate (FPR). Application of this method to control datasets of different instruments (LCQ, LTQ, and LTQ/FT) yielded an estimated FPR close to the actual FPR. As expected, the method was more sensitive when more features were used. Furthermore, the new method was shown to be more sensitive than two commonly used methods on 3 complex sample datasets and 3 control datasets. CONCLUSION: Using the nonparametric model, a more flexible DF can be obtained, resulting in improved sensitivity and good FPR estimation. This nonparametric statistical technique is a powerful tool for tackling the complexity and diversity of datasets in shotgun proteomics.

SUBMITTER: Zhang J 

PROVIDER: S-EPMC2267700 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature

altmetric image

Publications

A nonparametric model for quality control of database search results in shotgun proteomics.

Zhang Jiyang J   Li Jianqi J   Liu Xin X   Xie Hongwei H   Zhu Yunping Y   He Fuchu F  

BMC bioinformatics 20080121


<h4>Background</h4>Analysis of complex samples with tandem mass spectrometry (MS/MS) has become routine in proteomic research. However, validation of database search results creates a bottleneck in MS/MS data processing. Recently, methods based on a randomized database have become popular for quality control of database search results. However, a consequent problem is the ignorance of how to combine different database search scores to improve the sensitivity of randomized database methods.<h4>Re  ...[more]

Similar Datasets

2019-01-17 | PXD012394 |
| S-EPMC3744223 | biostudies-literature
| S-EPMC6490964 | biostudies-literature
| S-EPMC5096980 | biostudies-literature
| S-EPMC4515955 | biostudies-literature
| S-EPMC3506577 | biostudies-literature
| S-EPMC2352161 | biostudies-other
| S-EPMC3608465 | biostudies-literature
| S-EPMC7332369 | biostudies-literature
| S-EPMC3769318 | biostudies-literature