Unknown

Dataset Information

0

A large scale test dataset to determine optimal retention index threshold based on three mass spectral similarity measures.


ABSTRACT: Retention index (RI) is useful for metabolite identification. However, when RI is integrated with mass spectral similarity for metabolite identification, many controversial RI threshold setup are reported in literatures. In this study, a large scale test dataset of 5844 compounds with both mass spectra and RI information were created from National Institute of Standards and Technology (NIST) repetitive mass spectra (MS) and RI library. Three MS similarity measures: NIST composite measure, the real part of Discrete Fourier Transform (DFT.R) and the detail of Discrete Wavelet Transform (DWT.D) were used to investigate the accuracy of compound identification using the test dataset. To imitate real identification experiments, NIST MS main library was employed as reference library and the test dataset was used as search data. Our study shows that the optimal RI thresholds are 22, 15, and 15 i.u. for the NIST composite, DFT.R and DWT.D measures, respectively, when the RI and mass spectral similarity are integrated for compound identification. Compared to the mass spectrum matching, using both RI and mass spectral matching can improve the identification accuracy by 1.7%, 3.5%, and 3.5% for the three mass spectral similarity measures, respectively. It is concluded that the improvement of RI matching for compound identification heavily depends on the method of MS spectral similarity measure and the accuracy of RI data.

SUBMITTER: Zhang J 

PROVIDER: S-EPMC3430127 | biostudies-literature | 2012 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A large scale test dataset to determine optimal retention index threshold based on three mass spectral similarity measures.

Zhang Jun J   Koo Imhoi I   Wang Bing B   Gao Qing-Wei QW   Zheng Chun-Hou CH   Zhang Xiang X  

Journal of chromatography. A 20120619


Retention index (RI) is useful for metabolite identification. However, when RI is integrated with mass spectral similarity for metabolite identification, many controversial RI threshold setup are reported in literatures. In this study, a large scale test dataset of 5844 compounds with both mass spectra and RI information were created from National Institute of Standards and Technology (NIST) repetitive mass spectra (MS) and RI library. Three MS similarity measures: NIST composite measure, the re  ...[more]

Similar Datasets

2010-08-13 | GSE17705 | GEO
| S-EPMC3787630 | biostudies-literature
| S-EPMC7671987 | biostudies-literature
2010-08-13 | E-GEOD-17705 | biostudies-arrayexpress
| S-EPMC6388043 | biostudies-literature
| S-EPMC6454479 | biostudies-literature
| S-EPMC2615214 | biostudies-literature
| S-EPMC4433014 | biostudies-literature
| S-EPMC4412153 | biostudies-literature
| S-EPMC4521860 | biostudies-literature