Dataset Information

Searching molecular structure databases with tandem mass spectra using CSI:FingerID.


ABSTRACT: Metabolites provide a direct functional signature of cellular state. Untargeted metabolomics experiments usually rely on tandem MS to identify the thousands of compounds in a biological sample. Today, the vast majority of metabolites remain unknown. We present a method for searching molecular structure databases using tandem MS data of small molecules. Our method computes a fragmentation tree that best explains the fragmentation spectrum of an unknown molecule. We use the fragmentation tree to predict the molecular structure fingerprint of the unknown compound using machine learning. This fingerprint is then used to search a molecular structure database such as PubChem. Our method is shown to improve on the competing methods for computational metabolite identification by a considerable margin.
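
The final step described in the abstract, searching a structure database with a predicted fingerprint, can be illustrated with a minimal sketch. It assumes fingerprints are represented as sets of "on" bit indices and ranks candidates by Tanimoto similarity. Tanimoto is a stand-in chosen here for simplicity; the abstract does not say which scoring function CSI:FingerID uses. The function names, candidate names, and bit values are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple


def tanimoto(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Tanimoto (Jaccard) similarity between two sets of 'on' fingerprint bits."""
    union = len(a | b)
    # Two empty fingerprints are treated as identical by convention.
    return len(a & b) / union if union else 1.0


def rank_candidates(
    predicted: FrozenSet[int],
    candidates: Dict[str, FrozenSet[int]],
) -> List[Tuple[str, float]]:
    """Return candidates sorted by descending similarity to the predicted fingerprint."""
    scored = [(name, tanimoto(predicted, fp)) for name, fp in candidates.items()]
    # Sort by score (high to low), breaking ties alphabetically by name.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Hypothetical toy data: a predicted fingerprint and three database candidates.
predicted = frozenset({1, 4, 7, 9})
candidates = {
    "candidate_A": frozenset({1, 4, 7, 9, 12}),
    "candidate_B": frozenset({1, 2, 3}),
    "candidate_C": frozenset({4, 7, 9}),
}

ranking = rank_candidates(predicted, candidates)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In a real search the candidate set would first be narrowed, for example to database entries matching the molecular formula of the unknown, and the predicted fingerprint would carry per-bit probabilities rather than hard 0/1 values; this sketch omits both.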

SUBMITTER: Duhrkop K 

PROVIDER: S-EPMC4611636 | biostudies-other | 2015 Oct

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC3166376 | biostudies-literature
| S-EPMC2689316 | biostudies-literature
| S-EPMC3905687 | biostudies-literature
| S-EPMC3107871 | biostudies-literature
| S-EPMC2533155 | biostudies-literature
| S-EPMC8057055 | biostudies-literature
| S-EPMC2938093 | biostudies-literature
| S-EPMC6889911 | biostudies-literature
| S-EPMC1865584 | biostudies-literature
| S-EPMC4159664 | biostudies-literature