Dataset Information

IDSL_MINT: a deep learning framework to predict molecular fingerprints from mass spectra.


ABSTRACT: The majority of tandem mass spectrometry (MS/MS) spectra in untargeted metabolomics and exposomics studies lack any annotation. Our deep learning framework, Integrated Data Science Laboratory for Metabolomics and Exposomics-Mass INTerpreter (IDSL_MINT), can translate MS/MS spectra into molecular fingerprint descriptors. IDSL_MINT allows users to apply the transformer model to mass spectrometry data, in a manner similar to large language models. Models are trained on user-provided reference MS/MS libraries using any customizable molecular fingerprint descriptor. IDSL_MINT was benchmarked using the LipidMaps database and improved the annotation rate of a test study for MS/MS spectra that were not originally annotated using existing mass spectral libraries. IDSL_MINT may improve the overall annotation rates in untargeted metabolomics and exposomics studies. The IDSL_MINT framework and tutorials are available in the GitHub repository at https://github.com/idslme/IDSL_MINT.

Scientific contribution statement: Structural annotation of MS/MS spectra from untargeted metabolomics and exposomics datasets is a major bottleneck in gaining new biological insights. Machine learning models that convert spectra into molecular fingerprints can help in the annotation process. Here, we present IDSL_MINT, a new, easy-to-use and customizable deep-learning framework to train and use new models that predict molecular fingerprints from spectra for compound annotation workflows.
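
As an illustration of how a predicted fingerprint might feed a compound annotation workflow, the sketch below ranks candidate structures by Tanimoto similarity to a predicted fingerprint. It is an assumption made for illustration, not the IDSL_MINT implementation: the function names (`tanimoto`, `rank_candidates`), the representation of fingerprints as sets of "on" bit indices, and the toy candidate library are all hypothetical.

```python
from typing import Dict, List, Set, Tuple


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto (Jaccard) similarity between two sets of 'on' fingerprint bits."""
    if not a and not b:
        # Convention chosen for this sketch: two empty fingerprints score 0.
        return 0.0
    return len(a & b) / len(a | b)


def rank_candidates(
    predicted: Set[int], library: Dict[str, Set[int]]
) -> List[Tuple[str, float]]:
    """Score every library entry against the predicted fingerprint, best first."""
    scored = [(name, tanimoto(predicted, bits)) for name, bits in library.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical data: bit indices a model might predict for one MS/MS spectrum,
# and a toy library of candidate fingerprints (not real compounds).
predicted_bits = {1, 4, 7, 9}
candidate_library = {
    "candidate_A": {1, 4, 7, 9, 12},
    "candidate_B": {2, 3, 5},
    "candidate_C": {1, 4},
}

ranking = rank_candidates(predicted_bits, candidate_library)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

Set-based fingerprints keep the example dependency-free; a real workflow would more likely use bit vectors from a cheminformatics toolkit.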

SUBMITTER: Baygi SF 

PROVIDER: S-EPMC10797927 | biostudies-literature | 2024 Jan

REPOSITORIES: biostudies-literature

Publications

IDSL_MINT: a deep learning framework to predict molecular fingerprints from mass spectra.

Baygi, Sadjad Fakouri; Barupal, Dinesh Kumar

Journal of Cheminformatics, 2024-01-18, issue 1


Similar Datasets

| S-EPMC10290119 | biostudies-literature
| S-EPMC10951270 | biostudies-literature
| S-EPMC6498126 | biostudies-literature
| S-EPMC7613299 | biostudies-literature
| S-EPMC8896601 | biostudies-literature
| S-EPMC8372319 | biostudies-literature
| S-EPMC8556919 | biostudies-literature
| S-EPMC10557501 | biostudies-literature
| S-EPMC8486166 | biostudies-literature
| S-EPMC11436754 | biostudies-literature