Random Forest Processing of Direct Analysis in Real-Time Mass Spectrometric Data Enables Species Identification of Psychoactive Plants from Their Headspace Chemical Signatures.
Ontology highlight
ABSTRACT: The United Nations Office on Drugs and Crime has designated several "legal highs" as "plants of concern" because of the dangers associated with their increasing recreational abuse. Routine identification of these products is hampered by the difficulty in distinguishing them from innocuous plant materials such as foods, herbs, and spices. It is demonstrated here that several of these products have unique but consistent headspace chemical profiles and that multivariate statistical analysis processing of their chemical signatures can be used to accurately identify the species of plants from which the materials are derived. For this study, the headspace volatiles of several species were analyzed by direct analysis in real-time high-resolution mass spectrometry (DART-HRMS). These species include Althaea officinalis, Calea zacatechichi, Cannabis indica, Cannabis sativa, Echinopsis pachanoi, Lactuca virosa, Leonotis leonurus, Mimosa hositlis, Mitragyna speciosa, Ocimum basilicum, Origanum vulgare, Piper methysticum, Salvia divinorum, Turnera diffusa, and Voacanga africana. The results of the DART-HRMS analysis revealed intraspecies similarities and interspecies differences. Exploratory statistical analysis of the data using principal component analysis and global t-distributed stochastic neighbor embedding showed clustering of like species and separation of different species. This led to the use of supervised random forest (RF), which resulted in a model with 99% accuracy. A conformal predictor based on the RF classifier was created and proved to be valid for a significance level of 8% with an efficiency of 0.1, an observed fuzziness of 0, and an error rate of 0. The variables used for the statistical analysis processing were ranked in terms of the ability to enable clustering and discrimination between species using principal component analysis-variable importance of projection scores and RF variable importance indices. The variables that ranked the highest were then identified as m/z values consistent with molecules previously identified in plant material. This technique therefore shows proof-of-concept for the creation of a database for the detection and identification of plant-based legal highs through headspace analysis.
SUBMITTER: Appley MG
PROVIDER: S-EPMC6761758 | biostudies-literature | 2019 Sep
REPOSITORIES: biostudies-literature
ACCESS DATA