Unknown

Dataset Information

0

Highly accurate and large-scale collision cross sections prediction with graph neural networks.


ABSTRACT: The collision cross section (CCS) values derived from ion mobility spectrometry can be used to improve the accuracy of compound identification. Here, we have developed the Structure included graph merging with adduct method for CCS prediction (SigmaCCS) based on graph neural networks using 3D conformers as inputs. A model was trained, evaluated, and tested with >5,000 experimental CCS values. It achieved a coefficient of determination of 0.9945 and a median relative error of 1.1751% on the test set. The model-agnostic interpretation method and the visualization of the learned representations were used to investigate the chemical rationality of SigmaCCS. An in-silico database with 282 million CCS values was generated for three different adduct types of 94 million compounds. Its source code is publicly available at https://github.com/zmzhang/SigmaCCS . Altogether, SigmaCCS is an accurate, rational, and off-the-shelf method to directly predict CCS values from molecular structures.

SUBMITTER: Guo R 

PROVIDER: S-EPMC10319785 | biostudies-literature | 2023 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Highly accurate and large-scale collision cross sections prediction with graph neural networks.

Guo Renfeng R   Zhang Youjia Y   Liao Yuxuan Y   Yang Qiong Q   Xie Ting T   Fan Xiaqiong X   Lin Zhonglong Z   Chen Yi Y   Lu Hongmei H   Zhang Zhimin Z  

Communications chemistry 20230704 1


The collision cross section (CCS) values derived from ion mobility spectrometry can be used to improve the accuracy of compound identification. Here, we have developed the Structure included graph merging with adduct method for CCS prediction (SigmaCCS) based on graph neural networks using 3D conformers as inputs. A model was trained, evaluated, and tested with >5,000 experimental CCS values. It achieved a coefficient of determination of 0.9945 and a median relative error of 1.1751% on the test  ...[more]

Similar Datasets

| S-EPMC9170855 | biostudies-literature
| S-EPMC1578593 | biostudies-literature
| S-EPMC8294856 | biostudies-literature
| S-EPMC11536843 | biostudies-literature
| S-EPMC11209944 | biostudies-literature
| S-EPMC10499216 | biostudies-literature
| S-EPMC11363525 | biostudies-literature
| S-EPMC11904306 | biostudies-literature
| S-EPMC10583285 | biostudies-literature
| S-EPMC11459932 | biostudies-literature