
Dataset Information

Coherent Point Drift Peak Alignment Algorithms Using Distance and Similarity Measures for Two-Dimensional Gas Chromatography Mass Spectrometry Data.


ABSTRACT: Peak alignment is a vital preprocessing step before downstream analysis, such as biomarker discovery and pathway analysis, for two-dimensional gas chromatography mass spectrometry (2DGCMS)-based metabolomics data. Due to uncontrollable experimental conditions, e.g., differences in temperature or pressure, matrix effects on samples, and stationary phase degradation, retention times inevitably shift among samples during 2DGCMS experiments, making it difficult to align peaks. Various peak alignment algorithms have been developed to correct retention time shifts for homogeneous data, heterogeneous data, or both types of mass spectrometry data. However, almost all existing algorithms focus on local alignment and suffer from low accuracy, especially when aligning dense biological data with many peaks. We have developed four global peak alignment (GPA) algorithms using coherent point drift (CPD) point matching algorithms: retention time-based CPD-GPA (RT), prior CPD-GPA (P), mixture CPD-GPA (M), and prior mixture CPD-GPA (P+M). The method RT performs the peak alignment based only on the retention time distance, while the methods P, M, and P+M carry out the peak alignment using both the retention time distance and mass spectral similarity. The method P incorporates the mass spectral similarity through prior information, and the methods M and P+M use the mixture distance measure. The four developed algorithms are applied to homogeneous and heterogeneous spiked-in data as well as two real biological data sets and compared with three existing algorithms, mSPA, SWPA, and BiPACE-2D. The results show that our CPD-GPA algorithms perform better than all existing algorithms in terms of F1 score.
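
The abstract names two ingredients, a distance that mixes retention time distance with mass spectral similarity, and the F1 score used for comparison, without giving formulas. The following is a minimal Python sketch of both ideas under stated assumptions and is not the authors' implementation: the weighted-sum form of `mixture_distance`, the `weight` and `rt_scale` parameters, the use of cosine similarity, and the peak dictionary layout (`"rt"`, `"spectrum"`) are all hypothetical choices made for illustration. Only `f1_score` follows a standard definition (harmonic mean of precision and recall over matched peak pairs).

```python
import math


def rt_distance(rt_a, rt_b):
    """Euclidean distance between two (first-dim, second-dim) retention times."""
    return math.hypot(rt_a[0] - rt_b[0], rt_a[1] - rt_b[1])


def cosine_similarity(spec_a, spec_b):
    """Cosine similarity between two mass spectra given as {m/z: intensity} dicts.

    Assumption: cosine similarity stands in for the paper's spectral similarity.
    """
    dot = sum(inten * spec_b.get(mz, 0.0) for mz, inten in spec_a.items())
    norm_a = math.sqrt(sum(v * v for v in spec_a.values()))
    norm_b = math.sqrt(sum(v * v for v in spec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def mixture_distance(peak_a, peak_b, weight=0.5, rt_scale=1.0):
    """Hypothetical mixture: weighted sum of scaled RT distance and spectral dissimilarity.

    weight   -- share given to the retention time term (assumed, not from the source)
    rt_scale -- divisor that puts RT distance on a scale comparable to [0, 1] (assumed)
    """
    d_rt = rt_distance(peak_a["rt"], peak_b["rt"]) / rt_scale
    d_ms = 1.0 - cosine_similarity(peak_a["spectrum"], peak_b["spectrum"])
    return weight * d_rt + (1.0 - weight) * d_ms


def f1_score(predicted_pairs, true_pairs):
    """F1 score of predicted peak matches against reference matches."""
    predicted, truth = set(predicted_pairs), set(true_pairs)
    true_positives = len(predicted & truth)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(truth)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Two made-up peaks: close in retention time, similar but not identical spectra.
    peak_1 = {"rt": (600.0, 1.20), "spectrum": {73: 100.0, 147: 40.0, 205: 10.0}}
    peak_2 = {"rt": (602.0, 1.25), "spectrum": {73: 95.0, 147: 45.0, 217: 5.0}}
    print("mixture distance:", mixture_distance(peak_1, peak_2, weight=0.5, rt_scale=10.0))

    # Peak pairs are (index in sample A, index in sample B); values are illustrative.
    print("F1:", f1_score([(0, 0), (1, 1), (2, 3)], [(0, 0), (1, 1), (2, 2)]))
```

In this sketch a smaller mixture distance means a more plausible match; a point matching method would use such a distance (or the corresponding similarity) when deciding which peaks in two samples correspond. How the published M and P+M methods actually combine the two terms inside CPD is not stated in the abstract.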

SUBMITTER: Li Z 

PROVIDER: S-EPMC7837599 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

Publications

Coherent Point Drift Peak Alignment Algorithms Using Distance and Similarity Measures for Two-Dimensional Gas Chromatography Mass Spectrometry Data.

Zeyu Li, Seongho Kim, Sikai Zhong, Zichun Zhong, Ikuko Kato, Xiang Zhang

Journal of Chemometrics, 2020-03-28, issue 8

Similar Datasets

| S-EPMC3106184 | biostudies-literature
| S-EPMC3787630 | biostudies-literature
| S-EPMC3133553 | biostudies-literature
| S-EPMC3546004 | biostudies-literature
| S-EPMC6248453 | biostudies-literature
| S-EPMC3323827 | biostudies-other
| S-EPMC4448113 | biostudies-literature
| S-EPMC4760236 | biostudies-literature
| S-EPMC3694766 | biostudies-other
2012-09-21 | MTBLS21 | MetaboLights