Unknown

Dataset Information

0

Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data.


ABSTRACT: Untargeted liquid-chromatography-mass spectrometry (LC-MS)-based metabolomics analysis of human biospecimens has become among the most promising strategies for probing the underpinnings of human health and disease. Analysis of spectral data across population scale cohorts, however, is precluded by day-to-day nonlinear signal drifts in LC retention time or batch effects that complicate comparison of thousands of untargeted peaks. To date, there exists no efficient means of visualization and quantitative assessment of signal drift, correction of drift when present, and automated filtering of unstable spectral features, particularly across thousands of data files in population scale experiments. Herein, we report the development of a set of R-based scripts that allow for pre- and postprocessing of raw LC-MS data. These methods can be integrated with existing data analysis workflows by providing initial preprocessing bulk nonlinear retention time correction at the raw data level. Further, this approach provides postprocessing visualization and quantification of peak alignment accuracy, as well as peak-reliability-based parsing of processed data through hierarchical clustering of signal profiles. In a metabolomics data set derived from ?3000 human plasma samples, we find that application of our alignment tools resulted in substantial improvement in peak alignment accuracy, automated data filtering, and ultimately statistical power for detection of metabolite correlates of clinical measures. These tools will enable metabolomics studies of population scale cohorts.

SUBMITTER: Watrous JD 

PROVIDER: S-EPMC5455767 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data.

Watrous Jeramie D JD   Henglin Mir M   Claggett Brian B   Lehmann Kim A KA   Larson Martin G MG   Cheng Susan S   Jain Mohit M  

Analytical chemistry 20170126 3


Untargeted liquid-chromatography-mass spectrometry (LC-MS)-based metabolomics analysis of human biospecimens has become among the most promising strategies for probing the underpinnings of human health and disease. Analysis of spectral data across population scale cohorts, however, is precluded by day-to-day nonlinear signal drifts in LC retention time or batch effects that complicate comparison of thousands of untargeted peaks. To date, there exists no efficient means of visualization and quant  ...[more]

Similar Datasets

| S-EPMC5031781 | biostudies-literature
| S-EPMC4379709 | biostudies-literature
| S-EPMC6570933 | biostudies-literature
| S-EPMC5441461 | biostudies-literature
| S-EPMC5860222 | biostudies-literature
| S-EPMC5684233 | biostudies-literature
| S-EPMC6835889 | biostudies-literature
| S-EPMC7346099 | biostudies-literature
| S-EPMC7828469 | biostudies-literature
| S-EPMC7281133 | biostudies-literature