Dataset Information

A Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows.

ABSTRACT: Preprocessing data in a reproducible and robust way is one of the current challenges in untargeted metabolomics workflows. Data curation in liquid chromatography-mass spectrometry (LC-MS) involves the removal of biologically non-relevant features (retention time, m/z pairs) to retain only high-quality data for subsequent analysis and interpretation. The present work introduces TidyMS, a package for the Python programming language for preprocessing LC-MS data for quality control (QC) procedures in untargeted metabolomics workflows. It is a versatile strategy that can be customized or fit for purpose according to the specific metabolomics application. It allows performing quality control procedures to ensure accuracy and reliability in LC-MS measurements, and it allows preprocessing metabolomics data to obtain cleaned matrices for subsequent statistical analysis. The capabilities of the package are shown with pipelines for an LC-MS system suitability check, system conditioning, signal drift evaluation, and data curation. These applications were implemented to preprocess data corresponding to a new suite of candidate plasma reference materials developed by the National Institute of Standards and Technology (NIST; hypertriglyceridemic, diabetic, and African-American plasma pools) to be used in untargeted metabolomics studies in addition to NIST SRM 1950 Metabolites in Frozen Human Plasma. The package offers a rapid and reproducible workflow that can be used in an automated or semi-automated fashion, and it is an open and free tool available to all users.
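The data curation step described in the abstract (removing features that are not biologically relevant) can be illustrated with a minimal sketch. This is not the TidyMS API. The function name `curate_features`, the two filters (coefficient of variation across pooled QC injections, and blank-to-QC intensity ratio), and the 0.2 thresholds are all assumptions chosen for illustration and are not taken from the record above. Features are keyed by (retention time, m/z) pairs, as in the abstract.

```python
from statistics import mean, stdev


def curate_features(qc, blank, max_cv=0.2, max_blank_ratio=0.2):
    """Return sorted feature ids that pass both hypothetical filters.

    qc, blank: dicts mapping feature id -> list of intensities measured in
    pooled QC injections and in blank injections, respectively.
    Thresholds are illustrative assumptions, not values from the paper.
    """
    kept = []
    for feature, qc_values in qc.items():
        qc_mean = mean(qc_values)
        if qc_mean <= 0:
            continue  # feature was never detected in the QC samples
        cv = stdev(qc_values) / qc_mean  # relative standard deviation in QCs
        # A feature missing from the blank dict is treated as zero intensity.
        blank_ratio = mean(blank.get(feature, [0.0])) / qc_mean
        if cv <= max_cv and blank_ratio <= max_blank_ratio:
            kept.append(feature)
    return sorted(kept)


# Toy data: feature ids are (retention time in s, m/z) pairs.
qc = {
    (62.1, 180.0634): [1000.0, 1050.0, 980.0],  # stable, low in blanks
    (75.4, 255.2330): [500.0, 1500.0, 100.0],   # unstable across QC injections
    (90.0, 149.0233): [800.0, 820.0, 790.0],    # stable, but high in blanks
}
blank = {
    (62.1, 180.0634): [10.0, 12.0],
    (75.4, 255.2330): [5.0, 4.0],
    (90.0, 149.0233): [700.0, 750.0],
}

kept = curate_features(qc, blank)
print(kept)
```

In this toy example only the first feature survives: the second varies too much between QC injections, and the third is almost as intense in the blanks as in the QC samples. A real pipeline would typically apply such filters to a full feature table and tune the thresholds per study.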

SUBMITTER: Riquelme G

PROVIDER: S-EPMC7602939 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

Publications

A Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows.

Riquelme G, Zabalegui N, Marchi P, Jones CM, Monge ME

Metabolites, 2020-10-16; 10

PMID: 33081373

Similar Datasets

A Python-based pipeline for preprocessing LC-MS data for untargeted metabolomics workflows
2020-11-21 | MTBLS1919 | MetaboLights

Filtering procedures for untargeted LC-MS metabolomics data.
S-EPMC6570933 | biostudies-literature

Comparison of Three Untargeted Data Processing Workflows for Evaluating LC-HRMS Metabolomics Data.
S-EPMC7570355 | biostudies-literature

Network Marker Selection for Untargeted LC-MS Metabolomics Data.
S-EPMC5441461 | biostudies-literature

Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing.
S-EPMC7431853 | biostudies-literature

Deep annotation of untargeted LC-MS metabolomics data with Binner.
S-EPMC7828469 | biostudies-literature

Automated Annotation of Untargeted All-Ion Fragmentation LC-MS Metabolomics Data with MetaboAnnotatoR.
S-EPMC8892435 | biostudies-literature

MARS: A Multipurpose Software for Untargeted LC-MS-Based Metabolomics and Exposomics.
S-EPMC10831794 | biostudies-literature

Augmented region of interest for untargeted metabolomics mass spectrometry (AriumMS) of multi-platform-based CE-MS and LC-MS data.
S-EPMC10287804 | biostudies-literature

eMZed: an open source framework in Python for rapid and interactive development of LC/MS data analysis workflows.
S-EPMC3605603 | biostudies-literature