Dataset Information

A Python-based pipeline for preprocessing LC-MS data for untargeted metabolomics workflows

ABSTRACT: Preprocessing data in a reproducible and robust way is one of the current challenges in untargeted metabolomics workflows. Data curation in liquid chromatography-mass spectrometry (LC-MS) involves the removal of unwanted features (retention time, m/z pairs) to retain only high-quality data for subsequent analysis and interpretation. The present work introduces a package for the Python programming language for preprocessing LC-MS data for quality control procedures in untargeted metabolomics workflows. It is a versatile strategy that can be customized or fit for purpose according to the specific metabolomics application. It allows performing quality control procedures to ensure accuracy and reliability in LC-MS measurements, and it allows preprocessing metabolomics data to obtain cleaned matrices for subsequent statistical analysis. The capabilities of the package are showcased with pipelines for an LC-MS system suitability check, system conditioning, signal drift evaluation, and data curation. These applications were implemented to preprocess data corresponding to a new suite of candidate plasma reference materials developed by the National Institute of Standards and Technology (NIST; hypertriglyceridemic, diabetic, and African-American plasma pools) to be used in untargeted metabolomics studies, in addition to NIST SRM 1950 – Metabolites in Frozen Human Plasma. The package offers a rapid and reproducible workflow that can be used in an automated or semi-automated fashion, and it is an open and free tool available to all users.
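
The data curation step described in the abstract can be illustrated with a minimal sketch. It is not the package's actual API: the function name `curate_features`, the class labels ("QC", "blank", "sample"), the two thresholds, and the toy intensities are all assumptions chosen for illustration. A feature, keyed by its (retention time, m/z) pair, is kept only if it is reproducible across QC injections and clearly above the blank signal.

```python
from statistics import mean, stdev


def curate_features(intensities, sample_class, max_qc_cv=0.2, min_blank_ratio=5.0):
    """Keep features that are stable in QC injections and well above the blank.

    intensities:  {(rt_seconds, mz): {sample_name: intensity}}
    sample_class: {sample_name: "QC" | "blank" | "sample"}
    Assumes at least two QC injections and one blank and one study sample.
    The default thresholds are illustrative, not recommended values.
    """
    kept = {}
    for feature, by_sample in intensities.items():
        qc = [v for s, v in by_sample.items() if sample_class[s] == "QC"]
        blank = [v for s, v in by_sample.items() if sample_class[s] == "blank"]
        study = [v for s, v in by_sample.items() if sample_class[s] == "sample"]

        qc_mean = mean(qc)
        if qc_mean == 0:
            continue  # feature never detected in QC injections

        # Coefficient of variation across QC injections (analytical precision).
        qc_cv = stdev(qc) / qc_mean
        # Study samples must exceed the blank by the chosen ratio.
        above_blank = mean(study) >= min_blank_ratio * mean(blank)

        if qc_cv <= max_qc_cv and above_blank:
            kept[feature] = by_sample
    return kept


# Hypothetical sample sheet and feature table.
sample_class = {"QC1": "QC", "QC2": "QC", "QC3": "QC",
                "B1": "blank", "S1": "sample", "S2": "sample"}
intensities = {
    (62.1, 180.0634): {"QC1": 100, "QC2": 102, "QC3": 98, "B1": 1, "S1": 90, "S2": 110},
    (145.8, 255.2319): {"QC1": 50, "QC2": 100, "QC3": 150, "B1": 1, "S1": 80, "S2": 120},
    (301.4, 391.2843): {"QC1": 200, "QC2": 200, "QC3": 200, "B1": 150, "S1": 210, "S2": 190},
}

curated = curate_features(intensities, sample_class)
print(sorted(curated))
```

With these toy values only the first feature is retained: the second varies too much across QC injections and the third is dominated by blank signal.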

INSTRUMENT(S): Liquid Chromatography MS - positive - reverse phase

PROVIDER: MTBLS1919 | MetaboLights | 2020-11-21

REPOSITORIES: MetaboLights

Dataset's files

Showing files 1 - 5 of 191

Publications

A Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows.

Riquelme G, Zabalegui N, Marchi P, Jones CM, Monge ME

Metabolites, 2020-10-16, issue 10

Preprocessing data in a reproducible and robust way is one of the current challenges in untargeted metabolomics workflows. Data curation in liquid chromatography-mass spectrometry (LC-MS) involves the removal of biologically non-relevant features (retention time, m/z pairs) to retain only high-quality data for subsequent analysis and interpretation. The present work introduces TidyMS, a package for the Python programming language for preprocessing LC-MS data for quality control (QC) proce ...

PMID: 33081373

	File	Type
	NZ_20200226_005.mzML	Mzml
	NZ_20200226_007.mzML	Mzml
	NZ_20200226_009.mzML	Mzml
	NZ_20200226_011.mzML	Mzml
	NZ_20200226_013.mzML	Mzml
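
The files listed above are in mzML, an XML format, so a downloaded file can be given a quick sanity check with the Python standard library alone. The sketch below is one possible way to do that, not part of the dataset or of the published package: it counts spectra per MS level by streaming through the file and reading the `MS:1000511` ("ms level") controlled-vocabulary parameter. The function name `count_ms_levels` and the tiny in-memory document are illustrative assumptions.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter

MZML_NS = "{http://psi.hupsi.org/ms/mzml}"  # mzML XML namespace
MS_LEVEL = "MS:1000511"                     # PSI-MS accession for "ms level"


def count_ms_levels(source):
    """Count spectra per MS level in an mzML file (path or binary file object)."""
    counts = Counter()
    # iterparse streams the document, so large files are not loaded at once.
    for _, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == MZML_NS + "spectrum":
            for cv in elem.iter(MZML_NS + "cvParam"):
                if cv.get("accession") == MS_LEVEL:
                    counts[int(cv.get("value"))] += 1
                    break
            elem.clear()  # release the spectrum's children once counted
    return counts


# Minimal stand-in document so the sketch runs without downloading anything.
demo = b"""<?xml version="1.0" encoding="utf-8"?>
<mzML xmlns="http://psi.hupsi.org/ms/mzml">
  <run id="demo">
    <spectrumList count="3">
      <spectrum index="0" id="scan=1">
        <cvParam cvRef="MS" accession="MS:1000511" name="ms level" value="1"/>
      </spectrum>
      <spectrum index="1" id="scan=2">
        <cvParam cvRef="MS" accession="MS:1000511" name="ms level" value="2"/>
      </spectrum>
      <spectrum index="2" id="scan=3">
        <cvParam cvRef="MS" accession="MS:1000511" name="ms level" value="1"/>
      </spectrum>
    </spectrumList>
  </run>
</mzML>"""

level_counts = count_ms_levels(io.BytesIO(demo))
print(dict(level_counts))
```

For a real file, the same function can be called with a local path, for example `count_ms_levels("NZ_20200226_005.mzML")` once that file has been downloaded.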

Similar Datasets

Streamlining LC-MS/MS Data Analysis in R with Open-Source *xcms* and *RforMassSpectrometry*: An End-to-End Workflow
2024-08-26 | MTBLS8735 | MetaboLights

Data set belonging to the chapter "LC-MS data processing using xcms", in Computational Methods and Data Analysis for Metabolomics.
| MSV000099121 | MassIVE

Plasma samples processed with depletion and non depletion
2022-08-12 | PXD031919 | Pride

GNPS - Data set belonging to the chapter
| MSV000099121 | GNPS

Navigating the hydroxymethylome: experimental biases and quality control tools for the tandem bisulfite and oxidative bisulfite Illumina microarrays
2022-01-14 | GSE182919 | GEO

A chemoenzymatic method for simultaneous profiling N- and O-glycans on glycoproteins using one-pot format
2024-08-01 | PXD052172 | Pride

eRah: A computational tool integrating spectral deconvolution and alignment with quantification and identification of metabolites in GCMS-based metabolomics
2016-09-22 | MTBLS321 | MetaboLights

Single-cell transcriptome landscape in colorectal cancer
2024-12-13 | GSE277669 | GEO

Frontiers in Plasma Proteome Profiling Platforms: Innovations and Applications.
2024-07-03 | PXD050425 | Pride

Benchmark data set for MSPypeline, a python package for streamlined mass spectrometry-based proteomics data analysis
2021-07-22 | PXD025792 | Pride