Dataset Information

An integrated MS data processing strategy for fast identification, in-depth and reproducible quantification of protein O-glycosylation in large cohorts of human urine samples

ABSTRACT: Protein O-glycosylation has long been recognized to be closely associated with many diseases, particularly with tumor proliferation, invasion and metastasis. The ability to efficiently profile the variation of O-glycosylation in large-scale clinical samples provides an important approach for the development of biomarkers for cancer diagnosis and for therapeutic response evaluation. Therefore, mass spectrometry (MS)-based techniques for high-throughput, in-depth and reliable elucidation of protein O-glycosylation in large clinical cohorts are in high demand. However, the prevalence of serine and threonine residues in the proteome and the tens of mammalian O-glycan types lead to an extremely large search space composed of millions of theoretical combinations of peptides and O-glycans for intact O-glycopeptide database searching. As a result, database searching takes an exceptionally long time, which is a major obstacle in O-glycoproteome studies of large clinical cohorts. More importantly, due to the low abundance and poor ionization of intact O-glycopeptides and the stochastic nature of data-dependent MS2 acquisition, substantially elevated missing data levels are inevitable as the sample number increases, which undermines quantitative comparison across samples. Therefore, we report a new MS data processing strategy that integrates glycoform-specific database searching, reference library-based MS1 feature matching and MS2 identification propagation for fast identification and in-depth, reproducible label-free quantification of O-glycosylation of human urinary proteins. This strategy increases database searching speed by up to 20-fold and leads to a 30-40% enhancement in intact O-glycopeptide quantification in individual samples, with clearly improved reproducibility. In total, we obtained quantitative information for 1068 intact O-glycopeptides across 36 healthy human urine samples with a 30-40% reduction in the amount of missing data. This is currently the largest dataset of the urinary O-glycoproteome and demonstrates the application potential of this new strategy in large-scale clinical investigations.
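
The abstract names reference library-based MS1 feature matching and MS2 identification propagation without giving details. The sketch below illustrates the general idea only and is not the authors' implementation: a glycopeptide identified by MS2 in any run is stored in a library with its precursor m/z and retention time, and an MS1 feature in another run inherits that identification if it falls within a mass and retention time tolerance. The class names, the function names, the glycopeptide labels and the tolerances (10 ppm, 1 min) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LibraryEntry:
    glycopeptide: str  # label taken from an MS2 identification (hypothetical)
    mz: float          # precursor m/z
    rt_min: float      # retention time in minutes


@dataclass(frozen=True)
class Feature:
    mz: float          # MS1 feature m/z in the run being quantified
    rt_min: float      # retention time in minutes
    intensity: float   # integrated MS1 intensity


def ppm_error(observed_mz: float, reference_mz: float) -> float:
    """Signed mass error in parts per million."""
    return (observed_mz - reference_mz) / reference_mz * 1e6


def match_features(library, features, ppm_tol=10.0, rt_tol_min=1.0):
    """Propagate library identifications to unidentified MS1 features.

    Returns {glycopeptide: intensity} for each library entry that has at
    least one feature within both tolerances. Tolerances are assumed values.
    """
    matched = {}
    for entry in library:
        candidates = [
            f for f in features
            if abs(ppm_error(f.mz, entry.mz)) <= ppm_tol
            and abs(f.rt_min - entry.rt_min) <= rt_tol_min
        ]
        if candidates:
            # Keep the candidate with the smallest absolute mass error.
            best = min(candidates, key=lambda f: abs(ppm_error(f.mz, entry.mz)))
            matched[entry.glycopeptide] = best.intensity
    return matched
```

Under these assumptions, a glycopeptide that was not selected for MS2 in a given sample can still receive a quantitative value from its MS1 feature, which is how approaches of this kind reduce missing data across a cohort.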

INSTRUMENT(S):

ORGANISM(S): Homo sapiens (human)

TISSUE(S): Urine

SUBMITTER: Xinyuan Zhao

LAB HEAD: Weijie Qin

PROVIDER: PXD015987 | Pride | 2020-02-04

REPOSITORIES: Pride

Dataset's files

1 - 5 of 47

Publications

An Integrated Mass Spectroscopy Data Processing Strategy for Fast Identification, In-Depth, and Reproducible Quantification of Protein O-Glycosylation in a Large Cohort of Human Urine Samples.

Xinyuan Zhao, Shanshan Zheng, Yuanyuan Li, Junjie Huang, Wanjun Zhang, Yuping Xie, Weijie Qin, Xiaohong Qian

Analytical Chemistry | 2019-12-20 | 1

PMID: 31859485

	File	Type
	EThcD_1.raw	Raw
	EThcD_2.raw	Raw
	EThcD_3.raw	Raw
	EThcD_4.raw	Raw
	TEST_1.raw	Raw

Similar Datasets

Urinary polypeptide ETD/CID analysis, part 3
2016-02-15 | PXD002372 | Pride

Characterization of sialylated N- and O-glycopeptides from complex proteomes on an Orbitrap Fusion Tribrid mass spectrometer by Isotope Targeted Glycoproteomics (IsoTaG). Application to azido and alkynyl sugars
2017-02-22 | PXD004302 | Pride

Urinary polypeptide ETD/CID analysis, part 2
2016-02-15 | PXD002346 | Pride

Urinary polypeptide ETD/CID analysis, part 1
2016-02-15 | PXD002312 | Pride

Human serum glycoproteome - Glyco-Decipher
2022-03-04 | PXD031025 | Pride

Human urinary protein and glycoprotein LC-MSMS
2022-02-17 | PXD024639 | Pride

Re-analysis of glycoproteomics data with Glyco-Decipher
2022-03-04 | PXD031032 | Pride

Peptidomics profiling of biopsies from the human jejunum before and after gastrectomy
2019-02-15 | PXD011498 | Pride

Label-Free LC-MS/MS Proteomic Analysis of Urinary Identification in Diabetic Vascular Dementia
2021-01-14 | PXD022189 | Pride

Global Identification of Protein PTMs in a Single-pass Database Search
2015-07-31 | GSE59956 | GEO