Unknown

Dataset Information

0

Use of an informed search space maximizes confidence of site-specific assignment of glycoprotein glycosylation.


ABSTRACT: In order to interpret glycopeptide tandem mass spectra, it is necessary to estimate the theoretical glycan compositions and peptide sequences, known as the search space. The simplest way to do this is to build a naïve search space from sets of glycan compositions from public databases and to assume that the target glycoprotein is pure. Often, however, purified glycoproteins contain co-purified glycoprotein contaminants that have the potential to confound assignment of tandem mass spectra based on naïve assumptions. In addition, there is increasing need to characterize glycopeptides from complex biological mixtures. Fortunately, liquid chromatography-mass spectrometry (LC-MS) methods for glycomics and proteomics are now mature and accessible. We demonstrate the value of using an informed search space built from measured glycomes and proteomes to define the search space for interpretation of glycoproteomics data. We show this using ?-1-acid glycoprotein (AGP) mixed into a set of increasingly complex matrices. As the mixture complexity increases, the naïve search space balloons and the ability to assign glycopeptides with acceptable confidence diminishes. In addition, it is not possible to identify glycopeptides not foreseen as part of the naïve search space. A search space built from released glycan glycomics and proteomics data is smaller than its naïve counterpart while including the full range of proteins detected in the mixture. This maximizes the ability to assign glycopeptide tandem mass spectra with confidence. As the mixture complexity increases, the number of tandem mass spectra per glycopeptide precursor ion decreases, resulting in lower overall scores and reduced depth of coverage for the target glycoprotein. We suggest use of ?-1-acid glycoprotein as a standard to gauge effectiveness of analytical methods and bioinformatics search parameters for glycoproteomics studies. Graphical Abstract Assignment of site specific glycosylation from LC-tandemMS data.

SUBMITTER: Khatri K 

PROVIDER: S-EPMC5283608 | biostudies-literature | 2017 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Use of an informed search space maximizes confidence of site-specific assignment of glycoprotein glycosylation.

Khatri Kshitij K   Klein Joshua A JA   Zaia Joseph J  

Analytical and bioanalytical chemistry 20161012 2


In order to interpret glycopeptide tandem mass spectra, it is necessary to estimate the theoretical glycan compositions and peptide sequences, known as the search space. The simplest way to do this is to build a naïve search space from sets of glycan compositions from public databases and to assume that the target glycoprotein is pure. Often, however, purified glycoproteins contain co-purified glycoprotein contaminants that have the potential to confound assignment of tandem mass spectra based o  ...[more]

Similar Datasets

| S-EPMC5379070 | biostudies-literature
| S-EPMC4184449 | biostudies-literature
| S-EPMC10311258 | biostudies-literature
2018-10-04 | GSE112623 | GEO
| S-EPMC3790305 | biostudies-literature
| S-EPMC1550431 | biostudies-literature
| S-EPMC2658881 | biostudies-other
| S-EPMC3692395 | biostudies-literature
| S-EPMC5708630 | biostudies-literature
| S-EPMC6824209 | biostudies-literature