Unknown

Dataset Information

0

A high-throughput approach reveals distinct peptide charging behaviors in electrospray ionization mass spectrometry.


ABSTRACT: Electrospray ionization is a powerful and prevalent technique used to ionize analytes in mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting mass spectra. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based high-throughput approach and analyze CSDs of hundreds of thousands of peptides. Interestingly, half of the peptides exhibit charges that differ from what one would naively expect (number of basic sites). We find that these peptides can be classified into two regimes-undercharging and overcharging-and that these two regimes display markedly different charging characteristics. Strikingly, peptides in the overcharging regime show minimal dependence on basic site count, and more generally, the two regimes exhibit distinct sequence determinants. These findings highlight the rich ionization behavior of peptides and the potential of CSDs for enhancing peptide identification.

SUBMITTER: Xu AM 

PROVIDER: S-EPMC10103939 | biostudies-literature | 2023 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

A high-throughput approach reveals distinct peptide charging behaviors in electrospray ionization mass spectrometry.

Xu Allyn M AM   Tang Lauren C LC   Jovanovic Marko M   Regev Oded O  

bioRxiv : the preprint server for biology 20230403


Electrospray ionization is a powerful and prevalent technique used to ionize analytes in mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting mass spectra. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based high-throughput approach and analyze CSDs of hundreds of thous  ...[more]

Similar Datasets

| S-EPMC10767741 | biostudies-literature
2023-03-12 | MSV000091473 | MassIVE
| S-EPMC11622369 | biostudies-literature
| S-EPMC5887808 | biostudies-literature
| S-EPMC4165461 | biostudies-literature
| S-EPMC9640715 | biostudies-literature
| S-EPMC11622223 | biostudies-literature
| S-EPMC3176668 | biostudies-literature
| S-EPMC2932825 | biostudies-literature
| S-EPMC6422691 | biostudies-literature