Unknown

Dataset Information

0

Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.


ABSTRACT: Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence.

SUBMITTER: Savidor A 

PROVIDER: S-EPMC5461544 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Database-independent Protein Sequencing (DiPS) Enables Full-length <i>de Novo</i> Protein and Antibody Sequence Determination.

Savidor Alon A   Barzilay Rotem R   Elinger Dalia D   Yarden Yosef Y   Lindzen Moshit M   Gabashvili Alexandra A   Adiv Tal Ophir O   Levin Yishai Y  

Molecular & cellular proteomics : MCP 20170327 6


Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is  ...[more]

Similar Datasets

| S-EPMC3133153 | biostudies-literature
| S-EPMC8771625 | biostudies-literature
| S-EPMC3968166 | biostudies-literature
| S-EPMC2866332 | biostudies-literature
| S-EPMC6944239 | biostudies-literature
| S-EPMC4780649 | biostudies-literature
| S-EPMC7391827 | biostudies-literature
| S-EPMC10400622 | biostudies-literature
| S-EPMC4595759 | biostudies-literature
2019-11-20 | PXD015083 | Pride