Unknown

Dataset Information

0

High throughput discovery of protein variants using proteomics informed by transcriptomics.


ABSTRACT: Proteomics informed by transcriptomics (PIT), in which proteomic MS/MS spectra are searched against open reading frames derived from de novo assembled transcripts, can reveal previously unknown translated genomic elements (TGEs). However, determining which TGEs are truly novel, which are variants of known proteins, and which are simply artefacts of poor sequence assembly, is challenging. We have designed and implemented an automated solution that classifies putative TGEs by comparing to reference proteome sequences. This allows large-scale identification of sequence polymorphisms, splice isoforms and novel TGEs supported by presence or absence of variant-specific peptide evidence. Unlike previously reported methods, ours does not require a catalogue of known variants, making it more applicable to non-model organisms. The method was validated on human PIT data, then applied to Mus musculus, Pteropus alecto and Aedes aegypti. Novel discoveries included 60 human protein isoforms, 32 392 polymorphisms in P. alecto, and TGEs with non-methionine start sites including tyrosine.

SUBMITTER: Saha S 

PROVIDER: S-EPMC6007231 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

High throughput discovery of protein variants using proteomics informed by transcriptomics.

Saha Shyamasree S   Matthews David A DA   Bessant Conrad C  

Nucleic acids research 20180601 10


Proteomics informed by transcriptomics (PIT), in which proteomic MS/MS spectra are searched against open reading frames derived from de novo assembled transcripts, can reveal previously unknown translated genomic elements (TGEs). However, determining which TGEs are truly novel, which are variants of known proteins, and which are simply artefacts of poor sequence assembly, is challenging. We have designed and implemented an automated solution that classifies putative TGEs by comparing to referenc  ...[more]

Similar Datasets

| S-EPMC9736226 | biostudies-literature
| S-EPMC4102724 | biostudies-literature
| S-EPMC3592458 | biostudies-other
| S-EPMC137474 | biostudies-literature
| S-EPMC5342740 | biostudies-literature
| S-EPMC4224174 | biostudies-literature
| S-EPMC4638048 | biostudies-literature
| S-EPMC2838866 | biostudies-other
| S-EPMC4269970 | biostudies-literature
| S-EPMC5556373 | biostudies-literature