
Dataset Information


An automated proteogenomic method uses mass spectrometry to reveal novel genes in Zea mays.


ABSTRACT: New technologies in genomics and proteomics have influenced the emergence of proteogenomics, a field at the confluence of genomics, transcriptomics, and proteomics. First-generation proteogenomic toolkits employ peptide mass spectrometry to identify novel protein-coding regions. We extend first-generation proteogenomic tools to achieve greater accuracy and enable the analysis of large, complex genomes. We apply our pipeline to Zea mays, which has a genome comparable in size to the human genome. Our pipeline begins with the comparison of mass spectra to a putative translation of the genome. We select novel peptides, those that match a region of the genome that was not previously known to be protein coding, for grouping into refinement events. We present a novel, probabilistic framework for evaluating the accuracy of each event. Our calculated event probability, or eventProb, considers the number of supporting peptides and spectra, and the quality of each supporting peptide-spectrum match. Our pipeline predicts 165 novel protein-coding genes and proposes updated models for 741 additional genes.
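The first two steps described in the abstract (matching against a putative translation of the genome, then selecting peptides that fall outside known coding regions) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: it assumes a plain six-frame translation with the standard genetic code, treats peptides as already identified from spectra, uses exact string matching, and ignores strand and splicing when checking annotations. The names `six_frame`, `locate`, and `is_novel` are hypothetical, introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# organelle-specific codes are needed for this illustration).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna):
    """Translate DNA codon by codon; unrecognized codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame(genome):
    """Yield (strand, frame_offset, protein) for all six reading frames."""
    reverse = genome.translate(COMPLEMENT)[::-1]
    for strand, seq in (("+", genome), ("-", reverse)):
        for offset in range(3):
            yield strand, offset, translate(seq[offset:])


def locate(peptide, genome):
    """Return (start, end, strand) spans, in forward-strand coordinates,
    where the peptide appears in any of the six translations."""
    hits = []
    n = len(genome)
    for strand, offset, protein in six_frame(genome):
        pos = protein.find(peptide)
        while pos != -1:
            start = offset + 3 * pos
            end = start + 3 * len(peptide)
            if strand == "-":
                # Map reverse-strand coordinates back to the forward strand.
                start, end = n - end, n - start
            hits.append((start, end, strand))
            pos = protein.find(peptide, pos + 1)
    return hits


def is_novel(span, known_coding):
    """True if the span overlaps none of the known coding intervals
    (half-open, forward-strand coordinates; strand is ignored here)."""
    start, end, _ = span
    return all(end <= s or start >= e for s, e in known_coding)
```

For example, `locate("AKF", "ATGGCCAAATTTGGG")` finds one forward-strand span, and `is_novel` then reports whether that span lies outside a supplied list of annotated coding intervals. Grouping novel peptides into events and scoring them (the eventProb step) is not sketched here, because the abstract does not give the formula.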

SUBMITTER: Castellana NE 

PROVIDER: S-EPMC3879611 | biostudies-literature | 2014 Jan

REPOSITORIES: biostudies-literature


Publications

An automated proteogenomic method uses mass spectrometry to reveal novel genes in Zea mays.

Natalie E Castellana, Zhouxin Shen, Yupeng He, Justin W Walley, California Jack Cassidy, Steven P Briggs, Vineet Bafna

Molecular & cellular proteomics : MCP, 2013-10-18, issue 1



Similar Datasets

| S-EPMC3275902 | biostudies-other
| S-EPMC7548895 | biostudies-literature
| S-EPMC1462307 | biostudies-other
| S-EPMC11243803 | biostudies-literature
| S-EPMC4821088 | biostudies-literature
| S-EPMC5944917 | biostudies-literature
| S-EPMC4081654 | biostudies-other
| S-EPMC3205572 | biostudies-other
| S-EPMC10744276 | biostudies-literature
| S-EPMC5619761 | biostudies-literature