Unknown

Dataset Information

0

An integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome.


ABSTRACT: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool.Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR)<5%] in a MS dataset derived from two human breast epithelial cell lines. A subset of these were then successfully validated by a different MS technique. Two of these correspond to novel isoforms of Heterogeneous Ribonuclear Proteins, while the rest correspond to novel loci.MS technology can be used for ab initio gene discovery in human data, which, since it is based on different underlying assumptions, identifies protein-coding genes not found by other techniques. As MS technology continues to evolve, such approaches will become increasingly powerful.

SUBMITTER: Bitton DA 

PROVIDER: S-EPMC2812506 | biostudies-literature | 2010 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

An integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome.

Bitton Danny A DA   Smith Duncan L DL   Connolly Yvonne Y   Scutt Paul J PJ   Miller Crispin J CJ  

PloS one 20100128 1


<h4>Background</h4>Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool.<h4>Results</h4>Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR)<5%] in a MS dataset derived from two human breast epithelial cell lines. A subset of these were then successf  ...[more]

Similar Datasets

| S-EPMC5757624 | biostudies-literature
| S-EPMC4322365 | biostudies-literature
| S-EPMC2796817 | biostudies-other
| S-EPMC5834625 | biostudies-literature
| S-EPMC5800494 | biostudies-literature
| S-SCDT-MSB-19-8994 | biostudies-other
| S-EPMC4655184 | biostudies-literature
| S-EPMC4481266 | biostudies-literature
| S-EPMC6646405 | biostudies-literature
2015-05-15 | GSE58113 | GEO