Dataset Information

A Regression-Based Analysis of Ribosome-Profiling Data Reveals a Conserved Complexity to Mammalian Translation.


ABSTRACT: A fundamental goal of genomics is to identify the complete set of expressed proteins. Automated annotation strategies rely on assumptions about protein-coding sequences (CDSs), e.g., they are conserved, do not overlap, and exceed a minimum length. However, an increasing number of newly discovered proteins violate these rules. Here we present an experimental and analytical framework, based on ribosome profiling and linear regression, for systematic identification and quantification of translation. Application of this approach to lipopolysaccharide-stimulated mouse dendritic cells and HCMV-infected human fibroblasts identifies thousands of novel CDSs, including micropeptides and variants of known proteins, that bear the hallmarks of canonical translation and exhibit translation levels and dynamics comparable to that of annotated CDSs. Remarkably, many translation events are identified in both mouse and human cells even when the peptide sequence is not conserved. Our work thus reveals an unexpected complexity to mammalian translation suited to provide both conserved regulatory or protein-based functions.

SUBMITTER: Fields AP 

PROVIDER: S-EPMC4720255 | biostudies-literature | 2015 Dec

REPOSITORIES: biostudies-literature

Publications

A Regression-Based Analysis of Ribosome-Profiling Data Reveals a Conserved Complexity to Mammalian Translation.

Fields AP, Rodriguez EH, Jovanovic M, Stern-Ginossar N, Haas BJ, Mertins P, Raychowdhury R, Hacohen N, Carr SA, Ingolia NT, Regev A, Weissman JS

Molecular Cell, 2015 Dec 1, issue 5


Similar Datasets

2015-12-03 | E-GEOD-74139 | biostudies-arrayexpress
2015-12-03 | GSE74139 | GEO
2015-10-30 | MSV000079361 | MassIVE
| S-EPMC5793785 | biostudies-literature
| S-EPMC3225288 | biostudies-literature
| S-EPMC3863897 | biostudies-literature
| S-EPMC4216110 | biostudies-literature
2011-11-03 | GSE30839 | GEO
| S-EPMC4537299 | biostudies-literature
2011-11-03 | E-GEOD-30839 | biostudies-arrayexpress