Dataset Information

Analysis of nested alternate open reading frames and their encoded proteins.


ABSTRACT: Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number while maintaining a sequence relationship between the original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) within canonical host mRNAs. Uniquely, 'alt-proteins' lack sequence homology with host ORF-derived proteins. We show that the global amino acid frequencies, and consequent biochemical characteristics, of Alt-ORFs nested within host ORFs (nAlt-ORFs) are genetically driven and are predicted by summing the frequencies of the hundreds of encompassing host codon pairs. Analysis of 101 human nAlt-ORFs of length ≥150 codons confirms the theoretical predictions, revealing an extraordinarily high median isoelectric point (pI) of 11.68 due to anomalous levels of charged amino acids. In addition, nAlt-ORF proteins exhibit a >2-fold preference for reading frame 2 over frame 3, predicted mitochondrial and nuclear localization, and an elevated codon adaptation index indicative of natural selection. Our results provide a theoretical and conceptual framework for exploring these largely unannotated, but potentially significant, alternative ORFs and their encoded proteins.
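
As an illustration of what "nested within host ORFs" means in practice, the following minimal sketch scans a host coding sequence for ATG-initiated ORFs in the two alternative forward frames that also terminate inside the host ORF. It is not the authors' pipeline. The function name `find_nested_alt_orfs`, the convention that frame 2 is the host frame shifted by one nucleotide (and frame 3 by two), and the use of ATG as the only start codon are all assumptions made for this example. The 150-codon default mirrors the length cutoff quoted in the abstract.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_nested_alt_orfs(host_cds, min_codons=150):
    """Return (frame, start, end) tuples for alt-ORFs nested in a host CDS.

    host_cds   -- host coding sequence; frame 1 starts at index 0.
    frame      -- 2 or 3 (assumed convention: host frame shifted by +1 / +2 nt).
    start, end -- 0-based, end-exclusive nucleotide coordinates; the stop
                  codon is included in the span.
    min_codons -- minimum number of sense codons (stop codon not counted).
    """
    seq = host_cds.upper()
    hits = []
    for offset in (1, 2):  # the two alternative forward frames
        orf_start = None
        for i in range(offset, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if orf_start is None:
                # Open an ORF at the first ATG seen since the last stop.
                if codon == "ATG":
                    orf_start = i
            elif codon in STOP_CODONS:
                n_codons = (i - orf_start) // 3
                if n_codons >= min_codons:
                    hits.append((offset + 1, orf_start, i + 3))
                orf_start = None
        # An ORF still open here has no stop inside the host CDS,
        # so it is not fully nested and is deliberately dropped.
    return hits
```

For real transcripts, `min_codons` can be left at its default. Lowering it is useful only for checking the function on short toy sequences.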

SUBMITTER: Vasu K 

PROVIDER: S-EPMC9580016 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

Publications

Analysis of nested alternate open reading frames and their encoded proteins.

Kommireddy Vasu, Debjit Khan, Iyappan Ramachandiran, Daniel Blankenberg, Paul L. Fox

NAR Genomics and Bioinformatics, 2022 Oct 19; 4


Similar Datasets

| S-EPMC3754608 | biostudies-literature
2020-03-14 | GSE131650 | GEO
| S-EPMC528919 | biostudies-literature
| S-EPMC8195866 | biostudies-literature
2021-04-28 | GSE154491 | GEO
| S-EPMC9122824 | biostudies-literature
| S-EPMC9757701 | biostudies-literature
| S-EPMC9974104 | biostudies-literature
2020-03-06 | PXD014031 | Pride
2019-07-03 | GSE125218 | GEO