Unknown

Dataset Information

0

Evidence for conservation and selection of upstream open reading frames suggests probable encoding of bioactive peptides.


ABSTRACT:

Background

Approximately 40% of mammalian mRNA sequences contain AUG trinucleotides upstream of the main coding sequence, with a quarter of these AUGs demarcating open reading frames of 20 or more codons. In order to investigate whether these open reading frames may encode functional peptides, we have carried out a comparative genomic analysis of human and mouse mRNA 'untranslated regions' using sequences from the RefSeq mRNA sequence database.

Results

We have identified over 200 upstream open reading frames which are strongly conserved between the human and mouse genomes. Consensus sequences associated with efficient initiation of translation are overrepresented at the AUG trinucleotides of these upstream open reading frames, while comparative analysis of their DNA and putative peptide sequences shows evidence of purifying selection.

Conclusion

The occurrence of a large number of conserved upstream open reading frames, in association with features consistent with protein translation, strongly suggests evolutionary maintenance of the coding sequence and indicates probable functional expression of the peptides encoded within these upstream open reading frames.

SUBMITTER: Crowe ML 

PROVIDER: S-EPMC1402274 | biostudies-literature | 2006 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evidence for conservation and selection of upstream open reading frames suggests probable encoding of bioactive peptides.

Crowe Mark L ML   Wang Xue-Qing XQ   Rothnagel Joseph A JA  

BMC genomics 20060126


<h4>Background</h4>Approximately 40% of mammalian mRNA sequences contain AUG trinucleotides upstream of the main coding sequence, with a quarter of these AUGs demarcating open reading frames of 20 or more codons. In order to investigate whether these open reading frames may encode functional peptides, we have carried out a comparative genomic analysis of human and mouse mRNA 'untranslated regions' using sequences from the RefSeq mRNA sequence database.<h4>Results</h4>We have identified over 200  ...[more]

Similar Datasets

| S-EPMC9122824 | biostudies-literature
| S-EPMC2527020 | biostudies-literature
| S-EPMC5587730 | biostudies-literature
| S-EPMC9851245 | biostudies-literature
| S-EPMC2813248 | biostudies-literature
| S-EPMC3710870 | biostudies-literature
2018-04-30 | GSE105082 | GEO
2018-09-07 | GSE119615 | GEO
| S-EPMC10983949 | biostudies-literature
| S-EPMC6528255 | biostudies-literature