Dataset Information

A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.


ABSTRACT: BACKGROUND: Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. RESULTS: We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. CONCLUSIONS: We estimate that ~55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.

SUBMITTER: Hezroni H 

PROVIDER: S-EPMC5577775 | biostudies-literature | 2017 Aug

REPOSITORIES: biostudies-literature

Publications

A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.

Hezroni H, Ben-Tov Perry R, Meir Z, Housman G, Lubelsky Y, Ulitsky I

Genome biology 20170830 1


Similar Datasets

| S-EPMC3492712 | biostudies-other
| S-EPMC10487962 | biostudies-literature
| S-EPMC2859052 | biostudies-literature
| S-EPMC3699063 | biostudies-literature
| S-EPMC8112034 | biostudies-literature
| S-EPMC6261887 | biostudies-literature
2020-01-09 | PXD014553 | Pride
| S-EPMC3477000 | biostudies-literature
| S-EPMC3633045 | biostudies-literature
| S-EPMC3441637 | biostudies-literature