Unknown

Dataset Information

0

Comparison of splice sites reveals that long noncoding RNAs are evolutionarily well conserved.


ABSTRACT: Large-scale RNA sequencing has revealed a large number of long mRNA-like transcripts (lncRNAs) that do not code for proteins. The evolutionary history of these lncRNAs has been notoriously hard to study systematically due to their low level of sequence conservation that precludes comprehensive homology-based surveys and makes them nearly impossible to align. An increasing number of special cases, however, has been shown to be at least as old as the vertebrate lineage. Here we use the conservation of splice sites to trace the evolution of lncRNAs. We show that >85% of the human GENCODE lncRNAs were already present at the divergence of placental mammals and many hundreds of these RNAs date back even further. Nevertheless, we observe a fast turnover of intron/exon structures. We conclude that lncRNA genes are evolutionary ancient components of vertebrate genomes that show an unexpected and unprecedented evolutionary plasticity. We offer a public web service (http://splicemap.bioinf.uni-leipzig.de) that allows to retrieve sets of orthologous splice sites and to produce overview maps of evolutionarily conserved splice sites for visualization and further analysis. An electronic supplement containing the ncRNA data sets used in this study is available at http://www.bioinf.uni-leipzig.de/publications/supplements/12-001.

SUBMITTER: Nitsche A 

PROVIDER: S-EPMC4408788 | biostudies-literature | 2015 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Comparison of splice sites reveals that long noncoding RNAs are evolutionarily well conserved.

Nitsche Anne A   Rose Dominic D   Fasold Mario M   Reiche Kristin K   Stadler Peter F PF  

RNA (New York, N.Y.) 20150323 5


Large-scale RNA sequencing has revealed a large number of long mRNA-like transcripts (lncRNAs) that do not code for proteins. The evolutionary history of these lncRNAs has been notoriously hard to study systematically due to their low level of sequence conservation that precludes comprehensive homology-based surveys and makes them nearly impossible to align. An increasing number of special cases, however, has been shown to be at least as old as the vertebrate lineage. Here we use the conservatio  ...[more]

Similar Datasets

| S-EPMC4615893 | biostudies-other
| S-EPMC3699063 | biostudies-literature
| S-EPMC6451187 | biostudies-literature
| S-EPMC4010157 | biostudies-literature
| S-EPMC10078953 | biostudies-literature
| S-EPMC4428586 | biostudies-literature
2018-04-06 | GSE69455 | GEO
2018-04-06 | GSE69451 | GEO
| S-EPMC6333433 | biostudies-literature
| S-EPMC5566112 | biostudies-other