Dataset Information

The vast, conserved mammalian lincRNome.


ABSTRACT: We compare the sets of experimentally validated long intergenic non-coding (linc)RNAs from human and mouse and apply a maximum likelihood approach to estimate the total number of lincRNA genes as well as the size of the conserved part of the lincRNome. Under the assumption that the sets of experimentally validated lincRNAs are random samples of the lincRNomes of the corresponding species, we estimate the total lincRNome size at approximately 40,000 to 50,000 species, at least twice the number of protein-coding genes. We further estimate that the fraction of the human and mouse euchromatic genomes encoding lincRNAs is more than twofold greater than the fraction of protein-coding sequences. Although the sequences of most lincRNAs are much less strongly conserved than protein sequences, the extent of orthology between the lincRNomes is unexpectedly high, with 60 to 70% of the lincRNA genes shared between human and mouse. The orthologous mammalian lincRNAs can be predicted to perform equivalent functions; accordingly, it appears likely that thousands of evolutionarily conserved functional roles of lincRNAs remain to be characterized.
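The idea of estimating a total from random samples can be illustrated with a toy calculation. The sketch below is not the authors' model, which this record does not describe in detail. It is the classic two-sample (capture-recapture) setting, assumed here purely for illustration: two random samples are drawn from one population of unknown size, and the size that maximizes the hypergeometric likelihood of the observed overlap is taken as the estimate. The function names and all numbers are hypothetical and are not data from the study.

```python
from math import lgamma


def log_comb(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def log_likelihood(total, n_first, n_second, overlap):
    """Hypergeometric log-likelihood of observing `overlap` shared items
    when the second random sample (size n_second) is drawn from a
    population of size `total` that contains n_first marked items."""
    return (log_comb(n_first, overlap)
            + log_comb(total - n_first, n_second - overlap)
            - log_comb(total, n_second))


def ml_total(n_first, n_second, overlap, max_total):
    """Grid search for the population size with the highest likelihood."""
    # The population must at least hold every distinct item observed.
    smallest = n_first + n_second - overlap
    return max(
        range(smallest, max_total + 1),
        key=lambda total: log_likelihood(total, n_first, n_second, overlap),
    )


# Toy numbers only (hypothetical, not taken from the paper).
estimate = ml_total(n_first=200, n_second=150, overlap=13, max_total=10_000)
print(estimate)
```

For this simple model the maximum sits just below n_first * n_second / overlap, so a small overlap between two random samples implies a large underlying population. That is the intuition behind inferring a large lincRNome from comparatively small validated sets, under the random-sampling assumption stated in the abstract.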

SUBMITTER: Managadze D 

PROVIDER: S-EPMC3585383 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

The vast, conserved mammalian lincRNome.

David Managadze, Alexander E. Lobkovsky, Yuri I. Wolf, Svetlana A. Shabalina, Igor B. Rogozin, Eugene V. Koonin

PLoS Computational Biology, 2013-02-28, issue 2


Similar Datasets

| S-EPMC2809750 | biostudies-literature
| S-EPMC3539715 | biostudies-literature
| S-EPMC548335 | biostudies-literature
| S-EPMC3157836 | biostudies-literature
| S-EPMC4197820 | biostudies-literature
| S-EPMC3679889 | biostudies-literature
| S-EPMC2176071 | biostudies-literature
| S-EPMC4254068 | biostudies-literature
| S-EPMC8202008 | biostudies-literature
| S-EPMC8360379 | biostudies-literature