Expression of novel long noncoding RNAs during erythro-megakaryopoiesis and GATA1-induced erythroid differentiation using RNA-seq
Ontology highlight
ABSTRACT: Mammals express thousands of long noncoding (lnc) RNAs, a few of which are shown to function in tissue development. However, the entire repertoire of lncRNAs and the extent to which they regulate biological processes in different tissues and species are not defined. Indeed, most lncRNAs are not conserved between species, raising questions about function. We used RNA-Seq to identify lncRNAs in primary murine fetal liver erythroblasts expressing the lineage marker TER119, megakaryocytes (CD41+) cultured from embryonic day (E) 14.5 murine fetal liver and megakaryocyte erythroid progenitors (MEPs) isolated from mouse bone marrow. Strand-specific, paired-end, deep sequencing was performed on polyA+ mRNA from biological replicates of each sample. We assembled the transcriptomes using the Cufflinks package and generated a high-confidence transcriptome of 13,131 genes expressed in the three cell types. We identified 683 and 594 polyadenylated lncRNAs expressed in red blood cell (erythroid) precursors of mice and humans, respectively. More than one half of erythroid lncRNAs are un-annotated, emphasizing the opportunity for new discovery through studies of specialized cell types. Analysis of the murine erythroid lncRNA transcriptome indicates that ~75% arise from promoters and 25% from enhancers, many of which are regulated by the key erythroid transcription factors GATA1 and SCL/TAL1. Erythroid lncRNA expression is largely conserved among 8 different mouse strains, yet only 15% of mouse lncRNAs are expressed in humans and vice versa, reflecting dramatically greater species-specificity than coding genes. We investigated potential functions of 21 relatively abundant erythroid-specific murine lncRNAs (both conserved and non-conserved) by RNA interference in primary mouse erythroid precursors, and identified 7 whose knockdown inhibited features of terminal erythroid maturation including cell size reduction and enucleation. Strikingly, at least 6 of the 7 lncRNAs have no detectable expression in human erythroblasts, demonstrating that lack of conservation between mammalian species does not predict lack of function. These results reflect marked evolutionary differences between protein-coding genes and lncRNAs and indicate that the latter exert tissue- and species-specific roles in development. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf
ORGANISM(S): Mus musculus
PROVIDER: GSE51667 | GEO | 2014/02/20
SECONDARY ACCESSION(S): PRJNA224576
REPOSITORIES: GEO
ACCESS DATA