Integrative classification of human coding and non-coding genes based on RNA metabolism profiles
Ontology highlight
ABSTRACT: The pervasive transcription of the human genome results in a heterogeneous mix of coding and long non-coding RNAs (lncRNAs). Only a small fraction of lncRNAs possess demonstrated regulatory functions, making it difficult to distinguish functional lncRNAs from non-functional transcriptional byproducts. This has resulted in numerous competing incoherent annotations of human lncRNA. To address these challenges, we quantitatively examined transcription, splicing, degradation, localization and translation for coding and non-coding human genes. Annotated lncRNAs had lower synthesis and higher degradation rates than mRNAs. We discovered mechanistic differences explaining the slower splicing of lncRNAs. We grouped genes into coherent classes by performing annotation-agnostic unsupervised classification of RNA metabolism profiles. These classes contained both mRNAs and lncRNAs to varying degrees; they exhibited distinct relationships between steps of RNA metabolism, evolutionary patterns, and sensitivity to cellular RNA regulatory pathways. Our classification provides a functionally coherent alternative to genomic context-driven annotations of lncRNAs.
ORGANISM(S): Homo sapiens
PROVIDER: GSE84722 | GEO | 2016/11/04
SECONDARY ACCESSION(S): PRJNA336176
REPOSITORIES: GEO
ACCESS DATA