Landscape and variation of RNA secondary structure across the human transcriptome
Ontology highlight
ABSTRACT: In parallel to the genetic code for protein synthesis, a second layer of information is embedded in all RNA transcripts in the form of RNA structure. The ability of RNA to base pair with itself and other nucleic acids endow RNA with the capacity to form extensive structures, which are known to influence practically every step in the gene expression program1. Yet the nature of most RNA structures or effects of sequence variation on structure are not known. Here we report the initial landscape and variation of RNA secondary structures (RSS) in a human family trio, providing a comprehensive RSS map of human coding and noncoding RNAs. We identify unique RSS signatures that demarcate open reading frames, splicing junctions, and define authentic microRNA binding sites. Comparison of native deproteinized RNA isolated from cells versus refolded purified RNA suggests that the majority of the RSS information is encoded within RNA sequence. Over one thousand transcribed single nucleotide variants (~15% of all transcribed SNVs) alter local RNA structure; these “RiboSNitches”2 occur in disease-associated variants. We discover simple sequence and spacing rules that determine the ability of point mutations to impact RSS. Selective depletion of RiboSNitches versus structurally synonymous variants at precise locations suggests selection for specific RNA shapes at thousands of sites, including 3’UTRs, binding sites of miRNAs and RNA binding proteins genome-wide. These results highlight the potentially broad contribution of RNA structure and its variation to gene regulation.
ORGANISM(S): Homo sapiens
PROVIDER: GSE50676 | GEO | 2013/12/19
SECONDARY ACCESSION(S): PRJNA218468
REPOSITORIES: GEO
ACCESS DATA