Dataset Information

RNA editing in the human ENCODE RNA-seq data.

ABSTRACT: RNA-seq data can be mined for sequence differences relative to the reference genome to identify both genomic SNPs and RNA editing events. We analyzed the long, polyA-selected, unstranded, deeply sequenced RNA-seq data from the ENCODE Project across 14 human cell lines for candidate RNA editing events. On average, 43% of the RNA sequencing variants that are not in dbSNP and are within gene boundaries are A-to-G(I) RNA editing candidates. The vast majority of A-to-G(I) edits are located in introns and 3' UTRs, with only 123 located in protein-coding sequence. In contrast, the majority of non-A-to-G variants (60%-80%) map near exon boundaries and have the characteristics of splice-mapping artifacts. After filtering out all candidates with evidence of private genomic variation using genome resequencing or ChIP-seq data, we find that up to 85% of the high-confidence RNA variants are A-to-G(I) editing candidates. Genes with A-to-G(I) edits are enriched in Gene Ontology terms involving cell division, viral defense, and translation. The distribution and character of the remaining non-A-to-G variants closely resemble known SNPs. We find no reproducible A-to-G(I) edits that result in nonsynonymous substitutions in all three lymphoblastoid cell lines in our study, unlike RNA editing in the brain. That only a fraction of sites are reproducibly edited in multiple cell lines, and that editing is more strongly associated with specific genes than with specific sites, suggests that the editing of the transcript is more important than the editing of any individual site.
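
The first counting step described in the abstract (drop variants already in dbSNP, keep those within gene boundaries, then ask what share are A-to-G) can be illustrated with a minimal sketch. This is not the authors' pipeline: the record layout, the function names `transcript_change` and `a_to_g_fraction`, and the toy data are all hypothetical. Because the libraries are unstranded, the sketch assumes that a T-to-C mismatch inside a minus-strand gene should be read as A-to-G on the transcript.

```python
from collections import Counter

# Base complements, used to re-express a mismatch on the transcribed strand.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def transcript_change(ref, alt, gene_strand):
    """Return the mismatch as seen on the transcript, e.g. 'A>G'.

    Assumption: reads are unstranded, so the strand is taken from the
    annotated gene overlapping the site ('+' or '-').
    """
    if gene_strand == "-":
        ref, alt = COMPLEMENT[ref], COMPLEMENT[alt]
    return f"{ref}>{alt}"


def a_to_g_fraction(variants, known_snps):
    """Fraction of non-dbSNP, in-gene variants that are A-to-G candidates."""
    counts = Counter()
    for v in variants:
        # Skip sites already catalogued as SNPs (hypothetical dbSNP lookup set).
        if (v["chrom"], v["pos"]) in known_snps:
            continue
        # Skip sites outside gene boundaries (no overlapping gene strand).
        if v["gene_strand"] is None:
            continue
        counts[transcript_change(v["ref"], v["alt"], v["gene_strand"])] += 1
    total = sum(counts.values())
    return counts["A>G"] / total if total else 0.0


# Toy data, invented purely for illustration.
variants = [
    {"chrom": "chr1", "pos": 100, "ref": "A", "alt": "G", "gene_strand": "+"},
    {"chrom": "chr1", "pos": 200, "ref": "T", "alt": "C", "gene_strand": "-"},
    {"chrom": "chr2", "pos": 300, "ref": "C", "alt": "T", "gene_strand": "+"},
    {"chrom": "chr2", "pos": 400, "ref": "A", "alt": "G", "gene_strand": "+"},
    {"chrom": "chr3", "pos": 500, "ref": "G", "alt": "A", "gene_strand": None},
]
known_snps = {("chr2", 400)}

fraction = a_to_g_fraction(variants, known_snps)
```

The sketch covers only the dbSNP and gene-boundary filters; the later steps mentioned in the abstract (removing private genomic variation with resequencing or ChIP-seq data, and flagging splice-mapping artifacts near exon boundaries) are not modelled here.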

SUBMITTER: Park E

PROVIDER: S-EPMC3431480 | biostudies-literature | 2012 Sep

REPOSITORIES: biostudies-literature

Publications

RNA editing in the human ENCODE RNA-seq data.

Park E, Williams B, Wold BJ, Mortazavi A

Genome Research, 2012-09-01, issue 9

PMID: 22955975

Similar Datasets

Differential expression analysis of human endogenous retroviruses based on ENCODE RNA-seq data.
| S-EPMC4632268 | biostudies-literature

Detection of regulatory SNPs in human genome using ChIP-seq ENCODE data.
| S-EPMC3812152 | biostudies-literature

ChromNet: Learning the human chromatin network from all ENCODE ChIP-seq data.
| S-EPMC4852466 | biostudies-literature

Splicing and editing of ionotropic glutamate receptors: a comprehensive analysis based on human RNA-Seq data.
| S-EPMC8257547 | biostudies-literature

Genome-Wide Characterization of RNA Editing Sites in Primary Gastric Adenocarcinoma through RNA-seq Data Analysis.
| S-EPMC7768588 | biostudies-literature

A novel computational strategy to identify A-to-I RNA editing sites by RNA-Seq data: de novo detection in human spinal cord tissue.
| S-EPMC3434223 | biostudies-literature

Viewing RNA-seq data on the entire human genome.
| S-EPMC5605993 | biostudies-literature

L-GIREMI uncovers RNA editing sites in long-read RNA-seq.
| S-EPMC10360234 | biostudies-literature

Nascent-seq indicates widespread cotranscriptional RNA editing in Drosophila.
| S-EPMC3409466 | biostudies-literature

Assessing the consistency of public human tissue RNA-seq data sets.
| S-EPMC4652619 | biostudies-literature