Analysis and design of RNA sequencing experiments for identifying mRNA isoform regulation
Ontology highlight
ABSTRACT: Through alternative splicing, most human genes express multiple isoforms that may have distinct or even antagonistic functions. To infer isoform regulation based on data from high-throughput sequencing of cDNA fragments (RNA-Seq), we have developed MISO, a computational model that estimates the expression level of alternatively spliced exons and mRNA isoforms and provides intuitive measures of confidence in these estimates. Incorporation of the length distribution of inserted cDNA fragments in paired-end RNA-Seq analysis in MISO enables dramatic improvements in estimation of alternative splicing levels relative to previous methods. We show that one lane of paired-end RNA-Seq data can provide far more information about splicing than two lanes of single-end data, depending critically on properties of the distribution of cDNA fragment lengths in the sequenced library. MISO also leads to an intuitive method to detect differentially regulated exons or isoforms. Application of this method implicates the RNA splicing factor hnRNP H in regulation of alternative cleavage and polyadenylation, a role that is supported by UV crosslinking/immunoprecipitation/high-throughput sequencing (CLIP-Seq) analysis. Together, our results provide a probabilistic framework for RNA-Seq analysis, derive functional insights into pre-mRNA processing, and yield guidelines for the optimal design of RNA-Seq experiments for studies of gene and isoform expression.
ORGANISM(S): Mus musculus Homo sapiens
PROVIDER: GSE23694 | GEO | 2010/11/07
SECONDARY ACCESSION(S): PRJNA130865
REPOSITORIES: GEO
ACCESS DATA