Project description:Through alternative splicing, most human genes express multiple isoforms that may have distinct or even antagonistic functions. To infer isoform regulation based on data from high-throughput sequencing of cDNA fragments (RNA-Seq), we have developed MISO, a computational model that estimates the expression level of alternatively spliced exons and mRNA isoforms and provides intuitive measures of confidence in these estimates. Incorporation of the length distribution of inserted cDNA fragments in paired-end RNA-Seq analysis in MISO enables dramatic improvements in estimation of alternative splicing levels relative to previous methods. We show that one lane of paired-end RNA-Seq data can provide far more information about splicing than two lanes of single-end data, depending critically on properties of the distribution of cDNA fragment lengths in the sequenced library. MISO also leads to an intuitive method to detect differentially regulated exons or isoforms. Application of this method implicates the RNA splicing factor hnRNP H in regulation of alternative cleavage and polyadenylation, a role that is supported by UV crosslinking/immunoprecipitation/high-throughput sequencing (CLIP-Seq) analysis. Together, our results provide a probabilistic framework for RNA-Seq analysis, derive functional insights into pre-mRNA processing, and yield guidelines for the optimal design of RNA-Seq experiments for studies of gene and isoform expression.
Project description:Purpose: Mutations in TP53 induce autoantibody immune responses in a subset of cancer patients, which have been proposed as biomarkers for early detection. Here, we investigate the association of p53 specific autoantibodies with multiple tumor subtypes and determine the association with p53 mutation status and epitope specificity. Experimental Design: IgG p53 autoantibodies (p53-AAb), were quantified in 412 serum saples using a programmable ELISA assay from patients with serous ovarian, pancreatic adenocarcinoma, and breast cancer. To determine if patients generated mutation specific autoantibodies we designed a panel of the most relevant 51 p53 point mutant proteins, to be displayed on custom programmable protein microarrays. To determine the epitope specificity we displayed 12 overlapping tiling fragments and 38 N- and C-terminal deletions spanning the length of the wild-type p53 proteins. Results: We detected p53-AAb with sensitivities of 58.8% (ovarian), 22% (pancreatic), 32% (triple negative breast cancer), and 10.2% (HER2+ breast cancer) at 94% specificity. Sera with p53-AAb contained broadly-reactive autoantibodies to 51 displayed p53 mutant proteins, demonstrating a polyclonal response to common epitopes. All p53-AAb displayed broad polyclonal immune response to both continuous and discontinuous epitopes at the N- and C-terminus as well as the DNA binding domain. Conclusion and clinical relevance: In this comprehensive analysis, mutations in tumor p53 induce strong, polyclonal autoantibodies with broadly reactive epitope specificity. The immunoreactivity was compared between 60 pancreactic ductal adenocarcinoma cases and 63 benign pancreatic disease controls against 52 unique mutant p53 and 379 human proteins that were printed on microscope slides. [Contributor] Arizona State University
Project description:Through alternative splicing, most human genes express multiple isoforms that may have distinct or even antagonistic functions. To infer isoform regulation based on data from high-throughput sequencing of cDNA fragments (RNA-Seq), we have developed MISO, a computational model that estimates the expression level of alternatively spliced exons and mRNA isoforms and provides intuitive measures of con?dence in these estimates. Incorporation of the length distribution of inserted cDNA fragments in paired-end RNA-Seq analysis in MISO enables dramatic improvements in estimation of alternative splicing levels relative to previous methods. We show that one lane of paired-end RNA-Seq data can provide far more information about splicing than two lanes of single-end data, depending critically on properties of the distribution of cDNA fragment lengths in the sequenced library. MISO also leads to an intuitive method to detect di?erentially regulated exons or isoforms. Application of this method implicates the RNA splicing factor hnRNP H in regulation of alternative cleavage and polyadenylation, a role that is supported by UV crosslinking/immunoprecipitation/high-throughput sequencing (CLIP-Seq) analysis. Together, our results provide a probabilistic framework for RNA-Seq analysis, derive functional insights into pre-mRNA processing, and yield guidelines for the optimal design of RNA-Seq experiments for studies of gene and isoform expression. CLIPseq of hnRNP H in HEK 293T cells. RNAseq of polyA+ RNA from C2C12 mouse myoblasts stably expressing an empty vector or a vector containing an shRNA against CUGBP1. Libraries of two different insert lengths were created and examined.
Project description:Although long-read single-cell RNA isoform sequencing (scISO-Seq) can reveal alternative RNA splicing in individual cells, it suffers from a low read throughput. Here, we introduce HIT-scISOseq, a method that removes most artifact cDNAs and concatenates multiple cDNAs for PacBio circular consensus sequencing (CCS) to achieve high-throughput and high-accuracy single-cell RNA isoform sequencing. HIT-scISOseq can yield >10 million high-accuracy long-reads in a single PacBio Sequel II SMRT Cell 8M. We also report the development of scISA-Tools that demultiplex HIT-scISOseq concatenated reads into single-cell cDNA reads with >99.99% accuracy and specificity. We apply HIT-scISOseq to characterize the transcriptomes of 3375 corneal limbus cells and reveal cell-type-specific isoform expression in them. HIT-scISOseq is a high-throughput, high-accuracy, technically accessible method and it can accelerate the burgeoning field of long-read single-cell transcriptomics.
Project description:The widespread application of single-cell genomics technologies has accelerated our understanding of the breadth and depth of heterogeneity of cell states across diverse contexts. As single-cell RNA sequencing (scRNA-seq) has been the most popular modality used for profiling, many populations have been described primarily based on specific marker transcript profiles compared to classical cytometry approaches relying on protein expression. Additionally, many single cell studies require the isolation of nuclei from tissue, eliminating the ability to enrich learned rare cell states based on extranuclear protein markers. To address this limitation, we describe Programmable Enrichment via RNA Flow-FISH by sequencing (PERFF-seq), a scalable assay that enables single cell and single nuclei RNA-seq profiling from subpopulations of complex cellular mixtures solely defined by the abundance of RNA transcripts. Across vignettes of immune cell populations as well as nuclei from fresh frozen and formalin-fixed paraffin-embedded brain tissue, we demonstrate the enrichment of cell populations via RNA-based cytometry upstream of high-throughput scRNA-seq. Together, our approach provides a rational, programmable method for studying cell identities and transcriptional heterogeneity of rare populations identifiable by as few as one marker transcript, advancing the rational study of cellular diversity across fresh and archived tissue materials.