Unknown

Dataset Information

0

Quantitative RNA-seq meta-analysis of alternative exon usage in C. elegans.


ABSTRACT: Almost 20 years after the completion of the C. elegans genome sequence, gene structure annotation is still an ongoing process with new evidence for gene variants still being regularly uncovered by additional in-depth transcriptome studies. While alternative splice forms can allow a single gene to encode several functional isoforms, the question of how much spurious splicing is tolerated is still heavily debated. Here we gathered a compendium of 1682 publicly available C. elegans RNA-seq data sets to increase the dynamic range of detection of RNA isoforms, and obtained robust measurements of the relative abundance of each splicing event. While most of the splicing reads come from reproducibly detected splicing events, a large fraction of purported junctions is only supported by a very low number of reads. We devised an automated curation method that takes into account the expression level of each gene to discriminate robust splicing events from potential biological noise. We found that rarely used splice sites disproportionately come from highly expressed genes and are significantly less conserved in other nematode genomes than splice sites with a higher usage frequency. Our increased detection power confirmed trans-splicing for at least 84% of C. elegans protein coding genes. The genes for which trans-splicing was not observed are overwhelmingly low expression genes, suggesting that the mechanism is pervasive but not fully captured by organism-wide RNA-seq. We generated annotated gene models including quantitative exon usage information for the entire C. elegans genome. This allows users to visualize at a glance the relative expression of each isoform for their gene of interest.

SUBMITTER: Tourasse NJ 

PROVIDER: S-EPMC5741048 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Quantitative RNA-seq meta-analysis of alternative exon usage in <i>C. elegans</i>.

Tourasse Nicolas J NJ   Millet Jonathan R M JRM   Dupuy Denis D  

Genome research 20171031 12


Almost 20 years after the completion of the <i>C. elegans</i> genome sequence, gene structure annotation is still an ongoing process with new evidence for gene variants still being regularly uncovered by additional in-depth transcriptome studies. While alternative splice forms can allow a single gene to encode several functional isoforms, the question of how much spurious splicing is tolerated is still heavily debated. Here we gathered a compendium of 1682 publicly available <i>C. elegans</i> RN  ...[more]

Similar Datasets

| S-EPMC4542614 | biostudies-literature
| S-EPMC3097375 | biostudies-literature
| S-EPMC7656758 | biostudies-literature
| S-EPMC2879520 | biostudies-literature
2020-11-18 | GSE148028 | GEO
| S-EPMC4343181 | biostudies-literature
2015-07-15 | E-GEOD-60391 | biostudies-arrayexpress
| PRJNA622919 | ENA
| S-EPMC4914109 | biostudies-literature
| S-EPMC3548711 | biostudies-literature