Unknown

Dataset Information

0

TEtranscripts: a package for including transposable elements in differential expression analysis of RNA-seq datasets.


ABSTRACT: Most RNA-seq data analysis software packages are not designed to handle the complexities involved in properly apportioning short sequencing reads to highly repetitive regions of the genome. These regions are often occupied by transposable elements (TEs), which make up between 20 and 80% of eukaryotic genomes. They can contribute a substantial portion of transcriptomic and genomic sequence reads, but are typically ignored in most analyses.Here, we present a method and software package for including both gene- and TE-associated ambiguously mapped reads in differential expression analysis. Our method shows improved recovery of TE transcripts over other published expression analysis methods, in both synthetic data and qPCR/NanoString-validated published datasets.The source code, associated GTF files for TE annotation, and testing data are freely available at http://hammelllab.labsites.cshl.edu/software.mhammell@cshl.edu.Supplementary data are available at Bioinformatics online.

SUBMITTER: Jin Y 

PROVIDER: S-EPMC4757950 | biostudies-literature | 2015 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

TEtranscripts: a package for including transposable elements in differential expression analysis of RNA-seq datasets.

Jin Ying Y   Tam Oliver H OH   Paniagua Eric E   Hammell Molly M  

Bioinformatics (Oxford, England) 20150723 22


<h4>Motivation</h4>Most RNA-seq data analysis software packages are not designed to handle the complexities involved in properly apportioning short sequencing reads to highly repetitive regions of the genome. These regions are often occupied by transposable elements (TEs), which make up between 20 and 80% of eukaryotic genomes. They can contribute a substantial portion of transcriptomic and genomic sequence reads, but are typically ignored in most analyses.<h4>Results</h4>Here, we present a meth  ...[more]

Similar Datasets

| S-EPMC9812909 | biostudies-literature
| S-EPMC4635655 | biostudies-literature
| S-EPMC4666377 | biostudies-literature
| S-EPMC9537157 | biostudies-literature
| S-EPMC7849386 | biostudies-literature
| S-EPMC4393055 | biostudies-literature
| S-EPMC7458328 | biostudies-literature
| S-EPMC3663822 | biostudies-literature
| S-EPMC9220773 | biostudies-literature
| S-EPMC8550402 | biostudies-literature