
Dataset Information


Tools and best practices for data processing in allelic expression analysis.


ABSTRACT: Allelic expression analysis has become important for integrating genome and transcriptome data to characterize various biological phenomena such as cis-regulatory variation and nonsense-mediated decay. We analyze the properties of allelic expression read count data and technical sources of error, such as low-quality or double-counted RNA-seq reads, genotyping errors, allelic mapping bias, and technical covariates due to sample preparation and sequencing, and variation in total read depth. We provide guidelines for correcting such errors, show that our quality control measures improve the detection of relevant allelic expression, and introduce tools for the high-throughput production of allelic expression data from RNA-sequencing data.

SUBMITTER: Castel SE 

PROVIDER: S-EPMC4574606 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature


Publications

Tools and best practices for data processing in allelic expression analysis.

Stephane E Castel, Ami Levy-Moonshine, Pejman Mohammadi, Eric Banks, Tuuli Lappalainen

Genome Biology, 2015-09-17



Similar Datasets

| S-EPMC6935493 | biostudies-literature
2018-06-08 | GSE107768 | GEO
2018-06-08 | GSE107767 | GEO
2018-06-08 | GSE107766 | GEO
2024-04-30 | GSE230765 | GEO
| S-EPMC5685169 | biostudies-literature
| S-EPMC4312887 | biostudies-literature
| S-EPMC7870349 | biostudies-literature
| PRJNA421341 | ENA
2021-02-24 | GSE158480 | GEO