Dataset Information

A new approach to bias correction in RNA-Seq.

ABSTRACT:

Motivation

Quantification of sequence abundance in RNA-Seq experiments is often conflated by protocol-specific sequence bias. The exact sources of the bias are unknown, but may be influenced by polymerase chain reaction amplification, or differing primer affinities and mixtures, for example. The result is decreased accuracy in many applications, such as de novo gene annotation and transcript quantification.

Results

We present a new method to measure and correct for these influences using a simple graphical model. Our model does not rely on existing gene annotations, and model selection is performed automatically making it applicable with few assumptions. We evaluate our method on several datasets, and by multiple criteria, demonstrating that it effectively decreases bias and increases uniformity. Additionally, we provide theoretical and empirical results showing that the method is unlikely to have any effect on unbiased data, suggesting it can be applied with little risk of spurious adjustment.

Availability

The method is implemented in the seqbias R/Bioconductor package, available freely under the LGPL license from http://bioconductor.org

Contact

dcjones@cs.washington.edu

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Jones DC

PROVIDER: S-EPMC3315719 | biostudies-literature | 2012 Apr

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A new approach to bias correction in RNA-Seq.

Jones Daniel C DC Ruzzo Walter L WL Peng Xinxia X Katze Michael G MG

Bioinformatics (Oxford, England) 20120128 7

<h4>Motivation</h4>Quantification of sequence abundance in RNA-Seq experiments is often conflated by protocol-specific sequence bias. The exact sources of the bias are unknown, but may be influenced by polymerase chain reaction amplification, or differing primer affinities and mixtures, for example. The result is decreased accuracy in many applications, such as de novo gene annotation and transcript quantification.<h4>Results</h4>We present a new method to measure and correct for these influence ...[more]

PMID: 22285831

Dataset Information

A new approach to bias correction in RNA-Seq.

Motivation

Results

Availability

Contact

Supplementary information

Publications

A new approach to bias correction in RNA-Seq.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

BCseq: accurate single cell RNA-seq quantification with bias correction.
| S-EPMC6101504 | biostudies-literature

Length bias correction for RNA-seq data in gene set analyses.
| S-EPMC3042188 | biostudies-literature

A new strategy to reduce allelic bias in RNA-Seq readmapping.
| S-EPMC3439884 | biostudies-literature

Bias detection and correction in RNA-Sequencing data.
| S-EPMC3149584 | biostudies-literature

IVT-seq reveals extreme bias in RNA sequencing.
| S-EPMC4197826 | biostudies-literature

iMapSplice: Alleviating reference bias through personalized RNA-seq alignment.
| S-EPMC6086400 | biostudies-literature

Correction of transposase sequence bias in ATAC-seq data with rule ensemble modeling.
| S-EPMC10236359 | biostudies-literature

Universal correction of enzymatic sequence bias
2016-12-21 | GSE92674 | GEO

Improving RNA-Seq expression estimates by correcting for fragment bias.
| S-EPMC3129672 | biostudies-literature

Gene ontology analysis for RNA-seq: accounting for selection bias.
| S-EPMC2872874 | biostudies-literature