
Dataset Information


Modeling non-uniformity in short-read rates in RNA-Seq data.


ABSTRACT: After mapping, RNA-Seq data can be summarized as a sequence of read counts, which are commonly modeled as Poisson variables with a constant rate along each transcript; this model actually fits the data poorly. We suggest using variable rates for different positions and propose two models that predict these rates from the local sequence. These models explain more than 50% of the variation and can lead to improved estimates of gene and isoform expression for both Illumina and Applied Biosystems data.
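
To make the contrast in the abstract concrete, the following is a minimal sketch comparing a constant-rate Poisson fit with a position-specific one. The abstract does not state the form of the two proposed models, so everything here is an assumption for illustration: the log-linear form of the rate, the function names `poisson_loglik` and `sequence_rates`, the `weights` dictionary, and the toy sequence and counts are all hypothetical and are not taken from the paper or its data.

```python
import math


def poisson_loglik(counts, rates):
    """Log-likelihood of independent Poisson counts with the given rates."""
    return sum(
        c * math.log(r) - r - math.lgamma(c + 1)
        for c, r in zip(counts, rates)
    )


def sequence_rates(seq, base_rate, weights, window=1):
    """Position-specific rates from the local sequence (hypothetical model).

    Assumed form: log(rate_i) = log(base_rate) + sum of weights[(offset, nucleotide)]
    over the positions within `window` of i. Missing weights count as 0.
    """
    rates = []
    for i in range(len(seq)):
        log_rate = math.log(base_rate)
        for offset in range(-window, window + 1):
            j = i + offset
            if 0 <= j < len(seq):
                log_rate += weights.get((offset, seq[j]), 0.0)
        rates.append(math.exp(log_rate))
    return rates


# Toy data (made up): read counts starting at each position of a short transcript.
seq = "ACGT"
counts = [1, 1, 6, 1]

# Constant-rate model: every position shares the mean count.
mean_rate = sum(counts) / len(counts)
constant_ll = poisson_loglik(counts, [mean_rate] * len(counts))

# Variable-rate model: a hypothetical weight that raises the rate at a G.
weights = {(0, "G"): math.log(6.0)}
variable_ll = poisson_loglik(counts, sequence_rates(seq, 1.0, weights))

print("constant-rate log-likelihood:", constant_ll)
print("variable-rate log-likelihood:", variable_ll)
```

In this toy case the weights were chosen by hand to match the counts, so the variable-rate fit is better by construction; in practice such weights would have to be estimated from many transcripts and evaluated on held-out data.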

SUBMITTER: Li J 

PROVIDER: S-EPMC2898062 | biostudies-literature | 2010

REPOSITORIES: biostudies-literature


Publications

Modeling non-uniformity in short-read rates in RNA-Seq data.

Li J, Jiang H, Wong WH

Genome Biology, 2010-05-11, issue 5



Similar Datasets

| S-EPMC8044432 | biostudies-literature
| S-EPMC3663818 | biostudies-literature
| S-EPMC11332977 | biostudies-literature
| S-EPMC3485621 | biostudies-literature
| S-EPMC3287467 | biostudies-literature
| S-EPMC6237323 | biostudies-literature
| S-EPMC11329654 | biostudies-literature
| S-EPMC3919567 | biostudies-literature
| S-EPMC6901072 | biostudies-literature
| S-EPMC4622496 | biostudies-literature