Unknown

Dataset Information

0

A Bayesian approach for accurate de novo transcriptome assembly.


ABSTRACT: De novo transcriptome assembly from billions of RNA-seq reads is very challenging due to alternative splicing and various levels of expression, which often leads to incorrect, mis-assembled transcripts. BayesDenovo addresses this problem by using both a read-guided strategy to accurately reconstruct splicing graphs from the RNA-seq data and a Bayesian strategy to estimate, from these graphs, the probability of transcript expression without penalizing poorly expressed transcripts. Simulation and cell line benchmark studies demonstrate that BayesDenovo is very effective in reducing false positives and achieves much higher accuracy than other assemblers, especially for alternatively spliced genes and for highly or poorly expressed transcripts. Moreover, BayesDenovo is more robust on multiple replicates by assembling a larger portion of common transcripts. When applied to breast cancer data, BayesDenovo identifies phenotype-specific transcripts associated with breast cancer recurrence.

SUBMITTER: Shi X 

PROVIDER: S-EPMC8417280 | biostudies-literature | 2021 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Bayesian approach for accurate de novo transcriptome assembly.

Shi Xu X   Wang Xiao X   Neuwald Andrew F AF   Halakivi-Clarke Leena L   Clarke Robert R   Xuan Jianhua J  

Scientific reports 20210903 1


De novo transcriptome assembly from billions of RNA-seq reads is very challenging due to alternative splicing and various levels of expression, which often leads to incorrect, mis-assembled transcripts. BayesDenovo addresses this problem by using both a read-guided strategy to accurately reconstruct splicing graphs from the RNA-seq data and a Bayesian strategy to estimate, from these graphs, the probability of transcript expression without penalizing poorly expressed transcripts. Simulation and  ...[more]

Similar Datasets

| S-EPMC4495290 | biostudies-literature
| S-EPMC8590762 | biostudies-literature
| S-EPMC3288049 | biostudies-literature
| S-EPMC4070175 | biostudies-literature
| S-EPMC5200869 | biostudies-literature
| S-EPMC4892416 | biostudies-literature
| S-EPMC6078068 | biostudies-literature
| S-EPMC5411768 | biostudies-literature
| S-EPMC4664767 | biostudies-literature
| S-EPMC4878842 | biostudies-literature