Dataset Information

Ryūtō: network-flow based transcriptome reconstruction.


ABSTRACT: BACKGROUND: The rapid increase in high-throughput sequencing of RNA (RNA-seq) has led to tremendous improvements in the detection and reconstruction of both coding and non-coding expressed RNA transcripts. Yet the complete and accurate annotation of the complex transcriptional output of the human genome, and of other genomes, has remained elusive. One of the critical bottlenecks in this endeavor is the computational reconstruction of transcript structures, which is hampered by high noise levels, technological limits, and other biases in the raw data. RESULTS: We introduce several new and improved algorithms in a novel workflow for transcript assembly and quantification. We propose an extension of the common splice graph framework that combines aspects of overlap and bin graphs and makes it possible to use both multi-splice and paired-end information efficiently and to the fullest extent. Phasing information of reads is used to further resolve loci. The decomposition of read coverage patterns is modeled as a minimum-cost flow problem to account for the unavoidable non-uniformities of RNA-seq data. CONCLUSION: Ryūtō's performance compares favorably with state-of-the-art methods on both simulated and real-life datasets. Ryūtō calls 1-4% more true transcripts while making 5-35% fewer false predictions than the next best competitor.
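
The abstract describes transcripts being recovered by decomposing flow on a splice graph. As a rough illustration only, the sketch below decomposes an already balanced flow on a toy directed acyclic graph into source-to-sink paths by repeatedly removing the path with the largest bottleneck. This greedy widest-path heuristic is a generic stand-in, not the minimum-cost flow formulation used by Ryūtō; the node names, coverage values, and the functions `widest_path` and `decompose` are all hypothetical.

```python
from collections import defaultdict


def widest_path(flow, source, sink, order):
    """Find the source-to-sink path with the largest bottleneck flow.

    `order` must be a topological order of the DAG's nodes.
    Returns (bottleneck, path); (0, []) if no path with positive flow exists.
    """
    best = {source: (float("inf"), None)}  # node -> (bottleneck, predecessor)
    for u in order:
        if u not in best:
            continue  # u is not reachable through positive-flow edges
        for v, f in flow[u].items():
            if f <= 0:
                continue
            candidate = min(best[u][0], f)
            if v not in best or candidate > best[v][0]:
                best[v] = (candidate, u)
    if sink not in best:
        return 0, []
    path, node = [], sink
    while node is not None:
        path.append(node)
        node = best[node][1]
    return best[sink][0], path[::-1]


def decompose(edges, source, sink, order):
    """Greedily split a balanced edge flow into weighted paths."""
    flow = defaultdict(dict)
    for (u, v), f in edges.items():
        flow[u][v] = f
    paths = []
    while True:
        bottleneck, path = widest_path(flow, source, sink, order)
        if bottleneck <= 0:
            break
        for u, v in zip(path, path[1:]):
            flow[u][v] -= bottleneck  # remove the extracted path's flow
        paths.append((path, bottleneck))
    return paths


# Hypothetical toy splice graph: exon e2 is skipped by some transcripts.
EDGES = {
    ("s", "e1"): 10,
    ("e1", "e2"): 6,
    ("e1", "e3"): 4,
    ("e2", "e3"): 6,
    ("e3", "t"): 10,
}
ORDER = ["s", "e1", "e2", "e3", "t"]

result = decompose(EDGES, "s", "t", ORDER)
for path, weight in result:
    print(" -> ".join(path), weight)
```

In this toy case the flow splits into one path that includes e2 and one that skips it. Real coverage data is not balanced at every node, which is the non-uniformity the abstract says the minimum-cost flow model is meant to absorb.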

SUBMITTER: Gatter T 

PROVIDER: S-EPMC6469118 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature

Publications

Ryūtō: network-flow based transcriptome reconstruction.

Thomas Gatter, Peter F. Stadler

BMC Bioinformatics 2019-04-16 1


Similar Datasets

| S-EPMC9129043 | biostudies-literature
| S-EPMC4229114 | biostudies-literature
| S-EPMC4070074 | biostudies-literature
| S-EPMC5353859 | biostudies-literature
| S-EPMC9122910 | biostudies-literature
| S-EPMC5751815 | biostudies-literature
| S-EPMC4975510 | biostudies-literature
| S-EPMC9677477 | biostudies-literature
| S-EPMC9235085 | biostudies-literature
| S-EPMC6019143 | biostudies-literature