Dataset Information

Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data.


ABSTRACT: Fusion transcripts are formed by either fusion genes (DNA level) or trans-splicing events (RNA level). They have been recognized as a promising tool for diagnosing, subtyping and treating cancers. RNA-seq has become a precise and efficient standard for genome-wide screening of such aberrant events. Many fusion transcript detection algorithms have been developed for paired-end RNA-seq data, but their performance has not been comprehensively evaluated to guide practitioners. In this paper, we evaluated 15 popular algorithms by their precision and recall trade-off, accuracy of supporting reads and computational cost. We further combined top-performing methods for improved ensemble detection. Fifteen fusion transcript detection tools were compared using three synthetic data sets under different coverage, read length, insert size and background noise, and three real data sets with selected experimental validations. No single method dominated, but SOAPfuse generally performed well, followed by FusionCatcher and JAFFA. We further demonstrated the potential of a meta-caller algorithm that combines top-performing methods to re-prioritize candidate fusion transcripts with high confidence, which can then be followed by experimental validation. Our results provide recommendations for applying individual tools or combining top performers to identify fusion transcript candidates.
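The idea of combining callers and scoring them by precision and recall can be illustrated with a minimal sketch. This is not the meta-caller described in the paper; it assumes a simple voting rule (keep a candidate if at least `min_tools` tools report it), and the function names `meta_call` and `precision_recall`, the gene pairs, and the per-tool calls are all hypothetical toy values.

```python
from collections import defaultdict


def meta_call(calls_by_tool, min_tools=2):
    """Rank fusion candidates by how many tools report them (simple voting).

    Assumption: a fusion is identified by an unordered gene pair, so
    A-B and B-A count as the same candidate.
    """
    support = defaultdict(set)
    for tool, fusions in calls_by_tool.items():
        for gene_a, gene_b in fusions:
            key = tuple(sorted((gene_a, gene_b)))
            support[key].add(tool)
    # Most-supported candidates first; ties broken alphabetically by gene pair.
    ranked = sorted(support.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return [(pair, sorted(tools)) for pair, tools in ranked if len(tools) >= min_tools]


def precision_recall(predicted, truth):
    """Precision and recall of a predicted set against a known truth set."""
    predicted, truth = set(predicted), set(truth)
    true_positives = len(predicted & truth)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Toy calls, invented for illustration only (not results from the paper).
calls = {
    "SOAPfuse": [("BCR", "ABL1"), ("TMPRSS2", "ERG"), ("GENE1", "GENE2")],
    "FusionCatcher": [("BCR", "ABL1"), ("ERG", "TMPRSS2")],
    "JAFFA": [("BCR", "ABL1"), ("GENE3", "GENE4")],
}
consensus = meta_call(calls, min_tools=2)
truth = {("ABL1", "BCR"), ("ERG", "TMPRSS2"), ("GENE3", "GENE4")}
precision, recall = precision_recall([pair for pair, _ in consensus], truth)
```

In this toy case the voting rule keeps only candidates reported by two or more tools, which raises precision at the cost of missing a true fusion that a single tool found. That is the precision and recall trade-off the abstract refers to.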

SUBMITTER: Liu S 

PROVIDER: S-EPMC4797269 | biostudies-other | 2016 Mar

REPOSITORIES: biostudies-other

Publications

Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data.

Liu S, Tsai WH, Ding Y, Chen R, Fang Z, Huo Z, Kim S, Ma T, Chang TY, Priedigkeit NM, Lee AV, Luo J, Wang HW, Chung IF, Tseng GC

Nucleic Acids Research, 2015 Nov 17, issue 5


Similar Datasets

| S-EPMC5737728 | biostudies-literature
| S-EPMC6211471 | biostudies-literature
| S-EPMC4054009 | biostudies-literature
| S-EPMC3691734 | biostudies-literature
| S-EPMC2708976 | biostudies-literature
| S-EPMC2916723 | biostudies-literature
| S-EPMC10594700 | biostudies-literature
| S-EPMC2919714 | biostudies-literature
| S-EPMC3091304 | biostudies-literature
| S-EPMC4516138 | biostudies-literature