Dataset Information

InFusion: Advancing Discovery of Fusion Genes and Chimeric Transcripts from Deep RNA-Sequencing Data.


ABSTRACT: Analysis of fusion transcripts has become increasingly important due to their link with cancer development. Since high-throughput sequencing approaches survey fusion events exhaustively, several computational methods for the detection of gene fusions from RNA-seq data have been developed. This kind of analysis, however, is complicated by native trans-splicing events, the splicing-induced complexity of the transcriptome, and biases and artefacts introduced in experiments and data analysis. There are a number of tools available for the detection of fusions from RNA-seq data; however, certain differences in specificity and sensitivity between commonly used approaches have been found. The ability to detect gene fusions of different types, including isoform fusions and fusions involving non-coding regions, has not yet been thoroughly studied. Here, we propose a novel computational toolkit called InFusion for fusion gene detection from RNA-seq data. InFusion introduces several unique features, such as discovery of fusions involving intergenic regions, and detection of anti-sense transcription in chimeric RNAs based on strand-specificity. Our approach demonstrates superior detection accuracy on simulated data and several public RNA-seq datasets. This improved performance was also evident when evaluating data from RNA deep-sequencing of two well-established prostate cancer cell lines. InFusion identified 26 novel fusion events that were validated in vitro, including alternatively spliced gene fusion isoforms and chimeric transcripts that include intergenic regions. The toolkit is freely available to download from http://bitbucket.org/kokonech/infusion.

SUBMITTER: Okonechnikov K 

PROVIDER: S-EPMC5132003 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

Publications

InFusion: Advancing Discovery of Fusion Genes and Chimeric Transcripts from Deep RNA-Sequencing Data.

Konstantin Okonechnikov, Aki Imai-Matsushima, Lukas Paul, Alexander Seitz, Thomas F. Meyer, Fernando Garcia-Alcalde

PLoS One, 2016-12-01, issue 12


Similar Datasets

2014-12-01 | GSE56512 | GEO
| S-EPMC6041974 | biostudies-literature
| S-EPMC3439898 | biostudies-other
| S-EPMC8673554 | biostudies-literature
| S-EPMC3245612 | biostudies-literature
| S-EPMC6437215 | biostudies-literature
| S-EPMC3799197 | biostudies-literature
| S-EPMC1183565 | biostudies-literature
| S-EPMC3060884 | biostudies-literature
| S-EPMC4823188 | biostudies-literature