Unknown

Dataset Information

0

A fast detection of fusion genes from paired-end RNA-seq data.


ABSTRACT: BACKGROUND:Fusion genes are known to be drivers of many common cancers, so they are potential markers for diagnosis, prognosis or therapy response. The advent of paired-end RNA sequencing enhances our ability to discover fusion genes. While there are available methods, routine analyses of large number of samples are still limited due to high computational demands. RESULTS:We develop FuSeq, a fast and accurate method to discover fusion genes based on quasi-mapping to quickly map the reads, extract initial candidates from split reads and fusion equivalence classes of mapped reads, and finally apply multiple filters and statistical tests to get the final candidates. We apply FuSeq to four validated datasets: breast cancer, melanoma and glioma datasets, and one spike-in dataset. The results reveal high sensitivity and specificity in all datasets, and compare well against other methods such as FusionMap, TRUP, TopHat-Fusion, SOAPfuse and JAFFA. In terms of computational time, FuSeq is two-fold faster than FusionMap and orders of magnitude faster than the other methods. CONCLUSIONS:With this advantage of less computational demands, FuSeq makes it practical to investigate fusion genes in large numbers of samples. FuSeq is implemented in C++ and R, and available at https://github.com/nghiavtr/FuSeq for non-commercial uses.

SUBMITTER: Vu TN 

PROVIDER: S-EPMC6211471 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A fast detection of fusion genes from paired-end RNA-seq data.

Vu Trung Nghia TN   Deng Wenjiang W   Trac Quang Thinh QT   Calza Stefano S   Hwang Woochang W   Pawitan Yudi Y  

BMC genomics 20181101 1


<h4>Background</h4>Fusion genes are known to be drivers of many common cancers, so they are potential markers for diagnosis, prognosis or therapy response. The advent of paired-end RNA sequencing enhances our ability to discover fusion genes. While there are available methods, routine analyses of large number of samples are still limited due to high computational demands.<h4>Results</h4>We develop FuSeq, a fast and accurate method to discover fusion genes based on quasi-mapping to quickly map th  ...[more]

Similar Datasets

| S-EPMC4054009 | biostudies-literature
| S-EPMC2919714 | biostudies-literature
| S-EPMC3691734 | biostudies-literature
| S-EPMC5737728 | biostudies-literature
| S-EPMC2916723 | biostudies-literature
| S-EPMC4797269 | biostudies-other
| S-EPMC3278765 | biostudies-literature
| S-EPMC3091304 | biostudies-literature
| S-EPMC5209911 | biostudies-literature
| S-EPMC6065480 | biostudies-literature