Unknown

Dataset Information

0

SeqOthello: querying RNA-seq experiments at scale.


ABSTRACT: We present SeqOthello, an ultra-fast and memory-efficient indexing structure to support arbitrary sequence query against large collections of RNA-seq experiments. It takes SeqOthello only 5 min and 19.1 GB memory to conduct a global survey of 11,658 fusion events against 10,113 TCGA Pan-Cancer RNA-seq datasets. The query recovers 92.7% of tier-1 fusions curated by TCGA Fusion Gene Database and reveals 270 novel occurrences, all of which are present as tumor-specific. By providing a reference-free, alignment-free, and parameter-free sequence search system, SeqOthello will enable large-scale integrative studies using sequence-level data, an undertaking not previously practicable for many individual labs.

SUBMITTER: Yu Y 

PROVIDER: S-EPMC6194578 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

SeqOthello: querying RNA-seq experiments at scale.

Yu Ye Y   Liu Jinpeng J   Liu Xinan X   Zhang Yi Y   Magner Eamonn E   Lehnert Erik E   Qian Chen C   Liu Jinze J  

Genome biology 20181019 1


We present SeqOthello, an ultra-fast and memory-efficient indexing structure to support arbitrary sequence query against large collections of RNA-seq experiments. It takes SeqOthello only 5 min and 19.1 GB memory to conduct a global survey of 11,658 fusion events against 10,113 TCGA Pan-Cancer RNA-seq datasets. The query recovers 92.7% of tier-1 fusions curated by TCGA Fusion Gene Database and reveals 270 novel occurrences, all of which are present as tumor-specific. By providing a reference-fre  ...[more]

Similar Datasets

| S-EPMC5870547 | biostudies-literature
| S-EPMC3166838 | biostudies-literature
| S-EPMC9302581 | biostudies-literature
| S-EPMC6030839 | biostudies-literature
| S-EPMC4481848 | biostudies-literature
| S-EPMC4520150 | biostudies-literature
| S-EPMC4764726 | biostudies-literature
| S-EPMC4589503 | biostudies-literature
| S-EPMC3535702 | biostudies-literature
| S-EPMC5294840 | biostudies-literature