Dataset Information

Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco.


ABSTRACT:

BACKGROUND: Read alignment and transcript assembly are the core of RNA-seq analysis for transcript isoform discovery. Nonetheless, current tools are not designed to be scalable for analysis of full-length bulk or single-cell RNA-seq (scRNA-seq) data. The previous version of our cloud-based tool Falco focuses only on RNA-seq read counting and does not allow for more flexible steps such as alignment and read assembly.

RESULTS: The Falco framework can harness the parallel and distributed computing environment in modern cloud platforms to accelerate read alignment and transcript assembly of full-length bulk RNA-seq and scRNA-seq data. There are two new modes in Falco: alignment-only and transcript assembly. In the alignment-only mode, Falco can speed up the alignment process by 2.5-16.4x, based on two public scRNA-seq datasets, when compared to alignment on a highly optimised standalone computer. It also provides a 10x average speed-up compared to alignment using Rail-RNA, a published cloud-enabled tool for read alignment. In the transcript assembly mode, Falco can speed up the transcript assembly process by 1.7-16.5x compared to performing transcript assembly on a highly optimised computer.

CONCLUSION: Falco is a significantly updated open-source big data processing framework that enables scalable and accelerated alignment and assembly of full-length scRNA-seq data on the cloud. The source code can be found at https://github.com/VCCRI/Falco.

SUBMITTER: Yang A 

PROVIDER: S-EPMC6936136 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

Publications

Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco.

Andrian Yang, Abhinav Kishore, Benjamin Phipps, Joshua W K Ho

BMC Genomics, 2019-12-30, Suppl 10


Similar Datasets

| S-EPMC3571712 | biostudies-literature
| S-EPMC7893963 | biostudies-literature
| S-EPMC9546769 | biostudies-literature
| S-ECPF-GEOD-38495 | biostudies-other
| S-EPMC3467340 | biostudies-literature
| S-EPMC8145802 | biostudies-literature
| S-EPMC8418522 | biostudies-literature
| S-EPMC8626966 | biostudies-literature
| S-EPMC5706670 | biostudies-literature