ABSTRACT:
SUBMITTER: Guo R
PROVIDER: S-EPMC6113509 | biostudies-literature | 2018 Aug
REPOSITORIES: biostudies-literature
Guo Runxin, Zhao Yi, Zou Quan, Fang Xiaodong, Peng Shaoliang
GigaScience, 2018-08-01, 8
With the rapid development of next-generation sequencing technology, ever-increasing quantities of genomic data pose a tremendous challenge to data processing. Therefore, there is an urgent need for highly scalable and powerful computational systems. Among the state-of-the-art parallel computing platforms, Apache Spark is a fast, general-purpose, in-memory, iterative computing framework for large-scale data processing that ensures high fault tolerance and high scalability by introducing the resi ...