
Dataset Information

QUARTIC: QUick pArallel algoRithms for high-Throughput sequencIng data proCessing.


ABSTRACT: Life science has entered the so-called 'big data era', in which biologists, clinicians and bioinformaticians are overwhelmed with high-throughput sequencing data. While these data offer new insights into genome structure, their ever-growing size raises major challenges for their use in daily clinical practice and diagnosis. We therefore implemented software that reduces the time to delivery for the alignment and sorting of high-throughput sequencing data. Our solution is implemented using the Message Passing Interface and is intended for high-performance computing architectures. The software scales linearly with the size of the data and ensures full reproducibility with respect to the traditional tools. For example, a 300X whole genome can be aligned and sorted in less than 9 hours with 128 cores. The software offers significant speed-up using multi-core and multi-node parallelization.

SUBMITTER: Jarlier F 

PROVIDER: S-EPMC7429925 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

Publications

QUARTIC: QUick pArallel algoRithms for high-Throughput sequencIng data proCessing.

Frédéric Jarlier, Nicolas Joly, Nicolas Fedy, Thomas Magalhaes, Leonor Sirotti, Paul Paganiban, Firmin Martin, Michael McManus, Philippe Hupé

F1000Research, 2020-04-06


Similar Datasets

| S-EPMC4051166 | biostudies-literature
| S-EPMC4110453 | biostudies-literature
| S-EPMC5860173 | biostudies-literature
| S-EPMC3832420 | biostudies-literature
| S-EPMC3464612 | biostudies-literature
| S-EPMC3458526 | biostudies-other
| S-EPMC6029622 | biostudies-literature
| S-EPMC3590217 | biostudies-literature
| S-EPMC3783187 | biostudies-literature
| S-EPMC4596663 | biostudies-literature