Ontology highlight
ABSTRACT:
SUBMITTER: Khan J
PROVIDER: S-EPMC9454175 | biostudies-literature | 2022 Sep
REPOSITORIES: biostudies-literature
Khan Jamshed J Kokot Marek M Deorowicz Sebastian S Patro Rob R
Genome biology 20220908 1
The de Bruijn graph is a key data structure in modern computational genomics, and construction of its compacted variant resides upstream of many genomic analyses. As the quantity of genomic data grows rapidly, this often forms a computational bottleneck. We present Cuttlefish 2, significantly advancing the state-of-the-art for this problem. On a commodity server, it reduces the graph construction time for 661K bacterial genomes, of size 2.58Tbp, from 4.5 days to 17-23 h; and it constructs the gr ...[more]