Ontology highlight
ABSTRACT: Motivation
In 2012, Iqbal et al. introduced the colored de Bruijn graph, a variant of the classic de Bruijn graph, which is aimed at 'detecting and genotyping simple and complex genetic variants in an individual or population'. Because they are intended to be applied to massive population level data, it is essential that the graphs be represented efficiently. Unfortunately, current succinct de Bruijn graph representations are not directly applicable to the colored de Bruijn graph, which requires additional information to be succinctly encoded as well as support for non-standard traversal operations.Results
Our data structure dramatically reduces the amount of memory required to store and use the colored de Bruijn graph, with some penalty to runtime, allowing it to be applied in much larger and more ambitious sequence projects than was previously possible.Availability and implementation
https://github.com/cosmo-team/cosmo/tree/VARI.Contact
martin.muggli@colostate.edu.Supplementary information
Supplementary data are available at Bioinformatics online.
SUBMITTER: Muggli MD
PROVIDER: S-EPMC5872255 | biostudies-literature | 2017 Oct
REPOSITORIES: biostudies-literature
Muggli Martin D MD Bowe Alexander A Noyes Noelle R NR Morley Paul S PS Belk Keith E KE Raymond Robert R Gagie Travis T Puglisi Simon J SJ Boucher Christina C
Bioinformatics (Oxford, England) 20171001 20
<h4>Motivation</h4>In 2012, Iqbal et al. introduced the colored de Bruijn graph, a variant of the classic de Bruijn graph, which is aimed at 'detecting and genotyping simple and complex genetic variants in an individual or population'. Because they are intended to be applied to massive population level data, it is essential that the graphs be represented efficiently. Unfortunately, current succinct de Bruijn graph representations are not directly applicable to the colored de Bruijn graph, which ...[more]