Unknown

Dataset Information

0

Linearization of genome sequence graphs revisited.


ABSTRACT: The need to include the genetic variation within a population into a reference genome led to the concept of a genome sequence graph. Nodes of such a graph are labeled with DNA sequences occurring in represented genomes. Due to double-stranded nature of DNA, each node may be oriented in one of two possible ways, resulting in marking one end of the labeling sequence as in-side and the other as out-side. Edges join pairs of sides and reflect adjacency between node sequences in genomes constituting the graph. Linearization of a sequence graph aims at orienting and ordering graph nodes in a way that makes it more efficient for visualization and further analysis, e.g. access and traversal. We propose a new linearization algorithm, called ALIBI - Algorithm for Linearization by Incremental graph BuIlding. The evaluation shows that ALIBI is computationally very efficient and generates high-quality results.

SUBMITTER: Lisiecka A 

PROVIDER: S-EPMC8264155 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC11362851 | biostudies-literature
| S-EPMC5455116 | biostudies-literature
| S-EPMC6521551 | biostudies-literature
| S-EPMC4924849 | biostudies-other
| S-EPMC1393169 | biostudies-literature
| S-EPMC4809059 | biostudies-literature
| S-EPMC5613400 | biostudies-literature
| S-EPMC8884192 | biostudies-literature
| S-EPMC4208594 | biostudies-literature
| S-EPMC3421212 | biostudies-literature