
Dataset Information


The design and construction of reference pangenome graphs with minigraph.


ABSTRACT: Recent advances in sequencing technologies enable the assembly of individual genomes to the quality of the reference genome. How to integrate multiple genomes from the same species and make the integrated representation accessible to biologists remains an open challenge. Here, we propose a graph-based data model and associated formats to represent multiple genomes while preserving the coordinates of the linear reference genome. We implement our ideas in the minigraph toolkit and demonstrate that we can efficiently construct a pangenome graph and compactly encode tens of thousands of structural variants missing from the current reference genome.

SUBMITTER: Li H 

PROVIDER: S-EPMC7568353 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature


Publications

The design and construction of reference pangenome graphs with minigraph.

Li H, Feng X, Chu C

Genome Biology, 2020-10-16, 1



Similar Datasets

S-EPMC9237687 | biostudies-literature
S-EPMC8388040 | biostudies-literature
S-EPMC10172123 | biostudies-literature
S-EPMC11368177 | biostudies-literature
S-EPMC7017486 | biostudies-literature
S-EPMC8519448 | biostudies-literature
S-EPMC6881350 | biostudies-literature
S-EPMC10322713 | biostudies-literature
S-EPMC10803329 | biostudies-literature
S-EPMC8275641 | biostudies-literature