Unknown

Dataset Information

0

ODGI: understanding pangenome graphs.


ABSTRACT:

Motivation

Pangenome graphs provide a complete representation of the mutual alignment of collections of genomes. These models offer the opportunity to study the entire genomic diversity of a population, including structurally complex regions. Nevertheless, analyzing hundreds of gigabase-scale genomes using pangenome graphs is difficult as it is not well-supported by existing tools. Hence, fast and versatile software is required to ask advanced questions to such data in an efficient way.

Results

We wrote ODGI, a novel suite of tools that implements scalable algorithms and has an efficient in-memory representation of DNA pangenome graphs in the form of variation graphs. ODGI supports pre-built graphs in the Graphical Fragment Assembly format. ODGI includes tools for detecting complex regions, extracting pangenomic loci, removing artifacts, exploratory analysis, manipulation, validation, and visualization. Its fast parallel execution facilitates routine pangenomic tasks, as well as pipelines that can quickly answer complex biological questions of gigabase-scale pangenome graphs.

Availability

ODGI is published as free software under the MIT open source license. Source code can be downloaded from https://github.com/pangenome/odgi and documentation is available at https://odgi.readthedocs.io. ODGI can be installed via Bioconda https://bioconda.github.io/recipes/odgi/README.html or GNU Guix https://github.com/pangenome/odgi/blob/master/guix.scm.

SUBMITTER: Guarracino A 

PROVIDER: S-EPMC9237687 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8388040 | biostudies-literature
| S-EPMC7017486 | biostudies-literature
| S-EPMC8519448 | biostudies-literature
| S-EPMC7568353 | biostudies-literature
| S-EPMC6881350 | biostudies-literature
| S-EPMC8275641 | biostudies-literature
| S-EPMC7461444 | biostudies-literature
| PRJEB15004 | ENA
| S-EPMC6419457 | biostudies-literature
| PRJEB4961 | ENA