Unknown

Dataset Information

0

KAGE: fast alignment-free graph-based genotyping of SNPs and short indels.


ABSTRACT: Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently and accurately predict genotypes. Two novel ideas improve both the speed and accuracy: a Bayesian model incorporates genotypes from thousands of individuals to improve prediction accuracy, and a computationally efficient method leverages correlation between variants. We show that the accuracy of KAGE is at par with the best existing alignment-free genotypers, while being an order of magnitude faster.

SUBMITTER: Grytten I 

PROVIDER: S-EPMC9531401 | biostudies-literature | 2022 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

KAGE: fast alignment-free graph-based genotyping of SNPs and short indels.

Grytten Ivar I   Dagestad Rand Knut K   Sandve Geir Kjetil GK  

Genome biology 20221004 1


Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently and accurately predict genotypes. Two novel ideas improve both the speed and accuracy: a Bayesian model incorporates genotypes from thousands of individuals to improve prediction accuracy, and a compu  ...[more]

Similar Datasets

| S-EPMC7605509 | biostudies-literature
| S-EPMC3926339 | biostudies-literature
| S-EPMC6913027 | biostudies-literature
| S-EPMC3206323 | biostudies-literature
| S-EPMC2442091 | biostudies-literature
| S-EPMC5013917 | biostudies-literature
| S-EPMC11863517 | biostudies-literature
| S-EPMC6436989 | biostudies-literature
| S-EPMC10311327 | biostudies-literature
| S-EPMC4080745 | biostudies-literature