
Dataset Information


Allelic decomposition and exact genotyping of highly polymorphic and structurally variant genes.


ABSTRACT: High-throughput sequencing provides the means to determine the allelic decomposition for any gene of interest, that is, the number of copies and the exact sequence content of each copy of a gene. Although many clinically and functionally important genes are highly polymorphic and have undergone structural alterations, no high-throughput sequencing data analysis tool has yet been designed to effectively solve the full allelic decomposition problem. Here we introduce a combinatorial optimization framework that successfully resolves this challenging problem, including for genes with structural alterations. We provide an associated computational tool, Aldy, that performs allelic decomposition of highly polymorphic, multi-copy genes using whole or targeted genome sequencing data. For a large, diverse sequencing data set, Aldy identifies multiple rare and novel alleles for several important pharmacogenes, significantly improving upon the accuracy and utility of current genotyping assays. As more data sets become available, we expect Aldy to become an essential component of genotyping toolkits.

SUBMITTER: Numanagic I 

PROVIDER: S-EPMC5826927 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature


Publications

Allelic decomposition and exact genotyping of highly polymorphic and structurally variant genes.

Numanagić I, Malikić S, Ford M, Qin X, Toji L, Radovich M, Skaar TC, Pratt VM, Berger B, Scherer S, Sahinalp SC

Nature Communications, 2018-02-26, issue 1



Similar Datasets

| S-EPMC2876125 | biostudies-literature
| S-EPMC4162610 | biostudies-literature
| S-EPMC6742235 | biostudies-literature
| S-EPMC4542776 | biostudies-literature
| S-EPMC4976312 | biostudies-literature
| S-EPMC3277347 | biostudies-literature
| S-EPMC3409271 | biostudies-literature
| S-EPMC5969302 | biostudies-literature
| S-EPMC1317586 | biostudies-literature
2010-11-12 | GSE21667 | GEO