Unknown

Dataset Information

0

Detecting tandem repeat variants in coding regions using code-adVNTR.


ABSTRACT: The human genome contains more than one million tandem repeats (TRs), DNA sequences containing multiple approximate copies of a motif repeated contiguously. TRs account for significant genetic variation, with 50 + diseases attributed to changes in motif number. A few diseases have been to be caused by small indels in variable number tandem repeats (VNTRs) including poly-cystic kidney disease type 1 (MCKD1) and monogenic type 1 diabetes. However, small indels in VNTRs are largely unexplored mainly due to the long and complex structure of VNTRs with multiple motifs. We developed a method, code-adVNTR, that utilizes multi-motif hidden Markov models to detect both, motif count variation and small indels, within VNTRs. In simulated data, code-adVNTR outperformed GATK-HaplotypeCaller in calling small indels within large VNTRs. We used code-adVNTR to characterize coding VNTRs in the 1000 genomes data identifying many population-specific variants, and to reliably call MUC1 mutations for MCKD1.

SUBMITTER: Park J 

PROVIDER: S-EPMC9379575 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting tandem repeat variants in coding regions using code-adVNTR.

Park Jonghun J   Bakhtiari Mehrdad M   Popp Bernt B   Wiesener Michael M   Bafna Vineet V  

iScience 20220719 8


The human genome contains more than one million tandem repeats (TRs), DNA sequences containing multiple approximate copies of a motif repeated contiguously. TRs account for significant genetic variation, with 50 + diseases attributed to changes in motif number. A few diseases have been to be caused by small indels in variable number tandem repeats (VNTRs) including poly-cystic kidney disease type 1 (MCKD1) and monogenic type 1 diabetes. However, small indels in VNTRs are largely unexplored mainl  ...[more]

Similar Datasets

| S-EPMC1273636 | biostudies-literature
| S-EPMC6292491 | biostudies-literature
| S-EPMC6102892 | biostudies-literature
| S-EPMC9174224 | biostudies-literature
| S-EPMC6917647 | biostudies-literature
| S-EPMC5135796 | biostudies-literature
| S-EPMC6460572 | biostudies-literature
| S-EPMC7918261 | biostudies-literature
| S-EPMC10101126 | biostudies-literature
| S-EPMC5552289 | biostudies-literature