Unknown

Dataset Information

0

Genome-wide survey of tandem repeats by nanopore sequencing shows that disease-associated repeats are more polymorphic in the general population.


ABSTRACT:

Background

Tandem repeats are highly mutable and contribute to the development of human disease by a variety of mechanisms. It is difficult to predict which tandem repeats may cause a disease. One hypothesis is that changeable tandem repeats are the source of genetic diseases, because disease-causing repeats are polymorphic in healthy individuals. However, it is not clear whether disease-causing repeats are more polymorphic than other repeats.

Methods

We performed a genome-wide survey of the millions of human tandem repeats using publicly available long read genome sequencing data from 21 humans. We measured tandem repeat copy number changes using tandem-genotypes. Length variation of known disease-associated repeats was compared to other repeat loci.

Results

We found that known Mendelian disease-causing or disease-associated repeats, especially CAG and 5'UTR GGC repeats, are relatively long and polymorphic in the general population. We also show that repeat lengths of two disease-causing tandem repeats, in ATXN3 and GLS, are correlated with near-by GWAS SNP genotypes.

Conclusions

We provide a catalog of polymorphic tandem repeats across a variety of repeat unit lengths and sequences, from long read sequencing data. This method especially if used in genome wide association study, may indicate possible new candidates of pathogenic or biologically important tandem repeats in human genomes.

SUBMITTER: Mitsuhashi S 

PROVIDER: S-EPMC7791882 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome-wide survey of tandem repeats by nanopore sequencing shows that disease-associated repeats are more polymorphic in the general population.

Mitsuhashi Satomi S   Frith Martin C MC   Matsumoto Naomichi N  

BMC medical genomics 20210107 1


<h4>Background</h4>Tandem repeats are highly mutable and contribute to the development of human disease by a variety of mechanisms. It is difficult to predict which tandem repeats may cause a disease. One hypothesis is that changeable tandem repeats are the source of genetic diseases, because disease-causing repeats are polymorphic in healthy individuals. However, it is not clear whether disease-causing repeats are more polymorphic than other repeats.<h4>Methods</h4>We performed a genome-wide su  ...[more]

Similar Datasets

| S-EPMC9052667 | biostudies-literature
| S-EPMC5986539 | biostudies-literature
| S-EPMC9889824 | biostudies-literature
| S-EPMC9117641 | biostudies-literature
| S-EPMC8256013 | biostudies-literature
| S-EPMC4417121 | biostudies-literature
| S-EPMC2928880 | biostudies-literature
| S-EPMC10726533 | biostudies-literature
| S-EPMC10075192 | biostudies-literature
| S-EPMC2291630 | biostudies-other