Unknown

Dataset Information

0

The landscape of human STR variation.


ABSTRACT: Short tandem repeats are among the most polymorphic loci in the human genome. These loci play a role in the etiology of a range of genetic diseases and have been frequently utilized in forensics, population genetics, and genetic genealogy. Despite this plethora of applications, little is known about the variation of most STRs in the human population. Here, we report the largest-scale analysis of human STR variation to date. We collected information for nearly 700,000 STR loci across more than 1000 individuals in Phase 1 of the 1000 Genomes Project. Extensive quality controls show that reliable allelic spectra can be obtained for close to 90% of the STR loci in the genome. We utilize this call set to analyze determinants of STR variation, assess the human reference genome's representation of STR alleles, find STR loci with common loss-of-function alleles, and obtain initial estimates of the linkage disequilibrium between STRs and common SNPs. Overall, these analyses further elucidate the scale of genetic variation beyond classical point mutations.

SUBMITTER: Willems T 

PROVIDER: S-EPMC4216929 | biostudies-literature | 2014 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

The landscape of human STR variation.

Willems Thomas T   Gymrek Melissa M   Highnam Gareth G   Mittelman David D   Erlich Yaniv Y  

Genome research 20140818 11


Short tandem repeats are among the most polymorphic loci in the human genome. These loci play a role in the etiology of a range of genetic diseases and have been frequently utilized in forensics, population genetics, and genetic genealogy. Despite this plethora of applications, little is known about the variation of most STRs in the human population. Here, we report the largest-scale analysis of human STR variation to date. We collected information for nearly 700,000 STR loci across more than 10  ...[more]

Similar Datasets

| S-EPMC3504113 | biostudies-literature
| S-EPMC3536929 | biostudies-literature
| S-EPMC6063297 | biostudies-literature
| S-EPMC5777382 | biostudies-literature
| S-EPMC5510864 | biostudies-literature
| S-EPMC3973747 | biostudies-literature
| S-EPMC5093907 | biostudies-literature
| S-EPMC4417122 | biostudies-literature
| S-EPMC5300907 | biostudies-literature
| S-EPMC6723042 | biostudies-literature