Unknown

Dataset Information

0

Population structure in a comprehensive genomic data set on human microsatellite variation.


ABSTRACT: Over the past two decades, microsatellite genotypes have provided the data for landmark studies of human population-genetic variation. However, the various microsatellite data sets have been prepared with different procedures and sets of markers, so that it has been difficult to synthesize available data for a comprehensive analysis. Here, we combine eight human population-genetic data sets at the 645 microsatellite loci they share in common, accounting for procedural differences in the production of the different data sets, to assemble a single data set containing 5795 individuals from 267 worldwide populations. We perform a systematic analysis of genetic relatedness, detecting 240 intra-population and 92 inter-population pairs of previously unidentified close relatives and proposing standardized subsets of unrelated individuals for use in future studies. We then augment the human data with a data set of 84 chimpanzees at the 246 loci they share in common with the human samples. Multidimensional scaling and neighbor-joining analyses of these data sets offer new insights into the structure of human populations and enable a comparison of genetic variation patterns in chimpanzees with those in humans. Our combined data sets are the largest of their kind reported to date and provide a resource for use in human population-genetic studies.

SUBMITTER: Pemberton TJ 

PROVIDER: S-EPMC3656735 | biostudies-literature | 2013 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Population structure in a comprehensive genomic data set on human microsatellite variation.

Pemberton Trevor J TJ   DeGiorgio Michael M   Rosenberg Noah A NA  

G3 (Bethesda, Md.) 20130520 5


Over the past two decades, microsatellite genotypes have provided the data for landmark studies of human population-genetic variation. However, the various microsatellite data sets have been prepared with different procedures and sets of markers, so that it has been difficult to synthesize available data for a comprehensive analysis. Here, we combine eight human population-genetic data sets at the 645 microsatellite loci they share in common, accounting for procedural differences in the producti  ...[more]

Similar Datasets

| S-EPMC3002246 | biostudies-literature
| S-EPMC4282691 | biostudies-literature
| S-EPMC1311907 | biostudies-literature
| PRJEB76848 | ENA
| S-EPMC9758492 | biostudies-literature
| S-EPMC9069078 | biostudies-literature
| S-EPMC3004464 | biostudies-literature
| S-EPMC3484644 | biostudies-literature
| S-EPMC6298057 | biostudies-literature
| S-EPMC5274581 | biostudies-literature