
Dataset Information


Differentiation of Hispanic biogeographic ancestry with 80 ancestry informative markers.


ABSTRACT: Ancestry informative single nucleotide polymorphisms (SNPs) can identify biogeographic ancestry (BGA); however, population substructure and relatively recent admixture can make differentiation difficult in heterogeneous Hispanic populations. Using unrelated individuals from the Genomic Origins and Admixture in Latinos dataset (GOAL, n = 160), we designed an 80-SNP panel (Setser80) that accurately depicts BGA through STRUCTURE and PCA. We compared our Setser80 to the Seldin and Kidd panels via resampling simulations, which model data based on allele frequencies. We incorporated Admixed American 1000 Genomes populations (1000G, n = 347) into a combined populations dataset to determine robustness. Using multinomial logistic regression (MLR), we compared the three panels on the combined dataset and found overall MLR classification accuracies of 93.2% for Setser80, 87.9% for the Seldin panel, and 71.4% for the Kidd panel. Naïve Bayesian classification gave similar results on the combined dataset: 91.5% for Setser80, 84.7% for the Seldin panel, and 71.1% for the Kidd panel. Although Peru and Mexico were absent from panel design, we achieved high classification accuracy on the combined populations for Peru (MLR = 100%, naïve Bayes = 98%) and Mexico (MLR = 90%, naïve Bayes = 83.4%), as evidence of the portability of the Setser80. Our results indicate the Setser80 SNP panel can reliably classify BGA for individuals of presumed Hispanic origin.
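The abstract describes resampling simulations driven by allele frequencies and a naïve Bayesian classifier, without giving implementation details. The following is only a minimal sketch of that general idea, not the authors' pipeline: it assumes two toy populations, five SNPs with made-up allele frequencies (none of them taken from the Setser80, Seldin, or Kidd panels), Hardy-Weinberg genotype proportions, independent SNPs, and equal priors. All names (FREQS, simulate_genotype, classify, simulated_accuracy) are hypothetical.

```python
import math
import random

# Hypothetical reference-allele frequencies for 5 toy SNPs in two toy
# populations. Illustrative values only; NOT from any published panel.
FREQS = {
    "pop_A": [0.9, 0.8, 0.2, 0.7, 0.1],
    "pop_B": [0.2, 0.3, 0.8, 0.1, 0.9],
}


def simulate_genotype(freqs, rng):
    """Resample one individual: reference-allele count (0, 1 or 2) per SNP."""
    return [sum(rng.random() < p for _ in range(2)) for p in freqs]


def log_likelihood(genotype, freqs):
    """Log-probability of a genotype, assuming Hardy-Weinberg proportions
    and independence between SNPs (the 'naive' assumption)."""
    total = 0.0
    for g, p in zip(genotype, freqs):
        probs = ((1 - p) ** 2, 2 * p * (1 - p), p ** 2)
        total += math.log(probs[g])
    return total


def classify(genotype, freqs_by_pop):
    """Naive Bayes with equal priors: return the most likely population."""
    return max(
        freqs_by_pop,
        key=lambda pop: log_likelihood(genotype, freqs_by_pop[pop]),
    )


def simulated_accuracy(freqs_by_pop, n_per_pop=500, seed=0):
    """Fraction of simulated individuals assigned to their true population."""
    rng = random.Random(seed)
    correct = 0
    total = 0
    for pop, freqs in freqs_by_pop.items():
        for _ in range(n_per_pop):
            genotype = simulate_genotype(freqs, rng)
            correct += classify(genotype, freqs_by_pop) == pop
            total += 1
    return correct / total


if __name__ == "__main__":
    print(f"simulated accuracy: {simulated_accuracy(FREQS):.3f}")
```

Comparing panels in this setup would amount to calling simulated_accuracy with different frequency tables. A real analysis would use many more markers and populations, and would estimate the frequencies from reference data rather than hard-code them.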

SUBMITTER: Setser CH 

PROVIDER: S-EPMC7210943 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature


Publications

Differentiation of Hispanic biogeographic ancestry with 80 ancestry informative markers.

Setser CH, Planz JV, Barber RC, Phillips NR, Chakraborty R, Cross DS

Scientific Reports, 2020-05-08, issue 1



Similar Datasets

2022-02-16 | PXD029323 | Pride
| S-EPMC9997643 | biostudies-literature
| S-EPMC6421339 | biostudies-literature
| S-EPMC6936141 | biostudies-literature
| S-EPMC8753123 | biostudies-literature
| S-EPMC2674805 | biostudies-literature
| S-EPMC5701827 | biostudies-literature
| S-EPMC10452561 | biostudies-literature
| S-EPMC6189140 | biostudies-literature
| S-EPMC2719660 | biostudies-literature