
Dataset Information


Factor analysis of ancient population genomic samples.


ABSTRACT: Recent years have seen a growing number of studies investigating evolutionary questions using ancient DNA. To address these questions, one of the most frequently used methods is principal component analysis (PCA). When PCA is applied to temporal samples, however, the sample dates are ignored during analysis, leading to imperfect representations of samples in PC plots. Here, we present a factor analysis (FA) method in which individual scores are corrected for the effect of allele frequency drift over time. We obtained exact solutions for the estimates of corrected factors, and we provided a fast algorithm for their computation. Using computer simulations and ancient European samples, we compared geometric representations obtained from FA with those obtained from PCA and from ancestry estimation programs. In admixture analyses, FA estimates agreed with tree-based statistics, and they were more accurate than those obtained from PCA projections and from ancestry estimation programs. A great advantage of FA over existing approaches is that it improves descriptive analyses of ancient DNA samples without requiring the inclusion of outgroup or present-day samples.

SUBMITTER: Francois O 

PROVIDER: S-EPMC7494920 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature


Publications

Factor analysis of ancient population genomic samples.

Olivier François, Flora Jay

Nature Communications, 2020-09-16, issue 1



Similar Datasets

| S-EPMC7248073 | biostudies-literature
| S-EPMC10692198 | biostudies-literature
| S-EPMC8175948 | biostudies-literature
| S-EPMC5867878 | biostudies-literature
| S-EPMC3390907 | biostudies-other
| S-EPMC8956381 | biostudies-literature
| S-EPMC3124491 | biostudies-literature
| PRJEB24629 | ENA
2013-03-27 | E-GEOD-45048 | biostudies-arrayexpress
| S-EPMC4275892 | biostudies-other