Unknown

Dataset Information

0

UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts.


ABSTRACT: Human populations feature both discrete and continuous patterns of variation. Current analysis approaches struggle to jointly identify these patterns because of modelling assumptions, mathematical constraints, or numerical challenges. Here we apply uniform manifold approximation and projection (UMAP), a non-linear dimension reduction tool, to three well-studied genotype datasets and discover overlooked subpopulations within the American Hispanic population, fine-scale relationships between geography, genotypes, and phenotypes in the UK population, and cryptic structure in the Thousand Genomes Project data. This approach is well-suited to the influx of large and diverse data and opens new lines of inquiry in population-scale datasets.

SUBMITTER: Diaz-Papkovich A 

PROVIDER: S-EPMC6853336 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts.

Diaz-Papkovich Alex A   Anderson-Trocmé Luke L   Ben-Eghan Chief C   Gravel Simon S  

PLoS genetics 20191101 11


Human populations feature both discrete and continuous patterns of variation. Current analysis approaches struggle to jointly identify these patterns because of modelling assumptions, mathematical constraints, or numerical challenges. Here we apply uniform manifold approximation and projection (UMAP), a non-linear dimension reduction tool, to three well-studied genotype datasets and discover overlooked subpopulations within the American Hispanic population, fine-scale relationships between geogr  ...[more]

Similar Datasets

| S-EPMC7477237 | biostudies-literature
| S-EPMC6923883 | biostudies-literature
| S-EPMC6886512 | biostudies-literature
| S-EPMC4774245 | biostudies-literature
| S-EPMC7452818 | biostudies-literature
| S-EPMC6785074 | biostudies-literature
| S-EPMC8585452 | biostudies-literature
| S-EPMC7897976 | biostudies-literature
2018-05-30 | GSE97850 | GEO
| S-EPMC6191429 | biostudies-literature