Genomics

Dataset Information

SG10K Pilot - Large-scale whole-genome sequencing of three diverse Asian populations in Singapore


ABSTRACT: Underrepresentation of Asian genomes has hindered population and medical genetics research on Asians, leading to population disparities in precision medicine. By whole-genome sequencing of 4,810 Singapore Chinese, Malays, and Indians, we found 98.3 million SNPs and small insertions/deletions, over half of which are novel. Population structure analysis demonstrated great representation of Asian genetic diversity by three ethnicities in Singapore, and revealed a Malay-related novel ancestry component. Furthermore, demographic inference suggested that Malays split from Chinese ~24,800 years ago, and experienced significant admixture with East Asians ~1,700 years ago, coinciding with the Austronesian expansion. Additionally, we identified 20 candidate loci for natural selection, among which 14 harbored robust associations with complex traits and diseases. Finally, we showed that our data can substantially improve genotype imputation in diverse Asian and Oceanian populations. These results highlight the value of our data as a resource to empower human genetics discovery across broad geographic regions.

PROVIDER: EGAS00001003875 | EGA

REPOSITORIES: EGA

Similar Datasets

2019-07-23 | GSE117339 | GEO
n/a | EGAS00001005379 | EGA
2015-01-08 | GSE53898 | GEO
2015-01-08 | E-GEOD-53898 | biostudies-arrayexpress
2016-02-04 | GSE77508 | GEO
n/a | phs000361 | dbGaP
2022-10-04 | GSE182409 | GEO
n/a | PRJNA251357 | ENA
2024-06-10 | PXD040566 | Pride
2015-01-08 | E-GEOD-53901 | biostudies-arrayexpress