Dataset Information

Genetic substructure and complex demographic history of South African Bantu speakers.


ABSTRACT: South Eastern Bantu-speaking (SEB) groups constitute more than 80% of the population in South Africa. Despite clear linguistic and geographic diversity, the genetic differences between these groups have not been systematically investigated. Based on genome-wide data of over 5000 individuals, representing eight major SEB groups, we provide strong evidence for fine-scale population structure that broadly aligns with geographic distribution and is also congruent with linguistic phylogeny (separation of Nguni, Sotho-Tswana and Tsonga speakers). Although differential Khoe-San admixture plays a key role, the structure persists after Khoe-San ancestry-masking. The timing of admixture, levels of sex-biased gene flow and population size dynamics also highlight differences in the demographic histories of individual groups. The comparisons with five Iron Age farmer genomes further support genetic continuity over ~400 years in certain regions of the country. Simulated trait genome-wide association studies further show that the observed population structure could have major implications for biomedical genomics research in South Africa.

SUBMITTER: Sengupta D 

PROVIDER: S-EPMC8027885 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2953791 | biostudies-literature
| S-EPMC3774671 | biostudies-literature
| S-EPMC6363051 | biostudies-literature
| S-EPMC6134013 | biostudies-literature
| S-EPMC10568978 | biostudies-literature
| S-EPMC8487618 | biostudies-literature
| PRJEB31715 | ENA
| PRJNA518707 | ENA
| PRJNA518708 | ENA
| S-EPMC4047067 | biostudies-literature