Ontology highlight
ABSTRACT: Objective
We investigate surname affinities among areas of modern-day China, by constructing a spatial network, and making community detection. It reports a geographical genealogy of the Chinese population that is result of population origins, historical migrations, and societal evolutions.Materials and methods
We acquire data from the census records supplied by China's National Citizen Identity Information System, including the surname and regional information of 1.28 billion registered Chinese citizens. We propose a multilayer minimum spanning tree (MMST) to construct a spatial network based on the matrix of isonymic distances, which is often used to characterize the dissimilarity of surname structure among areas. We use the fast unfolding algorithm to detect network communities.Results
We obtain a 10-layer MMST network of 362 prefecture nodes and 3,610 edges derived from the matrix of the Euclidean distances among these areas. These prefectures are divided into eight groups in the spatial network via community detection. We measure the partition by comparing the inter-distances and intra-distances of the communities and obtain meaningful regional ethnicity classification.Discussion
The visualization of the resulting communities on the map indicates that the prefectures in the same community are usually geographically adjacent. The formation of this partition is influenced by geographical factors, historic migrations, trade and economic factors, as well as isolation of culture and language. The MMST algorithm proves to be effective in geo-genealogy and ethnicity classification for it retains essential information about surname affinity and highlights the geographical consanguinity of the population.
SUBMITTER: Shi Y
PROVIDER: S-EPMC6590414 | biostudies-literature | 2019 Mar
REPOSITORIES: biostudies-literature
American journal of physical anthropology 20181226 3
<h4>Objective</h4>We investigate surname affinities among areas of modern-day China, by constructing a spatial network, and making community detection. It reports a geographical genealogy of the Chinese population that is result of population origins, historical migrations, and societal evolutions.<h4>Materials and methods</h4>We acquire data from the census records supplied by China's National Citizen Identity Information System, including the surname and regional information of 1.28 billion re ...[more]