Unknown

Dataset Information

0

Rapid and ongoing evolution of repetitive sequence structures in human centromeres.


ABSTRACT: Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.

SUBMITTER: Suzuki Y 

PROVIDER: S-EPMC7732198 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Rapid and ongoing evolution of repetitive sequence structures in human centromeres.

Suzuki Yuta Y   Myers Eugene W EW   Morishita Shinichi S  

Science advances 20201211 50


Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals charac  ...[more]

Similar Datasets

| S-EPMC309414 | biostudies-other
| S-EPMC48519 | biostudies-other
| S-EPMC2323805 | biostudies-literature
| S-EPMC7482064 | biostudies-literature
| S-EPMC2667148 | biostudies-literature
| S-EPMC97617 | biostudies-literature
2011-12-22 | E-GEOD-32880 | biostudies-arrayexpress
| S-EPMC1797245 | biostudies-literature
| S-EPMC1482600 | biostudies-literature
| S-EPMC3627088 | biostudies-literature