Unknown

Dataset Information

0

De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse populations.


ABSTRACT: The human reference genome is used extensively in modern biological research. However, a single consensus representation is inadequate to provide a universal reference structure because it is a haplotype among many in the human population. Using 10× Genomics (10×G) "Linked-Read" technology, we perform whole genome sequencing (WGS) and de novo assembly on 17 individuals across five populations. We identify 1842 breakpoint-resolved non-reference unique insertions (NUIs) that, in aggregate, add up to 2.1?Mb of so far undescribed genomic content. Among these, 64% are considered ancestral to humans since they are found in non-human primate genomes. Furthermore, 37% of the NUIs can be found in the human transcriptome and 14% likely arose from Alu-recombination-mediated deletion. Our results underline the need of a set of human reference genomes that includes a comprehensive list of alternative haplotypes to depict the complete spectrum of genetic diversity across populations.

SUBMITTER: Wong KHY 

PROVIDER: S-EPMC6072799 | biostudies-literature | 2018 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse populations.

Wong Karen H Y KHY   Levy-Sakin Michal M   Kwok Pui-Yan PY  

Nature communications 20180802 1


The human reference genome is used extensively in modern biological research. However, a single consensus representation is inadequate to provide a universal reference structure because it is a haplotype among many in the human population. Using 10× Genomics (10×G) "Linked-Read" technology, we perform whole genome sequencing (WGS) and de novo assembly on 17 individuals across five populations. We identify 1842 breakpoint-resolved non-reference unique insertions (NUIs) that, in aggregate, add up  ...[more]

Similar Datasets

| S-EPMC8170690 | biostudies-literature
| S-EPMC3469330 | biostudies-literature
| S-EPMC4581417 | biostudies-literature
| S-EPMC4595904 | biostudies-literature
| S-BSST17 | biostudies-other
| S-EPMC3765854 | biostudies-literature
| S-EPMC7768660 | biostudies-literature
| PRJEB76276 | ENA
| S-EPMC6778788 | biostudies-literature
| S-EPMC5479803 | biostudies-literature