Unknown

Dataset Information

0

Mapping the human reference genome's missing sequence by three-way admixture in Latino genomes.


ABSTRACT: A principal obstacle to completing maps and analyses of the human genome involves the genome's "inaccessible" regions: sequences (often euchromatic and containing genes) that are isolated from the rest of the euchromatic genome by heterochromatin and other repeat-rich sequence. We describe a way to localize these sequences by using ancestry linkage disequilibrium in populations that derive ancestry from at least three continents, as is the case for Latinos. We used this approach to map the genomic locations of almost 20 megabases of sequence unlocalized or missing from the current human genome reference (NCBI Genome GRCh37)-a substantial fraction of the human genome's remaining unmapped sequence. We show that the genomic locations of most sequences that originated from fosmids and larger clones can be admixture mapped in this way, by using publicly available whole-genome sequence data. Genome assembly efforts and future builds of the human genome reference will be strongly informed by this localization of genes and other euchromatic sequences that are embedded within highly repetitive pericentromeric regions.

SUBMITTER: Genovese G 

PROVIDER: S-EPMC3971454 | biostudies-literature | 2013 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mapping the human reference genome's missing sequence by three-way admixture in Latino genomes.

Genovese Giulio G   Handsaker Robert E RE   Li Heng H   Kenny Eimear E EE   McCarroll Steven A SA  

American journal of human genetics 20130808 3


A principal obstacle to completing maps and analyses of the human genome involves the genome's "inaccessible" regions: sequences (often euchromatic and containing genes) that are isolated from the rest of the euchromatic genome by heterochromatin and other repeat-rich sequence. We describe a way to localize these sequences by using ancestry linkage disequilibrium in populations that derive ancestry from at least three continents, as is the case for Latinos. We used this approach to map the genom  ...[more]

Similar Datasets

| S-EPMC1867104 | biostudies-literature
| S-EPMC9957797 | biostudies-literature
| S-EPMC6360325 | biostudies-literature
| S-EPMC3593143 | biostudies-literature
| S-EPMC4103496 | biostudies-literature
| S-EPMC7537619 | biostudies-literature
| S-EPMC4085159 | biostudies-literature
| S-EPMC1867092 | biostudies-literature
| S-EPMC10276158 | biostudies-literature
| S-EPMC4256931 | biostudies-literature