
Dataset Information

Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project.


ABSTRACT: The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variation such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon-targeted) polymorphic "young" Alu insertion events that are absent from the human reference genome. Here, we report high-resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage, with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8 and many others, including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes, collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages.
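The counts quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the authors' analysis: the variable names are hypothetical, "about 100 kb" is taken as exactly 100,000 bp, and the difference between 343 and 322 is assumed to represent sequenced events that were not unique.

```python
# Figures quoted in the abstract; names are illustrative, not from the paper.
VALIDATED_TOTAL = 645
VALIDATED_INTERGENIC = 611
VALIDATED_EXON_TARGETED = 34
SEQUENCED_EVENTS = 343
UNIQUE_EVENTS = 322
ADDED_VARIATION_BP = 100_000  # "about 100 kb" treated as exact (assumption)


def summarize():
    """Return simple consistency checks derived from the quoted counts."""
    return {
        # Intergenic plus exon-targeted events should equal the validated total.
        "validated_parts_sum": VALIDATED_INTERGENIC + VALIDATED_EXON_TARGETED,
        # Sequenced events beyond the unique set (assumed to be redundant).
        "non_unique_sequenced_events": SEQUENCED_EVENTS - UNIQUE_EVENTS,
        # Rough mean contribution of each unique insertion, in base pairs.
        "mean_bp_per_unique_insertion": round(ADDED_VARIATION_BP / UNIQUE_EVENTS),
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value}")
```

Under these assumptions the mean comes to roughly 311 bp per unique insertion, which is in line with the approximately 300 bp length of a full-length Alu element.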

SUBMITTER: Konkel MK 

PROVIDER: S-EPMC4607524 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

Publications

Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project.

Konkel MK, Walker JA, Hotard AB, Ranck MC, Fontenot CC, Storer J, Stewart C, Marth GT, Batzer MA

Genome Biology and Evolution, 2015-08-29, 9

Similar Datasets

PRJEB56604 | ENA
S-EPMC3917988 | biostudies-literature
S-EPMC4022254 | biostudies-literature
S-EPMC3106317 | biostudies-literature
PRJNA28889 | ENA
S-EPMC4338501 | biostudies-literature
S-EPMC3340611 | biostudies-literature
S-EPMC2935084 | biostudies-literature
S-EPMC6696682 | biostudies-literature
GSE94043 | GEO | 2017-07-01