Dataset Information

High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes.


ABSTRACT: Using high-throughput sequencing, we devised a technique to determine the insertion sites of virtually all members of the human-specific L1 retrotransposon family in any human genome. Using diagnostic nucleotides, we were able to locate the approximately 800 L1Hs copies corresponding specifically to the pre-Ta, Ta-0, and Ta-1 L1Hs subfamilies, with over 90% of sequenced reads corresponding to human-specific elements. We find that any two individual genomes differ at an average of 285 sites with respect to L1 insertion presence or absence. In total, we assayed 25 individuals, 15 of which are unrelated, at 1139 sites, including 772 shared with the reference genome and 367 nonreference L1 insertions. We show that L1Hs profiles recapitulate genetic ancestry, and determine the chromosomal distribution of these elements. Using these data, we estimate that the rate of L1 retrotransposition in humans is between 1/95 and 1/270 births, and the number of dimorphic L1 elements in the human population with gene frequencies greater than 0.05 is between 3000 and 10,000.
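The abstract's pairwise comparison (two genomes differing at an average of 285 sites by L1 insertion presence or absence) can be illustrated with a minimal sketch. It assumes each genome is encoded as a list of 0/1 calls over the same ordered set of sites. The function names, sample names, and toy profiles below are hypothetical and are not taken from the dataset; only the site counts (772, 367, 1139) come from the abstract.

```python
from itertools import combinations

# Site counts quoted in the abstract.
REFERENCE_SITES = 772      # insertions shared with the reference genome
NONREFERENCE_SITES = 367   # nonreference L1 insertions
TOTAL_SITES = REFERENCE_SITES + NONREFERENCE_SITES  # 1139 assayed sites


def pairwise_differences(a, b):
    """Count sites where two presence/absence profiles disagree."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same sites")
    return sum(x != y for x, y in zip(a, b))


def mean_pairwise_differences(profiles):
    """Average the site differences over every unordered pair of genomes."""
    pairs = list(combinations(profiles.values(), 2))
    if not pairs:
        raise ValueError("need at least two profiles")
    return sum(pairwise_differences(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy profiles over six sites (1 = insertion present, 0 = absent).
toy_profiles = {
    "sample_A": [1, 1, 0, 1, 0, 0],
    "sample_B": [1, 0, 0, 1, 1, 0],
    "sample_C": [1, 1, 1, 0, 0, 0],
}

if __name__ == "__main__":
    print(TOTAL_SITES)
    print(mean_pairwise_differences(toy_profiles))
```

With real data, each profile would hold one call per assayed site, so every list would have 1139 entries rather than six.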

SUBMITTER: Ewing AD 

PROVIDER: S-EPMC2928504 | biostudies-literature | 2010 Sep

REPOSITORIES: biostudies-literature

Publications

High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes.

Ewing AD, Kazazian HH

Genome Research, 2010-05-20, issue 9



Similar Datasets

Date | Accession | Repository
- | PRJNA74807 | ENA
- | S-EPMC1458931 | biostudies-literature
- | S-EPMC3317159 | biostudies-literature
2012-01-25 | GSE31018 | GEO
- | S-EPMC4896128 | biostudies-literature
2012-01-25 | E-GEOD-31018 | biostudies-arrayexpress
- | S-EPMC6298018 | biostudies-literature
- | S-EPMC3012916 | biostudies-literature
- | S-EPMC2933874 | biostudies-other
- | S-EPMC2424287 | biostudies-literature