Unknown

Dataset Information

0

Alu elements in primates are preferentially lost from areas of high GC content.


ABSTRACT: The currently-accepted dogma when analysing human Alu transposable elements is that 'young' Alu elements are found in low GC regions and 'old' Alus in high GC regions. The correlation between high GC regions and high gene frequency regions make this observation particularly difficult to explain. Although a number of studies have tackled the problem, no analysis has definitively explained the reason for this trend. These observations have been made by relying on the subfamily as a proxy for age of an element. In this study, we suggest that this is a misleading assumption and instead analyse the relationship between the taxonomic distribution of an individual element and its surrounding GC environment. An analysis of 103906 Alu elements across 6 human chromosomes was carried out, using the presence of orthologous Alu elements in other primate species as a proxy for age. We show that the previously-reported effect of GC content correlating with subfamily age is not reflected by the ages of the individual elements. Instead, elements are preferentially lost from areas of high GC content over time. The correlation between GC content and subfamily may be due to a change in insertion bias in the young subfamilies. The link between Alu subfamily age and GC region was made due to an over-simplification of the data and is incorrect. We suggest that use of subfamilies as a proxy for age is inappropriate and that the analysis of ortholog presence in other primate species provides a deeper insight into the data.

SUBMITTER: Hellen EH 

PROVIDER: S-EPMC3661076 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Alu elements in primates are preferentially lost from areas of high GC content.

Hellen Elizabeth Hb EH   Brookfield John Fy JF  

PeerJ 20130521


The currently-accepted dogma when analysing human Alu transposable elements is that 'young' Alu elements are found in low GC regions and 'old' Alus in high GC regions. The correlation between high GC regions and high gene frequency regions make this observation particularly difficult to explain. Although a number of studies have tackled the problem, no analysis has definitively explained the reason for this trend. These observations have been made by relying on the subfamily as a proxy for age o  ...[more]

Similar Datasets

| S-EPMC6636035 | biostudies-literature
| S-EPMC1266357 | biostudies-literature
| S-EPMC240696 | biostudies-literature
2015-09-10 | E-GEOD-72877 | biostudies-arrayexpress
| S-EPMC5569700 | biostudies-literature
| S-EPMC8230782 | biostudies-literature
| S-EPMC5389847 | biostudies-literature
2015-09-10 | GSE72877 | GEO
| S-EPMC3378858 | biostudies-other
| S-EPMC3720884 | biostudies-literature