
Dataset Information


Repeat-encoded poly-Q tracts show statistical commonalities across species.


ABSTRACT:

BACKGROUND: Among repetitive genomic sequence, the class of tri-nucleotide repeats has received much attention due to their association with human diseases. Tri-nucleotide repeat diseases are caused by excessive sequence length variability; diseases such as Huntington's disease and Fragile X syndrome are tied to an increase in the number of repeat units in a tract. Motivated by the recent discovery of a tri-nucleotide repeat associated genetic defect in Arabidopsis thaliana, this study takes a cross-species approach to investigating these repeat tracts, with the goal of using commonalities between species to identify potential disease-related properties.

RESULTS: We find that statistical enrichment in regulatory function associations for coding region repeats - previously observed in human - is consistent across multiple organisms. By distinguishing between homo-amino acid tracts that are encoded by tri-nucleotide repeats, and those encoded by varying codons, we show that amino acid repeats - not tri-nucleotide repeats - fully explain these regulatory associations. Using this same separation between repeat- and non-repeat-encoded homo-amino acid tracts, we show that poly-glutamine tracts are disproportionately encoded by tri-nucleotide repeats, and those tracts that are encoded by tri-nucleotide repeats are also significantly longer; these results are consistent across multiple species.

CONCLUSION: These findings establish similarities in tri-nucleotide repeats across species at the level of protein functionality and protein sequence. The tendency of tri-nucleotide repeats to encode longer poly-glutamine tracts indicates a link with the poly-glutamine repeat diseases. The cross-species nature of this tendency suggests that unknown repeat diseases are yet to be uncovered in other species. Future discoveries of new non-human repeat associated defects may provide the breadth of information needed to unravel the mechanisms that underpin this class of human disease.

SUBMITTER: Willadsen K 

PROVIDER: S-EPMC3617014 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature


Publications

Repeat-encoded poly-Q tracts show statistical commonalities across species.

Willadsen K, Cao MD, Wiles J, Balasubramanian S, Bodén M

BMC Genomics, 2013-02-02



Similar Datasets

| S-EPMC5076566 | biostudies-other
| S-EPMC4512540 | biostudies-literature
| S-EPMC2673466 | biostudies-literature
| S-EPMC3488214 | biostudies-literature
| S-EPMC2701052 | biostudies-literature
| S-EPMC5935138 | biostudies-literature
| S-EPMC1219581 | biostudies-other
| S-EPMC4212969 | biostudies-literature
| S-EPMC4756613 | biostudies-literature
| S-EPMC7054307 | biostudies-literature