Unknown

Dataset Information

0

Evolution of Protein Domain Repeats in Metazoa.


ABSTRACT: Repeats are ubiquitous elements of proteins and they play important roles for cellular function and during evolution. Repeats are, however, also notoriously difficult to capture computationally and large scale studies so far had difficulties in linking genetic causes, structural properties and evolutionary trajectories of protein repeats. Here we apply recently developed methods for repeat detection and analysis to a large dataset comprising over hundred metazoan genomes. We find that repeats in larger protein families experience generally very few insertions or deletions (indels) of repeat units but there is also a significant fraction of noteworthy volatile outliers with very high indel rates. Analysis of structural data indicates that repeats with an open structure and independently folding units are more volatile and more likely to be intrinsically disordered. Such disordered repeats are also significantly enriched in sites with a high functional potential such as linear motifs. Furthermore, the most volatile repeats have a high sequence similarity between their units. Since many volatile repeats also show signs of recombination, we conclude they are often shaped by concerted evolution. Intriguingly, many of these conserved yet volatile repeats are involved in host-pathogen interactions where they might foster fast but subtle adaptation in biological arms races. KEY WORDS: protein evolution, domain rearrangements, protein repeats, concerted evolution.

SUBMITTER: Schuler A 

PROVIDER: S-EPMC5100051 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evolution of Protein Domain Repeats in Metazoa.

Schüler Andreas A   Bornberg-Bauer Erich E  

Molecular biology and evolution 20160926 12


Repeats are ubiquitous elements of proteins and they play important roles for cellular function and during evolution. Repeats are, however, also notoriously difficult to capture computationally and large scale studies so far had difficulties in linking genetic causes, structural properties and evolutionary trajectories of protein repeats. Here we apply recently developed methods for repeat detection and analysis to a large dataset comprising over hundred metazoan genomes. We find that repeats in  ...[more]

Similar Datasets

| S-EPMC1553488 | biostudies-literature
| S-EPMC5896119 | biostudies-literature
| S-EPMC5291267 | biostudies-literature
| S-EPMC8740685 | biostudies-literature
| S-EPMC5054710 | biostudies-literature
| S-EPMC1764030 | biostudies-literature
| S-EPMC6949934 | biostudies-literature
| S-EPMC3378862 | biostudies-literature
| S-EPMC3517346 | biostudies-literature
| S-EPMC8139856 | biostudies-literature