Unknown

Dataset Information

0

Detecting clusters of mutations.


ABSTRACT: Positive selection for protein function can lead to multiple mutations within a small stretch of DNA, i.e., to a cluster of mutations. Recently, Wagner proposed a method to detect such mutation clusters. His method, however, did not take into account that residues with high solvent accessibility are inherently more variable than residues with low solvent accessibility. Here, we propose a new algorithm to detect clustered evolution. Our algorithm controls for different substitution probabilities at buried and exposed sites in the tertiary protein structure, and uses random permutations to calculate accurate P values for inferred clusters. We apply the algorithm to genomes of bacteria, fly, and mammals, and find several clusters of mutations in functionally important regions of proteins. Surprisingly, clustered evolution is a relatively rare phenomenon. Only between 2% and 10% of the genes we analyze contain a statistically significant mutation cluster. We also find that not controlling for solvent accessibility leads to an excess of clusters in terminal and solvent-exposed regions of proteins. Our algorithm provides a novel method to identify functionally relevant divergence between groups of species. Moreover, it could also be useful to detect artifacts in automatically assembled genomes.

SUBMITTER: Zhou T 

PROVIDER: S-EPMC2582452 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting clusters of mutations.

Zhou Tong T   Enyeart Peter J PJ   Wilke Claus O CO  

PloS one 20081119 11


Positive selection for protein function can lead to multiple mutations within a small stretch of DNA, i.e., to a cluster of mutations. Recently, Wagner proposed a method to detect such mutation clusters. His method, however, did not take into account that residues with high solvent accessibility are inherently more variable than residues with low solvent accessibility. Here, we propose a new algorithm to detect clustered evolution. Our algorithm controls for different substitution probabilities  ...[more]

Similar Datasets

| S-EPMC6143237 | biostudies-literature
| S-EPMC1200270 | biostudies-literature
| S-EPMC7206268 | biostudies-literature
| S-EPMC4271547 | biostudies-literature
| S-EPMC9738027 | biostudies-literature
| S-EPMC6723724 | biostudies-literature
| S-EPMC5271318 | biostudies-literature
| S-EPMC8095228 | biostudies-literature
| S-EPMC4666841 | biostudies-literature
| S-EPMC8891961 | biostudies-literature