
Dataset Information


Detecting coevolution in and among protein domains.


ABSTRACT: Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporates phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We apply this model to a large-scale screening of the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins and protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest that sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level.

SUBMITTER: Yeang CH 

PROVIDER: S-EPMC2098842 | biostudies-literature | 2007 Nov

REPOSITORIES: biostudies-literature


Publications

Detecting coevolution in and among protein domains.

Chen-Hsiang Yeang, David Haussler

PLoS Computational Biology, 2007-09-18, issue 11



Similar Datasets

| S-EPMC4332441 | biostudies-literature
| S-EPMC3130660 | biostudies-literature
| S-EPMC10496359 | biostudies-literature
| S-EPMC5842911 | biostudies-literature
| S-EPMC2869012 | biostudies-literature
| S-EPMC7320627 | biostudies-literature
| S-EPMC7881620 | biostudies-literature
| S-EPMC2637866 | biostudies-other
| S-EPMC18803 | biostudies-literature
| S-EPMC9410901 | biostudies-literature