Unknown

Dataset Information

0

Sequence variation in ligand binding sites in proteins.


ABSTRACT: The recent explosion in the availability of complete genome sequences has led to the cataloging of tens of thousands of new proteins and putative proteins. Many of these proteins can be structurally or functionally categorized from sequence conservation alone. In contrast, little attention has been given to the meaning of poorly-conserved sites in families of proteins, which are typically assumed to be of little structural or functional importance.Recently, using statistical free energy analysis of tetratricopeptide repeat (TPR) domains, we observed that positions in contact with peptide ligands are more variable than surface positions in general. Here we show that statistical analysis of TPRs, ankyrin repeats, Cys2His2 zinc fingers and PDZ domains accurately identifies specificity-determining positions by their sequence variation. Sequence variation is measured as deviation from a neutral reference state, and we present probabilistic and information theory formalisms that improve upon recently suggested methods such as statistical free energies and sequence entropies.Sequence variation has been used to identify functionally-important residues in four selected protein families. With TPRs and ankyrin repeats, protein families that bind highly diverse ligands, the effect is so pronounced that sequence "hypervariation" alone can be used to predict ligand binding sites.

SUBMITTER: Magliery TJ 

PROVIDER: S-EPMC1261162 | biostudies-literature | 2005 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sequence variation in ligand binding sites in proteins.

Magliery Thomas J TJ   Regan Lynne L  

BMC bioinformatics 20050930


<h4>Background</h4>The recent explosion in the availability of complete genome sequences has led to the cataloging of tens of thousands of new proteins and putative proteins. Many of these proteins can be structurally or functionally categorized from sequence conservation alone. In contrast, little attention has been given to the meaning of poorly-conserved sites in families of proteins, which are typically assumed to be of little structural or functional importance.<h4>Results</h4>Recently, usi  ...[more]

Similar Datasets

| S-EPMC2639300 | biostudies-literature
| S-EPMC8604960 | biostudies-literature
| S-EPMC1534068 | biostudies-literature
| S-EPMC7193039 | biostudies-literature
| S-EPMC2279290 | biostudies-literature
| S-EPMC2966529 | biostudies-literature
| S-EPMC3026835 | biostudies-literature
| S-EPMC2777313 | biostudies-literature
| S-EPMC1524891 | biostudies-literature
| S-EPMC6651575 | biostudies-literature