Unknown

Dataset Information

0

Using structural motif templates to identify proteins with DNA binding function.


ABSTRACT: This work describes a method for predicting DNA binding function from structure using 3-dimensional templates. Proteins that bind DNA using small contiguous helix-turn-helix (HTH) motifs comprise a significant number of all DNA-binding proteins. A structural template library of seven HTH motifs has been created from non-homologous DNA-binding proteins in the Protein Data Bank. The templates were used to scan complete protein structures using an algorithm that calculated the root mean squared deviation (rmsd) for the optimal superposition of each template on each structure, based on C(alpha) backbone coordinates. Distributions of rmsd values for known HTH-containing proteins (true hits) and non-HTH proteins (false hits) were calculated. A threshold value of 1.6 A rmsd was selected that gave a true hit rate of 88.4% and a false positive rate of 0.7%. The false positive rate was further reduced to 0.5% by introducing an accessible surface area threshold value of 990 A2 per HTH motif. The template library and the validated thresholds were used to make predictions for target proteins from a structural genomics project.

SUBMITTER: Jones S 

PROVIDER: S-EPMC156721 | biostudies-literature | 2003 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using structural motif templates to identify proteins with DNA binding function.

Jones Susan S   Barker Jonathan A JA   Nobeli Irene I   Thornton Janet M JM  

Nucleic acids research 20030601 11


This work describes a method for predicting DNA binding function from structure using 3-dimensional templates. Proteins that bind DNA using small contiguous helix-turn-helix (HTH) motifs comprise a significant number of all DNA-binding proteins. A structural template library of seven HTH motifs has been created from non-homologous DNA-binding proteins in the Protein Data Bank. The templates were used to scan complete protein structures using an algorithm that calculated the root mean squared dev  ...[more]

Similar Datasets

| S-EPMC519102 | biostudies-literature
| S-EPMC2726711 | biostudies-literature
| S-EPMC5668250 | biostudies-literature
| S-EPMC3412806 | biostudies-literature
| S-EPMC1892084 | biostudies-literature
| S-EPMC8837382 | biostudies-literature
| S-EPMC5121330 | biostudies-literature
| S-EPMC6331220 | biostudies-literature
| S-EPMC6283420 | biostudies-literature
| S-EPMC3174226 | biostudies-literature