Unknown

Dataset Information

0

Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis.


ABSTRACT: Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue-amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein-DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties.

SUBMITTER: Jakubec D 

PROVIDER: S-EPMC4934765 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis.

Jakubec David D   Laskowski Roman A RA   Vondrasek Jiri J  

PloS one 20160706 7


Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost  ...[more]

Similar Datasets

2024-07-05 | GSE237017 | GEO
| PRJNA993300 | ENA
| S-EPMC47881 | biostudies-other
| S-EPMC5028900 | biostudies-other
| S-EPMC2926618 | biostudies-literature
| S-EPMC2887952 | biostudies-literature
| S-EPMC5515408 | biostudies-literature
| S-EPMC123107 | biostudies-literature
| S-EPMC1153215 | biostudies-other
| S-EPMC2755677 | biostudies-literature