Dataset Information

Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein-DNA binding.


ABSTRACT: Protein-DNA binding is a fundamental component of gene regulatory processes, but it is still not completely understood how proteins recognize their target sites in the genome. Besides hydrogen bonding in the major groove (base readout), proteins recognize minor-groove geometry using positively charged amino acids (shape readout). The underlying mechanism of DNA shape readout involves the correlation between minor-groove width and electrostatic potential (EP). To probe this biophysical effect directly, rather than using minor-groove width as an indirect measure for shape readout, we developed a methodology, DNAphi, for predicting EP in the minor groove and confirmed the direct role of EP in protein-DNA binding using massive sequencing data. The DNAphi method uses a sliding-window approach to mine results from non-linear Poisson-Boltzmann (NLPB) calculations on DNA structures derived from all-atom Monte Carlo simulations. We validated this approach, which only requires nucleotide sequence as input, based on direct comparison with NLPB calculations for available crystal structures. Using statistical machine-learning approaches, we showed that adding EP as a biophysical feature can improve the predictive power of quantitative binding specificity models across 27 transcription factor families. High-throughput prediction of EP offers a novel way to integrate biophysical and genomic studies of protein-DNA binding.
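The sliding-window idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the DNAphi implementation: the function name, the window length of 5, the "centered window" convention, and the lookup table are all assumptions made for illustration, and the table values are made-up placeholders rather than real electrostatic potentials. In the published method the table would presumably be mined from the NLPB calculations on simulated structures.

```python
from typing import Dict, List, Optional


def sliding_window_predict(
    sequence: str,
    table: Dict[str, float],
    k: int = 5,  # assumed window length; the abstract does not state it
) -> List[Optional[float]]:
    """Assign each position the table value of the k-mer centered on it.

    Positions too close to either end to have a full window, or whose
    window is missing from the table, are reported as None.
    """
    if k < 1 or k % 2 == 0:
        raise ValueError("k must be a positive odd integer")
    seq = sequence.upper()
    half = k // 2
    out: List[Optional[float]] = [None] * len(seq)
    # Slide the window one nucleotide at a time along the sequence.
    for center in range(half, len(seq) - half):
        window = seq[center - half : center + half + 1]
        out[center] = table.get(window)
    return out


# Hypothetical toy table: placeholder numbers, NOT real EP values.
toy_table = {"AAATT": -1.0, "AATTC": -2.0, "ATTCG": -3.0}
profile = sliding_window_predict("GAAATTCG", toy_table)
```

Because only the nucleotide sequence and a precomputed table are needed, a lookup of this kind runs in time linear in sequence length, which is what makes genome-wide prediction feasible.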

SUBMITTER: Chiu TP 

PROVIDER: S-EPMC5716191 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

Publications

Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein-DNA binding.

Tsu-Pei Chiu, Satyanarayan Rao, Richard S. Mann, Barry Honig, Remo Rohs

Nucleic Acids Research, 2017 Dec 1, issue 21


Similar Datasets

| S-EPMC3241897 | biostudies-literature
| S-EPMC2946858 | biostudies-literature
| S-EPMC5607081 | biostudies-literature
| S-EPMC1367283 | biostudies-literature
| S-EPMC6724015 | biostudies-literature
| S-EPMC3618862 | biostudies-literature
| S-EPMC3912452 | biostudies-literature
| S-EPMC3165004 | biostudies-literature
2011-08-01 | GSE30044 | GEO
| S-EPMC1518646 | biostudies-literature