Unknown

Dataset Information

0

Predicting Variation of DNA Shape Preferences in Protein-DNA Interaction in Cancer Cells with a New Biophysical Model.


ABSTRACT: DNA shape readout is an important mechanism of transcription factor target site recognition, in addition to the sequence readout. Several machine learning-based models of transcription factor-DNA interactions, considering DNA shape features, have been developed in recent years. Here, we present a new biophysical model of protein-DNA interactions by integrating the DNA shape properties. It is based on the neighbor dinucleotide dependency model BayesPI2, where new parameters are restricted to a subspace spanned by the dinucleotide form of DNA shape features. This allows a biophysical interpretation of the new parameters as a position-dependent preference towards specific DNA shape features. Using the new model, we explore the variation of DNA shape preferences in several transcription factors across various cancer cell lines and cellular conditions. The results reveal that there are DNA shape variations at FOXA1 (Forkhead Box Protein A1) binding sites in steroid-treated MCF7 cells. The new biophysical model is useful for elucidating the finer details of transcription factor-DNA interaction, as well as for predicting cancer mutation effects in the future.

SUBMITTER: Batmanov K 

PROVIDER: S-EPMC5615366 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting Variation of DNA Shape Preferences in Protein-DNA Interaction in Cancer Cells with a New Biophysical Model.

Batmanov Kirill K   Wang Junbai J  

Genes 20170918 9


DNA shape readout is an important mechanism of transcription factor target site recognition, in addition to the sequence readout. Several machine learning-based models of transcription factor-DNA interactions, considering DNA shape features, have been developed in recent years. Here, we present a new biophysical model of protein-DNA interactions by integrating the DNA shape properties. It is based on the neighbor dinucleotide dependency model BayesPI2, where new parameters are restricted to a su  ...[more]

Similar Datasets

| S-EPMC3966529 | biostudies-literature
| S-EPMC4574261 | biostudies-literature
| S-EPMC2758060 | biostudies-literature
| 2417295 | ecrin-mdr-crc
| S-EPMC6635840 | biostudies-literature
2010-06-26 | E-GEOD-11239 | biostudies-arrayexpress
| S-EPMC9820754 | biostudies-literature
| S-EPMC9976917 | biostudies-literature
| S-EPMC3379012 | biostudies-literature
| S-EPMC6368855 | biostudies-literature