Unknown

Dataset Information

0

A unified approach for quantifying and interpreting DNA shape readout by transcription factors.


ABSTRACT: Transcription factors (TFs) interpret DNA sequence by probing the chemical and structural properties of the nucleotide polymer. DNA shape is thought to enable a parsimonious representation of dependencies between nucleotide positions. Here, we propose a unified mathematical representation of the DNA sequence dependence of shape and TF binding, respectively, which simplifies and enhances analysis of shape readout. First, we demonstrate that linear models based on mononucleotide features alone account for 60-70% of the variance in minor groove width, roll, helix twist, and propeller twist. This explains why simple scoring matrices that ignore all dependencies between nucleotide positions can partially account for DNA shape readout by a TF Adding dinucleotide features as sequence-to-shape predictors to our model, we can almost perfectly explain the shape parameters. Building on this observation, we developed a post hoc analysis method that can be used to analyze any mechanism-agnostic protein-DNA binding model in terms of shape readout. Our insights provide an alternative strategy for using DNA shape information to enhance our understanding of how cis-regulatory codes are interpreted by the cellular machinery.

SUBMITTER: Rube HT 

PROVIDER: S-EPMC5822049 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

A unified approach for quantifying and interpreting DNA shape readout by transcription factors.

Rube H Tomas HT   Rastogi Chaitanya C   Kribelbauer Judith F JF   Bussemaker Harmen J HJ  

Molecular systems biology 20180222 2


Transcription factors (TFs) interpret DNA sequence by probing the chemical and structural properties of the nucleotide polymer. DNA shape is thought to enable a parsimonious representation of dependencies between nucleotide positions. Here, we propose a unified mathematical representation of the DNA sequence dependence of shape and TF binding, respectively, which simplifies and enhances analysis of shape readout. First, we demonstrate that linear models based on mononucleotide features alone acc  ...[more]

Similar Datasets

| S-EPMC7607496 | biostudies-literature
| S-EPMC5327724 | biostudies-literature
| S-EPMC6961349 | biostudies-literature
| S-EPMC4112085 | biostudies-literature
| S-EPMC3874178 | biostudies-literature
| S-EPMC7197192 | biostudies-literature
| S-EPMC6387939 | biostudies-literature
| S-EPMC6368855 | biostudies-literature
| S-EPMC7476910 | biostudies-literature