Dataset Information

Statistical modeling of transcription factor binding affinities predicts regulatory interactions.

ABSTRACT: Recent experimental and theoretical efforts have highlighted the fact that binding of transcription factors to DNA can be more accurately described by continuous measures of their binding affinities, rather than a discrete description in terms of binding sites. While the binding affinities can be predicted from a physical model, it is often desirable to know the distribution of binding affinities for specific sequence backgrounds. In this paper, we present a statistical approach to derive the exact distribution for sequence models with fixed GC content. We demonstrate that the affinity distribution of almost all known transcription factors can be effectively parametrized by a class of generalized extreme value distributions. Moreover, this parameterization also describes the affinity distribution for sequence backgrounds with variable GC content, such as human promoter sequences. Our approach is applicable to arbitrary sequences and all transcription factors with known binding preferences that can be described in terms of a motif matrix. The statistical treatment also provides a proper framework to directly compare transcription factors with very different affinity distributions. This is illustrated by our analysis of human promoters with known binding sites, for many of which we could identify the known regulators as those with the highest affinity. The combination of physical model and statistical normalization provides a quantitative measure which ranks transcription factors for a given sequence, and which can be compared directly with large-scale binding data. Its successful application to human promoter sequences serves as an encouraging example of how the method can be applied to other sequences.

SUBMITTER: Manke T

PROVIDER: S-EPMC2266803 | biostudies-literature | 2008 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Statistical modeling of transcription factor binding affinities predicts regulatory interactions.

Manke Thomas T Roider Helge G HG Vingron Martin M

PLoS computational biology 20080321 3

Recent experimental and theoretical efforts have highlighted the fact that binding of transcription factors to DNA can be more accurately described by continuous measures of their binding affinities, rather than a discrete description in terms of binding sites. While the binding affinities can be predicted from a physical model, it is often desirable to know the distribution of binding affinities for specific sequence backgrounds. In this paper, we present a statistical approach to derive the ex ...[more]

PMID: 18369429

Dataset Information

Statistical modeling of transcription factor binding affinities predicts regulatory interactions.

Publications

Statistical modeling of transcription factor binding affinities predicts regulatory interactions.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Understanding variation in transcription factor binding by modeling transcription factor genome-epigenome interactions.
| S-EPMC3854512 | biostudies-literature

Transcription Factor Binding Affinities and DNA Shape Readout.
| S-EPMC7607496 | biostudies-literature

GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding.
| S-EPMC5860000 | biostudies-literature

Internal regulatory interactions determine DNA binding specificity by a Hox transcription factor.
| S-EPMC2739810 | biostudies-literature

Statistical tests for natural selection on regulatory regions based on the strength of transcription factor binding sites.
| S-EPMC2800119 | biostudies-literature

Genomic promoter analysis predicts functional transcription factor binding.
| S-EPMC2768302 | biostudies-literature

True equilibrium measurement of transcription factor-DNA binding affinities using automated polarization microscopy.
| S-EPMC5913336 | biostudies-literature

Integrated microfluidic approach for quantitative high-throughput measurements of transcription factor binding affinities.
| S-EPMC4824076 | biostudies-literature

A molecular mechanics approach to modeling protein-ligand interactions: relative binding affinities in congeneric series.
| S-EPMC3183355 | biostudies-literature

Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction.
| S-EPMC5224477 | biostudies-literature