Unknown

Dataset Information

0

Accurate and sensitive quantification of protein-DNA binding affinity.


ABSTRACT: Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes.

SUBMITTER: Rastogi C 

PROVIDER: S-EPMC5910815 | biostudies-literature | 2018 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accurate and sensitive quantification of protein-DNA binding affinity.

Rastogi Chaitanya C   Rube H Tomas HT   Kribelbauer Judith F JF   Crocker Justin J   Loker Ryan E RE   Martini Gabriella D GD   Laptenko Oleg O   Freed-Pastor William A WA   Prives Carol C   Stern David L DL   Mann Richard S RS   Bussemaker Harmen J HJ  

Proceedings of the National Academy of Sciences of the United States of America 20180402 16


Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NR  ...[more]

Similar Datasets

| S-EPMC6311686 | biostudies-literature
| S-EPMC4838337 | biostudies-literature
| S-EPMC10777193 | biostudies-literature
| S-EPMC5934639 | biostudies-literature
| S-EPMC10157843 | biostudies-literature
| S-EPMC10724026 | biostudies-literature
| S-EPMC5346138 | biostudies-literature
| S-EPMC1369289 | biostudies-literature
| S-EPMC4827127 | biostudies-literature
| S-EPMC4137368 | biostudies-literature