Unknown

Dataset Information

0

Comprehensive genome-wide transcription factor analysis reveals that a combination of high affinity and low affinity DNA binding is needed for human gene regulation.


ABSTRACT:

Background

High-throughput in vivo protein-DNA interaction experiments are currently widely used in gene regulation studies. Hitherto, comprehensive data analysis remains a challenge and for that reason most computational methods only consider the top few hundred or thousand strongest protein binding sites whereas weak protein binding sites are completely ignored.

Results

A new biophysical model of protein-DNA interactions, BayesPI2+, was developed to address the above-mentioned challenges. BayesPI2+ can be run in either a serial computation model or a parallel ensemble learning framework. BayesPI2+ allowed us to analyze all binding sites of the transcription factors, including weak binding that cannot be analyzed by other models. It is evaluated in both synthetic and real in vivo protein-DNA binding experiments. Analysing ESR1 and SPIB in breast carcinoma and activated B cell-like diffuse large B-cell lymphoma cell lines, respectively, revealed that the concerted binding to high and low affinity sites correlates best with gene expression.

Conclusions

BayesPI2+ allows us to analyze transcription factor binding on a larger scale than hitherto achieved. By this analysis, we were able to demonstrate that genes are regulated by concerted binding to high and low affinity binding sites. The program and output results are publicly available at: http://folk.uio.no/junbaiw/BayesPI2Plus.

SUBMITTER: Wang J 

PROVIDER: S-EPMC4474539 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Comprehensive genome-wide transcription factor analysis reveals that a combination of high affinity and low affinity DNA binding is needed for human gene regulation.

Wang Junbai J   Malecka Agnieszka A   Trøen Gunhild G   Delabie Jan J  

BMC genomics 20150611


<h4>Background</h4>High-throughput in vivo protein-DNA interaction experiments are currently widely used in gene regulation studies. Hitherto, comprehensive data analysis remains a challenge and for that reason most computational methods only consider the top few hundred or thousand strongest protein binding sites whereas weak protein binding sites are completely ignored.<h4>Results</h4>A new biophysical model of protein-DNA interactions, BayesPI2+, was developed to address the above-mentioned c  ...[more]

Similar Datasets

| S-EPMC5468674 | biostudies-literature
2023-02-10 | GSE207640 | GEO
| S-EPMC6787930 | biostudies-literature
2016-12-01 | GSE89865 | GEO
| PRJNA856492 | ENA
| S-EPMC8216454 | biostudies-literature
| S-EPMC2848887 | biostudies-literature
| S-EPMC5737466 | biostudies-literature
| S-EPMC5695909 | biostudies-literature
| S-EPMC3415389 | biostudies-literature