Dataset Information

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas.

ABSTRACT: BACKGROUND:The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of new prediction methods that are based on protein binding data, but do not rely on these models may improve prediction sensitivity and specificity. RESULTS:We propose a method for predicting transcription factor binding sites in the genome by directly mapping data generated from protein binding microarrays (PBM) to the genome and calculating a moving average of several overlapping octamers. Using this unique algorithm, we predicted binding sites for the essential pancreatic islet transcription factor Nkx2.2 in the mouse genome and confirmed >90% of the tested sites by EMSA and ChIP. Scores generated from this method more accurately predicted relative binding affinity than PWM based methods. We have also identified an alternative core sequence recognized by the Nkx2.2 homeodomain. Furthermore, we have shown that this method correctly identified binding sites in the promoters of two critical pancreatic islet ?-cell genes, NeuroD1 and insulin2, that were not predicted by traditional methods. Finally, we show evidence that the algorithm can also be applied to predict binding sites for the nuclear receptor Hnf4?. CONCLUSIONS:PBM-mapping is an accurate method for predicting Nkx2.2 binding sites and may be widely applicable for the creation of genome-wide maps of transcription factor binding sites.

SUBMITTER: Hill JT

PROVIDER: S-EPMC3050729 | biostudies-literature | 2011 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas.

Hill Jonathon T JT Anderson Keith R KR Mastracci Teresa L TL Kaestner Klaus H KH Sussel Lori L

BMC bioinformatics 20110225

<h4>Background</h4>The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of new prediction methods that are based on protein binding data, but do not rely on these models may improve prediction sensitivity and specificity.<h4>Results</h4>We propose a method for predictin ...[more]

PMID: 21352540

Dataset Information

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas.

Publications

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Identification of known and novel pancreas genes expressed downstream of Nkx2.2 during development.
| S-EPMC2799404 | biostudies-literature

Nkx2.2:Cre knock-in mouse line: a novel tool for pancreas- and CNS-specific gene deletion.
| S-EPMC3902634 | biostudies-literature

Meta-analysis of exome array data identifies six novel genetic loci for lung function.
| S-EPMC6081985 | biostudies-literature

Combined Analysis of SNP Array Data Identifies Novel CNV Candidates and Pathways in Ependymoma and Mesothelioma.
| S-EPMC4491549 | biostudies-literature

Computational Analysis of Cholangiocarcinoma Phosphoproteomes Identifies Patient-Specific Drug Targets.
| S-EPMC9397618 | biostudies-literature

AI-driven drug repurposing and binding pose meta dynamics identifies novel targets for monkeypox virus.
| S-EPMC10014505 | biostudies-literature

Computational analysis of data from a genome-wide screening identifies new <i>PARP1</i> functional interactors as potential therapeutic targets.
| S-EPMC6505629 | biostudies-literature

Data mining identifies novel RNA-binding proteins involved in colon and rectal carcinomas.
| S-EPMC10293682 | biostudies-literature

Immunochip SNP array identifies novel genetic variants conferring susceptibility to candidaemia.
| S-EPMC5142444 | biostudies-literature

A computational study of the Warburg effect identifies metabolic targets inhibiting cancer migration.
| S-EPMC4299514 | biostudies-literature