Unknown

Dataset Information

0

Evaluating tools for transcription factor binding site prediction.


ABSTRACT:

Background

Binding of transcription factors to transcription factor binding sites (TFBSs) is key to the mediation of transcriptional regulation. Information on experimentally validated functional TFBSs is limited and consequently there is a need for accurate prediction of TFBSs for gene annotation and in applications such as evaluating the effects of single nucleotide variations in causing disease. TFBSs are generally recognized by scanning a position weight matrix (PWM) against DNA using one of a number of available computer programs. Thus we set out to evaluate the best tools that can be used locally (and are therefore suitable for large-scale analyses) for creating PWMs from high-throughput ChIP-Seq data and for scanning them against DNA.

Results

We evaluated a set of de novo motif discovery tools that could be downloaded and installed locally using ENCODE-ChIP-Seq data and showed that rGADEM was the best-performing tool. TFBS prediction tools used to scan PWMs against DNA fall into two classes - those that predict individual TFBSs and those that identify clusters. Our evaluation showed that FIMO and MCAST performed best respectively.

Conclusions

Selection of the best-performing tools for generating PWMs from ChIP-Seq data and for scanning PWMs against DNA has the potential to improve prediction of precise transcription factor binding sites within regions identified by ChIP-Seq experiments for gene finding, understanding regulation and in evaluating the effects of single nucleotide variations in causing disease.

SUBMITTER: Jayaram N 

PROVIDER: S-EPMC6889335 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evaluating tools for transcription factor binding site prediction.

Jayaram Narayan N   Usvyat Daniel D   R Martin Andrew C AC  

BMC bioinformatics 20161102 1


<h4>Background</h4>Binding of transcription factors to transcription factor binding sites (TFBSs) is key to the mediation of transcriptional regulation. Information on experimentally validated functional TFBSs is limited and consequently there is a need for accurate prediction of TFBSs for gene annotation and in applications such as evaluating the effects of single nucleotide variations in causing disease. TFBSs are generally recognized by scanning a position weight matrix (PWM) against DNA usin  ...[more]

Similar Datasets

| S-EPMC1891680 | biostudies-literature
| S-EPMC3764009 | biostudies-literature
| S-EPMC2845628 | biostudies-literature
| S-EPMC6037060 | biostudies-literature
| S-EPMC7414127 | biostudies-literature
| S-EPMC1866359 | biostudies-literature
| S-EPMC6237759 | biostudies-literature
| S-EPMC4636380 | biostudies-literature