Unknown

Dataset Information

0

MCAST: scanning for cis-regulatory motif clusters.


ABSTRACT: Precise regulatory control of genes, particularly in eukaryotes, frequently requires the joint action of multiple sequence-specific transcription factors. A cis-regulatory module (CRM) is a genomic locus that is responsible for gene regulation and that contains multiple transcription factor binding sites in close proximity. Given a collection of known transcription factor binding motifs, many bioinformatics methods have been proposed over the past 15 years for identifying within a genomic sequence candidate CRMs consisting of clusters of those motifs.The MCAST algorithm uses a hidden Markov model with a P-value-based scoring scheme to identify candidate CRMs. Here, we introduce a new version of MCAST that offers improved graphical output, a dynamic background model, statistical confidence estimates based on false discovery rate estimation and, most significantly, the ability to predict CRMs while taking into account epigenomic data such as DNase I sensitivity or histone modification data. We demonstrate the validity of MCAST's statistical confidence estimates and the utility of epigenomic priors in identifying CRMs.MCAST is part of the MEME Suite software toolkit. A web server and source code are available at http://meme-suite.org and http://alternate.meme-suite.orgt.bailey@imb.uq.edu.au or william-noble@uw.eduSupplementary data are available at Bioinformatics online.

SUBMITTER: Grant CE 

PROVIDER: S-EPMC4907379 | biostudies-other | 2016 Apr

REPOSITORIES: biostudies-other

altmetric image

Publications

MCAST: scanning for cis-regulatory motif clusters.

Grant Charles E CE   Johnson James J   Bailey Timothy L TL   Noble William Stafford WS  

Bioinformatics (Oxford, England) 20151224 8


<h4>Unlabelled</h4>Precise regulatory control of genes, particularly in eukaryotes, frequently requires the joint action of multiple sequence-specific transcription factors. A cis-regulatory module (CRM) is a genomic locus that is responsible for gene regulation and that contains multiple transcription factor binding sites in close proximity. Given a collection of known transcription factor binding motifs, many bioinformatics methods have been proposed over the past 15 years for identifying with  ...[more]

Similar Datasets

| S-EPMC4792141 | biostudies-literature
| S-EPMC6771399 | biostudies-literature
| S-EPMC2768654 | biostudies-literature
| S-EPMC3142959 | biostudies-literature
| S-EPMC8769700 | biostudies-literature
| S-EPMC5780765 | biostudies-literature
| S-EPMC10772951 | biostudies-literature
| S-EPMC9177858 | biostudies-literature
| S-EPMC2174486 | biostudies-literature
| S-EPMC2655695 | biostudies-literature