Dataset Information

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.

ABSTRACT:

Background

Finding functional regulatory elements in DNA sequences is a very important problem in computational biology and providing a reliable algorithm for this task would be a major step towards understanding regulatory mechanisms on genome-wide scale. Major obstacles in this respect are that the fact that the amount of non-coding DNA is vast, and that the methods for predicting functional transcription factor binding sites tend to produce results with a high percentage of false positives. This makes the problem of finding regions significantly enriched in binding sites difficult.

Results

We develop a novel method for predicting regulatory regions in DNA sequences, which is designed to exploit the evolutionary conservation of regulatory elements between species without assuming that the order of motifs is preserved across species. We have implemented our method and tested its predictive abilities on various datasets from different organisms.

Conclusion

We show that our approach enables us to find a majority of the known CRMs using only sequence information from different species together with currently publicly available motif data. Also, our method is robust enough to perform well in predicting CRMs, despite differences in tissue specificity and even across species, provided that the evolutionary distances between compared species do not change substantially. The complexity of the proposed algorithm is polynomial, and the observed running times show that it may be readily applied.

SUBMITTER: Wilczynski B

PROVIDER: S-EPMC2669485 | biostudies-literature | 2009 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.

Wilczynski Bartek B Dojer Norbert N Patelak Mateusz M Tiuryn Jerzy J

BMC bioinformatics 20090310

<h4>Background</h4>Finding functional regulatory elements in DNA sequences is a very important problem in computational biology and providing a reliable algorithm for this task would be a major step towards understanding regulatory mechanisms on genome-wide scale. Major obstacles in this respect are that the fact that the amount of non-coding DNA is vast, and that the methods for predicting functional transcription factor binding sites tend to produce results with a high percentage of false posi ...[more]

PMID: 19284541

Dataset Information

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.

Background

Results

Conclusion

Publications

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Genomic approaches towards finding cis-regulatory modules in animals.
| S-EPMC3541939 | biostudies-literature

Conserved Motifs and Prediction of Regulatory Modules in Caenorhabditis elegans.
| S-EPMC3337475 | biostudies-literature

Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation.
| S-EPMC4041412 | biostudies-literature

Computational discovery of cis-regulatory modules in Drosophila without prior knowledge of motifs.
| S-EPMC2395258 | biostudies-literature

Network discovery pipeline elucidates conserved time-of-day-specific cis-regulatory modules.
| S-EPMC2222925 | biostudies-literature

Identification of an evolutionarily conserved cis-regulatory element controlling the Peg3 imprinted domain.
| S-EPMC3769284 | biostudies-literature

MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs.
| S-EPMC2490743 | biostudies-literature

WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences.
| S-EPMC1803799 | biostudies-literature

Conserved Cis-Regulatory Modules Control Robustness in Msx1 Expression at Single-Cell Resolution.
| S-EPMC4607535 | biostudies-literature

Motifs and cis-regulatory modules mediating the expression of genes co-expressed in presynaptic neurons.
| S-EPMC2728526 | biostudies-literature