Unknown

Dataset Information

0

The effect of orthology and coregulation on detecting regulatory motifs.


ABSTRACT:

Background

Computational de novo discovery of transcription factor binding sites is still a challenging problem. The growing number of sequenced genomes allows integrating orthology evidence with coregulation information when searching for motifs. Moreover, the more advanced motif detection algorithms explicitly model the phylogenetic relatedness between the orthologous input sequences and thus should be well adapted towards using orthologous information. In this study, we evaluated the conditions under which complementing coregulation with orthologous information improves motif detection for the class of probabilistic motif detection algorithms with an explicit evolutionary model.

Methodology

We designed datasets (real and synthetic) covering different degrees of coregulation and orthologous information to test how well Phylogibbs and Phylogenetic sampler, as representatives of the motif detection algorithms with evolutionary model performed as compared to MEME, a more classical motif detection algorithm that treats orthologs independently.

Results and conclusions

Under certain conditions detecting motifs in the combined coregulation-orthology space is indeed more efficient than using each space separately, but this is not always the case. Moreover, the difference in success rate between the advanced algorithms and MEME is still marginal. The success rate of motif detection depends on the complex interplay between the added information and the specificities of the applied algorithms. Insights in this relation provide information useful to both developers and users. All benchmark datasets are available at http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Storms_Valerie_PlosONE.

SUBMITTER: Storms V 

PROVIDER: S-EPMC2815771 | biostudies-literature | 2010 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

The effect of orthology and coregulation on detecting regulatory motifs.

Storms Valerie V   Claeys Marleen M   Sanchez Aminael A   De Moor Bart B   Verstuyf Annemieke A   Marchal Kathleen K  

PloS one 20100203 2


<h4>Background</h4>Computational de novo discovery of transcription factor binding sites is still a challenging problem. The growing number of sequenced genomes allows integrating orthology evidence with coregulation information when searching for motifs. Moreover, the more advanced motif detection algorithms explicitly model the phylogenetic relatedness between the orthologous input sequences and thus should be well adapted towards using orthologous information. In this study, we evaluated the  ...[more]

Similar Datasets

| S-EPMC2930411 | biostudies-literature
| S-EPMC463320 | biostudies-literature
| S-EPMC1931588 | biostudies-literature
| S-EPMC5428484 | biostudies-literature
| S-EPMC3290541 | biostudies-literature
| S-EPMC3044298 | biostudies-literature
| S-EPMC3314571 | biostudies-literature
| S-EPMC4168715 | biostudies-literature
| S-EPMC3391214 | biostudies-literature
| S-EPMC5870558 | biostudies-literature