Dataset Information

MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs.


ABSTRACT: The identification of cis-regulatory modules (CRMs) can greatly advance our understanding of eukaryotic regulatory mechanism. Current methods to predict CRMs from known motifs either depend on multiple alignments or can only deal with a small number of known motifs provided by users. These methods are problematic when binding sites are not well aligned in multiple alignments or when the number of input known motifs is large. We thus developed a new CRM identification method MOPAT (motif pair tree), which identifies CRMs through the identification of motif modules, groups of motifs co-occurring in multiple CRMs. It can identify 'orthologous' CRMs without multiple alignments. It can also find CRMs given a large number of known motifs. We have applied this method to mouse developmental genes, and have evaluated the predicted CRMs and motif modules by microarray expression data and known interacting motif pairs. We show that the expression profiles of the genes containing CRMs of the same motif module correlate significantly better than those of a random set of genes do. We also show that the known interacting motif pairs are significantly included in our predictions. Compared with several current methods, our method shows better performance in identifying meaningful CRMs.
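The expression-based evaluation described in the abstract (genes sharing CRMs of the same motif module should be better co-expressed than a random gene set) can be illustrated with a small permutation test. This is only a sketch under stated assumptions: the abstract does not specify the statistic or the test, so the use of mean pairwise Pearson correlation, the empirical p-value, and all names (`pearson`, `mean_pairwise_correlation`, `empirical_p_value`, the `expression` dictionary layout) are hypothetical choices for illustration, not the authors' implementation.

```python
import random
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # assumption: a constant profile counts as uncorrelated
    return cov / sqrt(vx * vy)


def mean_pairwise_correlation(genes, expression):
    """Average Pearson correlation over all gene pairs in `genes`."""
    pairs = list(combinations(genes, 2))
    if not pairs:
        raise ValueError("need at least two genes")
    return sum(pearson(expression[a], expression[b]) for a, b in pairs) / len(pairs)


def empirical_p_value(module_genes, expression, n_random=1000, seed=0):
    """Compare a module's co-expression with random gene sets of equal size.

    Returns (observed score, empirical p-value). The +1 terms keep the
    p-value strictly positive for a finite number of random draws.
    """
    rng = random.Random(seed)
    observed = mean_pairwise_correlation(module_genes, expression)
    all_genes = sorted(expression)
    hits = 0
    for _ in range(n_random):
        sample = rng.sample(all_genes, len(module_genes))
        if mean_pairwise_correlation(sample, expression) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_random + 1)
```

Here `expression` is assumed to map each gene name to a list of microarray measurements of equal length, and `module_genes` to be the genes whose predicted CRMs contain the same motif module. A small p-value would indicate that the module's genes are more tightly co-expressed than random sets of the same size.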

SUBMITTER: Hu J 

PROVIDER: S-EPMC2490743 | biostudies-literature | 2008 Aug

REPOSITORIES: biostudies-literature

Publications

MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs.

Jianfei Hu, Haiyan Hu, Xiaoman Li

Nucleic Acids Research, 2008-07-07, issue 13


Similar Datasets

| S-EPMC1291357 | biostudies-literature
| S-EPMC4041412 | biostudies-literature
| S-EPMC2669485 | biostudies-literature
| S-EPMC2395258 | biostudies-literature
| S-EPMC6075022 | biostudies-literature
| S-EPMC2728526 | biostudies-literature
| S-EPMC2930411 | biostudies-literature
| S-EPMC3359238 | biostudies-literature
| S-EPMC3424583 | biostudies-literature
| S-EPMC1796902 | biostudies-literature