
Dataset Information


Heterodimeric DNA motif synthesis and validations.


ABSTRACT: DNA motifs (i.e. transcription factor binding sites) are bound by transcription factors and are prevalent and important for gene regulation in different tissues at different developmental stages of eukaryotes. Although considerable effort has been devoted to elucidating monomeric DNA motif patterns, our knowledge of heterodimeric DNA motifs is still far from complete. Therefore, we propose a computational approach to synthesize a heterodimeric DNA motif from two monomeric DNA motifs. The approach is divided into two sequential components (Phases A and B). In Phase A, we develop inference models of how two monomeric DNA motifs can be oriented and overlapped with each other at the nucleotide level. In Phase B, given the two oriented monomeric DNA motifs, we develop DNA-binding family-specific input-output hidden Markov models (IOHMMs) to synthesize a heterodimeric DNA motif. To validate the approach, we execute and cross-validate it on 618 experimentally verified heterodimeric DNA motifs across 49 DNA-binding family combinations. We observe that our approach can even "rescue" an existing heterodimeric DNA motif pattern (i.e. HOXB2_EOMES) previously published in Nature. Lastly, we apply the proposed approach to infer previously uncharacterized heterodimeric motifs. Their motif instances are supported by DNase accessibility, gene ontology, protein-protein interactions, in vivo ChIP-seq peaks, and even structural data from PDB. A public web server has been built for open accessibility and scientific impact at http://motif.cs.cityu.edu.hk/custom/MotifKirin.
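To make the Phase A search space concrete, the following is a minimal sketch of how candidate orientations and overlaps of two monomeric motifs could be enumerated. It is not the authors' inference model and it does not implement the IOHMMs of Phase B. It assumes each motif is a position frequency matrix stored as a list of [A, C, G, T] columns, and it merges overlapping columns by simple averaging. The function names, the `max_overlap` parameter, and the averaging rule are all hypothetical choices made for illustration.

```python
from itertools import product


def reverse_complement(pfm):
    """Reverse-complement a motif given as a list of [A, C, G, T] columns."""
    # Reversing the column order flips the strand direction; reversing each
    # column swaps A<->T and C<->G because the base order is A, C, G, T.
    return [col[::-1] for col in reversed(pfm)]


def combine(left, right, overlap):
    """Place `right` after `left`, averaging the `overlap` shared columns.

    Averaging is an illustrative assumption, not the published method.
    """
    if not 0 <= overlap <= min(len(left), len(right)):
        raise ValueError("overlap out of range")
    split = len(left) - overlap
    shared = [
        [(a + b) / 2 for a, b in zip(lcol, rcol)]
        for lcol, rcol in zip(left[split:], right[:overlap])
    ]
    return left[:split] + shared + right[overlap:]


def candidate_heterodimers(m1, m2, max_overlap=2):
    """Yield (orientation, overlap, combined motif) for every candidate."""
    orientations = {"+": lambda m: m, "-": reverse_complement}
    for (s1, f1), (s2, f2) in product(orientations.items(), repeat=2):
        for overlap in range(max_overlap + 1):
            yield s1 + s2, overlap, combine(f1(m1), f2(m2), overlap)


if __name__ == "__main__":
    # Toy one-hot motifs: "AC" and "GT" (hypothetical data).
    m1 = [[1, 0, 0, 0], [0, 1, 0, 0]]
    m2 = [[0, 0, 1, 0], [0, 0, 0, 1]]
    for label, overlap, motif in candidate_heterodimers(m1, m2, max_overlap=1):
        print(label, overlap, len(motif))
```

In this sketch every candidate is simply listed; a real inference model would score or predict which orientation and overlap to use for a given pair of DNA-binding families.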

SUBMITTER: Wong KC 

PROVIDER: S-EPMC6393289 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature


Publications

Heterodimeric DNA motif synthesis and validations.

Wong KC, Lin J, Li X, Lin Q, Liang C, Song YQ

Nucleic Acids Research, 2019-02-01, issue 4



Similar Datasets

| PRJEB29483 | ENA
| S-EPMC1971073 | biostudies-literature
| EGAS00001003409 | EGA
| S-EPMC5762248 | biostudies-literature
| S-EPMC24359 | biostudies-literature
| S-EPMC4019886 | biostudies-other
| MSV000088175 | MassIVE
| S-EPMC2693225 | biostudies-literature
| S-EPMC2180231 | biostudies-literature
| S-EPMC3763557 | biostudies-literature