Unknown

Dataset Information

0

GSMC: Combining Parallel Gibbs Sampling with Maximal Cliques for Hunting DNA Motif.


ABSTRACT: Regulatory elements are responsible for regulating gene transcription. Therefore, identification of these elements is a tremendous challenge in the field of gene expression. Transcription factors (TFs) play a key role in gene regulation by binding to target promoter sequences. A set of conserved sequence patterns with a highly similar structure that is bound by a TF is called a motif. Motif discovery has been a difficult problem over the past decades. Meanwhile, it is a foundation stone in meeting this challenge. Recent advances in obtaining genomic sequences and high-throughput gene expression analysis techniques have enabled the rapid development of computational methods for motif discovery. As a result, a large number of motif-finding algorithms aiming at various motif models have sprung up in the past few years. However, most of them are not suitable for analysis of the large data sets generated by next-generation sequencing. To better handle large-scale ChIP-Seq data and achieve better performance in computational time and motif detection accuracy, we propose an excellent motif-finding algorithm known as GSMC (Combining Parallel Gibbs Sampling with Maximal Cliques for hunting DNA Motif). The GSMC algorithm consists of two steps. First, we employ the commonly used Gibbs sampling to generating initial motifs. Second, we utilize maximal cliques to cluster motifs according to Similarity with Position Information Contents (SPIC). Consequently, we raise the detection accuracy in a great degree, in the meantime holding comparative computation efficiency. In addition, we can find much more credible cofactor interacting motifs.

SUBMITTER: Pei C 

PROVIDER: S-EPMC5749607 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

GSMC: Combining Parallel Gibbs Sampling with Maximal Cliques for Hunting DNA Motif.

Pei Chao C   Wang Shu-Lin SL   Fang Jianwen J   Zhang Wei W  

Journal of computational biology : a journal of computational molecular cell biology 20171108 12


Regulatory elements are responsible for regulating gene transcription. Therefore, identification of these elements is a tremendous challenge in the field of gene expression. Transcription factors (TFs) play a key role in gene regulation by binding to target promoter sequences. A set of conserved sequence patterns with a highly similar structure that is bound by a TF is called a motif. Motif discovery has been a difficult problem over the past decades. Meanwhile, it is a foundation stone in meeti  ...[more]

Similar Datasets

| S-EPMC1309704 | biostudies-literature
| S-EPMC1234247 | biostudies-literature
| S-EPMC5344788 | biostudies-literature
| S-EPMC5634954 | biostudies-literature
| S-EPMC7530206 | biostudies-literature
| S-EPMC7076190 | biostudies-literature
| S-EPMC3371830 | biostudies-literature
| S-EPMC1599759 | biostudies-literature
| S-EPMC2703005 | biostudies-literature
| S-EPMC2951085 | biostudies-literature