Unknown

Dataset Information

0

Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data.


ABSTRACT: Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (?800) with no high-throughput-derived binding motifs. Computational methods capable of associating known motifs to such TFs will avoid tremendous experimental efforts and enable deeper understanding of transcriptional regulatory functions. We present a method to associate known motifs to TFs (MATLAB code is available in Supplementary Materials). Our method is based on a probabilistic framework that not only exploits DNA-binding domains and specificities, but also integrates open chromatin, gene expression and genomic data to accurately infer monomeric and homodimeric binding motifs. Our analysis resulted in the assignment of motifs to 200 TFs with no SELEX-derived motifs, roughly a 50% increase compared to the existing coverage.

SUBMITTER: Zamanighomi M 

PROVIDER: S-EPMC5449588 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data.

Zamanighomi Mahdi M   Lin Zhixiang Z   Wang Yong Y   Jiang Rui R   Wong Wing Hung WH  

Nucleic acids research 20170601 10


Transcription factors (TFs) play crucial roles in regulating gene expression through interactions with specific DNA sequences. Recently, the sequence motif of almost 400 human TFs have been identified using high-throughput SELEX sequencing. However, there remain a large number of TFs (∼800) with no high-throughput-derived binding motifs. Computational methods capable of associating known motifs to such TFs will avoid tremendous experimental efforts and enable deeper understanding of transcriptio  ...[more]

Similar Datasets

| S-EPMC7397869 | biostudies-literature
| S-EPMC3138120 | biostudies-literature
| S-EPMC7263192 | biostudies-literature
| S-EPMC1316881 | biostudies-other
| S-EPMC5416775 | biostudies-literature
| S-EPMC5823854 | biostudies-literature
| S-EPMC8797260 | biostudies-literature
| S-EPMC5261565 | biostudies-literature
| S-EPMC8419926 | biostudies-literature
| S-EPMC10690171 | biostudies-literature