Unknown

Dataset Information

0

Towards a map of cis-regulatory sequences in the human genome.


ABSTRACT: Accumulating evidence indicates that transcription factor (TF) binding sites, or cis-regulatory elements (CREs), and their clusters termed cis-regulatory modules (CRMs) play a more important role than do gene-coding sequences in specifying complex traits in humans, including the susceptibility to common complex diseases. To fully characterize their roles in deriving the complex traits/diseases, it is necessary to annotate all CREs and CRMs encoded in the human genome. However, the current annotations of CREs and CRMs in the human genome are still very limited and mostly coarse-grained, as they often lack the detailed information of CREs in CRMs. Here, we integrated 620 TF ChIP-seq datasets produced by the ENCODE project for 168 TFs in 79 different cell/tissue types and predicted an unprecedentedly completely map of CREs in CRMs in the human genome at single nucleotide resolution. The map includes 305 912 CRMs containing a total of 1 178 913 CREs belonging to 736 unique TF binding motifs. The predicted CREs and CRMs tend to be subject to either purifying selection or positive selection, thus are likely to be functional. Based on the results, we also examined the status of available ChIP-seq datasets for predicting the entire regulatory genome of humans.

SUBMITTER: Niu M 

PROVIDER: S-EPMC6009671 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Towards a map of cis-regulatory sequences in the human genome.

Niu Meng M   Tabari Ehsan E   Ni Pengyu P   Su Zhengchang Z  

Nucleic acids research 20180601 11


Accumulating evidence indicates that transcription factor (TF) binding sites, or cis-regulatory elements (CREs), and their clusters termed cis-regulatory modules (CRMs) play a more important role than do gene-coding sequences in specifying complex traits in humans, including the susceptibility to common complex diseases. To fully characterize their roles in deriving the complex traits/diseases, it is necessary to annotate all CREs and CRMs encoded in the human genome. However, the current annota  ...[more]

Similar Datasets

| S-EPMC4041622 | biostudies-literature
2012-07-01 | GSE29184 | GEO
| S-EPMC3179250 | biostudies-literature
2012-06-30 | E-GEOD-34587 | biostudies-arrayexpress
2012-06-30 | E-GEOD-29278 | biostudies-arrayexpress
2012-07-01 | E-GEOD-29218 | biostudies-arrayexpress
2012-07-01 | GSE34587 | GEO
2012-07-01 | GSE29278 | GEO
2012-07-01 | GSE29218 | GEO
| S-EPMC5994936 | biostudies-literature