Unknown

Dataset Information

0

Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors.


ABSTRACT: Understanding how regulatory elements control mammalian gene expression is a challenge of post-genomic era. We previously reported that size of proximal promoter architecture predicted the breadth of expression (fraction of tissues in which a gene is expressed). Herein, the contributions of individual transcription factors (TFs) were quantified. Several technologies of statistical modelling were utilized and compared: tree models, generalized linear models (GLMs, without and with regularization), Bayesian GLMs and random forest. Both linear and non-linear modelling strategies were explored. Encouragingly, different models led to similar statistical conclusions and biological interpretations. The majority of ENCODE TFs correlated positively with housekeeping expression, a minority correlated negatively. Thus, housekeeping expression can be understood as a cumulative effect of many types of TF binding sites. This is accompanied by the exclusion of fewer types of binding sites for TFs which are repressors, or support cell lineage commitment or temporarily inducible or spatially-restricted expression.

SUBMITTER: Huminiecki L 

PROVIDER: S-EPMC6013173 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors.

Huminiecki Lukasz L  

PloS one 20180621 6


Understanding how regulatory elements control mammalian gene expression is a challenge of post-genomic era. We previously reported that size of proximal promoter architecture predicted the breadth of expression (fraction of tissues in which a gene is expressed). Herein, the contributions of individual transcription factors (TFs) were quantified. Several technologies of statistical modelling were utilized and compared: tree models, generalized linear models (GLMs, without and with regularization)  ...[more]

Similar Datasets

| S-EPMC4310617 | biostudies-literature
| S-EPMC4795619 | biostudies-literature
| S-EPMC2615112 | biostudies-literature
| S-EPMC2413260 | biostudies-literature
| S-EPMC6803630 | biostudies-literature
| S-EPMC4523886 | biostudies-literature
| S-EPMC6508781 | biostudies-literature
| S-EPMC3730107 | biostudies-literature
| S-EPMC2779200 | biostudies-literature
| S-EPMC152810 | biostudies-literature