Unknown

Dataset Information

0

The Escherichia coli transcriptome mostly consists of independently regulated modules.


ABSTRACT: Underlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcriptomic signals represent the effects of currently characterized transcriptional regulators. Condition-specific activation of signals is validated by exposure of E. coli to new environmental conditions. The resulting decomposition of the transcriptome provides: a mechanistic, systems-level, network-based explanation of responses to environmental and genetic perturbations; a guide to gene and regulator function discovery; and a basis for characterizing transcriptomic differences in multiple strains. Taken together, our results show that signal summation describes the composition of a model prokaryotic transcriptome.

SUBMITTER: Sastry AV 

PROVIDER: S-EPMC6892915 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

The Escherichia coli transcriptome mostly consists of independently regulated modules.

Sastry Anand V AV   Gao Ye Y   Szubin Richard R   Hefner Ying Y   Xu Sibei S   Kim Donghyuk D   Choudhary Kumari Sonal KS   Yang Laurence L   King Zachary A ZA   Palsson Bernhard O BO  

Nature communications 20191204 1


Underlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcri  ...[more]

Similar Datasets

| S-EPMC7732839 | biostudies-literature
| S-EPMC7305307 | biostudies-literature
| S-EPMC3939876 | biostudies-literature
| S-EPMC4778597 | biostudies-literature
| S-EPMC4959288 | biostudies-literature
| S-EPMC3415497 | biostudies-literature
| S-EPMC6403342 | biostudies-literature
| S-EPMC2916507 | biostudies-literature
| S-EPMC5953345 | biostudies-literature
| S-EPMC4222554 | biostudies-literature