Unknown

Dataset Information

0

Functional Prediction of Hypothetical Transcription Factors of Escherichia coli K-12 Based on Expression Data.


ABSTRACT: The repertoire of 304 DNA-binding transcription factors (TFs) in Escherichia coli K-12 has been described recently, with 196 TFs experimentally characterized and 108 proteins predicted by sequence comparisons. Based on 303 expression profile patterns retrieved from the Colombos database 12 clusters were identified, including hypothetical and experimentally characterized TFs, using a spectral clustering algorithm based on a 3NN graph built using 14 principal components that represent 65% of the variance of the expression data. In a posterior step, clusters were characterized in terms of their associated overrepresented functions, based on KEGG, Supfam annotations and Pfam assignments among other functional categories using an enrichment test, reinforcing the notion that the identified clusters are functionally similar among them. Based on these data, the we identified 12 clusters in which hypothetical and known TFs share similar regulatory and physiological functions, such as module associations of toxin-antitoxin (TA) systems with DNA repair mechanisms, amino acid biosynthesis, and carbon metabolism/transport, among others. This analysis has increased our knowledge about gene regulation in E. coli K-12 and can be further expanded to other organisms.

SUBMITTER: Flores-Bautista E 

PROVIDER: S-EPMC6055005 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Functional Prediction of Hypothetical Transcription Factors of <i>Escherichia coli</i> K-12 Based on Expression Data.

Flores-Bautista Emanuel E   Cronick Carenne Ludeña CL   Fersaca Anny Rodriguez AR   Martinez-Nuñez Mario Alberto MA   Perez-Rueda Ernesto E  

Computational and structural biotechnology journal 20180327


The repertoire of 304 DNA-binding transcription factors (TFs) in <i>Escherichia coli</i> K-12 has been described recently, with 196 TFs experimentally characterized and 108 proteins predicted by sequence comparisons. Based on 303 expression profile patterns retrieved from the Colombos database 12 clusters were identified, including hypothetical and experimentally characterized TFs, using a spectral clustering algorithm based on a 3NN graph built using 14 principal components that represent 65% o  ...[more]

Similar Datasets

| S-EPMC6237786 | biostudies-literature
2018-08-08 | GSE111095 | GEO
2018-08-08 | GSE111094 | GEO
2018-08-08 | GSE111093 | GEO
2021-08-26 | GSE159658 | GEO
| S-EPMC8249747 | biostudies-literature
| S-EPMC56896 | biostudies-literature
2015-09-30 | GSE65385 | GEO
| S-EPMC3680503 | biostudies-literature
| S-EPMC4560290 | biostudies-literature