Unknown

Dataset Information

0

Identification and mapping of self-assembling protein domains encoded by the Escherichia coli K-12 genome by use of lambda repressor fusions.


ABSTRACT: Self-assembling proteins and protein fragments encoded by the Escherichia coli genome were identified from E. coli K-12 strain MG1655. Libraries of random DNA fragments cloned into a series of lambda repressor fusion vectors were subjected to selection for immunity to infection by phage lambda. Survivors were identified by sequencing the ends of the inserts, and the fused protein sequence was inferred from the known genomic sequence. Four hundred sixty-three nonredundant open reading frame-encoded interacting sequence tags (ISTs) were recovered from sequencing 2,089 candidates. These ISTs, which range from 16 to 794 amino acids in length, were clustered into families of overlapping fragments, identifying potential homotypic interactions encoded by 232 E. coli genes. Repressor fusions identified ISTs from genes in every protein-based functional category, but membrane proteins were underrepresented. The IST-containing genes were enriched for regulatory proteins and for proteins that form higher-order oligomers. Forty-eight (20.7%) homotypic proteins identified by ISTs are predicted to contain coiled coils. Although most of the IST-containing genes are identifiably related to proteins in other bacterial genomes, more than half of the ISTs do not have identifiable homologs in the Protein Data Bank, suggesting that they may include many novel structures. The data are available online at http://oligomers.tamu.edu/.

SUBMITTER: Marino-Ramirez L 

PROVIDER: S-EPMC344411 | biostudies-literature | 2004 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification and mapping of self-assembling protein domains encoded by the Escherichia coli K-12 genome by use of lambda repressor fusions.

Mariño-Ramírez Leonardo L   Minor Jonathan L JL   Reading Nicola N   Hu James C JC  

Journal of bacteriology 20040301 5


Self-assembling proteins and protein fragments encoded by the Escherichia coli genome were identified from E. coli K-12 strain MG1655. Libraries of random DNA fragments cloned into a series of lambda repressor fusion vectors were subjected to selection for immunity to infection by phage lambda. Survivors were identified by sequencing the ends of the inserts, and the fused protein sequence was inferred from the known genomic sequence. Four hundred sixty-three nonredundant open reading frame-encod  ...[more]

Similar Datasets

| S-EPMC1955323 | biostudies-literature
2015-09-30 | GSE65385 | GEO
| S-EPMC111278 | biostudies-literature
| S-EPMC3761395 | biostudies-literature
| S-EPMC6405996 | biostudies-literature
| S-EPMC3396475 | biostudies-literature
| S-EPMC101859 | biostudies-literature
| S-EPMC3807447 | biostudies-literature
2007-05-18 | GSE6781 | GEO
| S-EPMC2903704 | biostudies-literature