Unknown

Dataset Information

0

Identification and analysis of novel amino-acid sequence repeats in Bacillus anthracis str. Ames proteome using computational tools.


ABSTRACT: We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A "repeat" corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A "domain" corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure.

SUBMITTER: Hemalatha GR 

PROVIDER: S-EPMC1876623 | biostudies-literature | 2007

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification and analysis of novel amino-acid sequence repeats in Bacillus anthracis str. Ames proteome using computational tools.

Hemalatha G R GR   Rao D Satyanarayana DS   Guruprasad L L  

Comparative and functional genomics 20070225


We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A "repeat" corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A "domain" corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond  ...[more]

Similar Datasets

| S-EPMC2612425 | biostudies-literature
| PRJNA309 | ENA
| PRJNA209691 | ENA
| S-EPMC7023782 | biostudies-literature
| PRJNA176033 | ENA
| PRJNA10784 | ENA
| S-EPMC2743695 | biostudies-literature
| S-EPMC1828967 | biostudies-literature
| S-EPMC2408921 | biostudies-literature
| S-EPMC3917846 | biostudies-literature