Unknown

Dataset Information

0

Structure and computational analysis of a novel protein with metallopeptidase-like and circularly permuted winged-helix-turn-helix domains reveals a possible role in modified polysaccharide biosynthesis.


ABSTRACT: CA_C2195 from Clostridium acetobutylicum is a protein of unknown function. Sequence analysis predicted that part of the protein contained a metallopeptidase-related domain. There are over 200 homologs of similar size in large sequence databases such as UniProt, with pairwise sequence identities in the range of ~40-60%. CA_C2195 was chosen for crystal structure determination for structure-based function annotation of novel protein sequence space.The structure confirmed that CA_C2195 contained an N-terminal metallopeptidase-like domain. The structure revealed two extra domains: an ?+? domain inserted in the metallopeptidase-like domain and a C-terminal circularly permuted winged-helix-turn-helix domain.Based on our sequence and structural analyses using the crystal structure of CA_C2195 we provide a view into the possible functions of the protein. From contextual information from gene-neighborhood analysis, we propose that rather than being a peptidase, CA_C2195 and its homologs might play a role in biosynthesis of a modified cell-surface carbohydrate in conjunction with several sugar-modification enzymes. These results provide the groundwork for the experimental verification of the function.

SUBMITTER: Das D 

PROVIDER: S-EPMC4000134 | biostudies-literature | 2014 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structure and computational analysis of a novel protein with metallopeptidase-like and circularly permuted winged-helix-turn-helix domains reveals a possible role in modified polysaccharide biosynthesis.

Das Debanu D   Murzin Alexey G AG   Rawlings Neil D ND   Finn Robert D RD   Coggill Penelope P   Bateman Alex A   Godzik Adam A   Aravind L L  

BMC bioinformatics 20140319


<h4>Background</h4>CA_C2195 from Clostridium acetobutylicum is a protein of unknown function. Sequence analysis predicted that part of the protein contained a metallopeptidase-related domain. There are over 200 homologs of similar size in large sequence databases such as UniProt, with pairwise sequence identities in the range of ~40-60%. CA_C2195 was chosen for crystal structure determination for structure-based function annotation of novel protein sequence space.<h4>Results</h4>The structure co  ...[more]

Similar Datasets

| S-EPMC4333399 | biostudies-literature
| S-EPMC2175375 | biostudies-literature
| S-EPMC1316287 | biostudies-literature
| S-EPMC1450330 | biostudies-literature
| S-EPMC6551276 | biostudies-literature
| S-EPMC8615523 | biostudies-literature
| S-EPMC3861651 | biostudies-literature
| S-EPMC291871 | biostudies-literature
| S-EPMC6733452 | biostudies-literature
| S-EPMC3654790 | biostudies-literature