Unknown

Dataset Information

0

Structure based annotation of Helicobacter pylori strain 26695 proteome.


ABSTRACT: The availability of complete genome sequences of H. pylori 26695 has provided a wealth of information enabling us to carry out in silico studies to identify new molecular targets for pharmaceutical treatment. In order to construe the structural and functional information of complete proteome, use of computational methods are more relevant since these methods are reliable and provide a solution to the time consuming and expensive experimental methods. Out of 1590 predicted protein coding genes in H. pylori, experimentally determined structures are available for only 145 proteins in the PDB. In the absence of experimental structures, computational studies on the three dimensional (3D) structural organization would help in deciphering the protein fold, structure and active site. Functional annotation of each protein was carried out based on structural fold and binding site based ligand association. Most of these proteins are uncharacterized in this proteome and through our annotation pipeline we were able to annotate most of them. We could assign structural folds to 464 uncharacterized proteins from an initial list of 557 sequences. Of the 1195 known structural folds present in the SCOP database, 411 (34% of all known folds) are observed in the whole H. pylori 26695 proteome, with greater inclination for domains belonging to ?/? class (36.63%). Top folds include P-loop containing nucleoside triphosphate hydrolases (22.6%), TIM barrel (16.7%), transmembrane helix hairpin (16.05%), alpha-alpha superhelix (11.1%) and S-adenosyl-L-methionine-dependent methyltransferases (10.7%).

SUBMITTER: Singh S 

PROVIDER: S-EPMC4280198 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structure based annotation of Helicobacter pylori strain 26695 proteome.

Singh Swati S   Guttula Praveen Kumar PK   Guruprasad Lalitha L  

PloS one 20141230 12


The availability of complete genome sequences of H. pylori 26695 has provided a wealth of information enabling us to carry out in silico studies to identify new molecular targets for pharmaceutical treatment. In order to construe the structural and functional information of complete proteome, use of computational methods are more relevant since these methods are reliable and provide a solution to the time consuming and expensive experimental methods. Out of 1590 predicted protein coding genes in  ...[more]

Similar Datasets

| S-EPMC94898 | biostudies-literature
2013-08-01 | PXD000054 | Pride
| S-EPMC8773439 | biostudies-literature
| PRJNA134859 | ENA
| PRJNA233 | ENA
| PRJNA175543 | ENA
| PRJNA286939 | ENA
| PRJDB3908 | ENA
| PRJNA329507 | ENA
| PRJNA286938 | ENA