Project description:The availability of complete genome sequences of H. pylori 26695 has provided a wealth of information enabling us to carry out in silico studies to identify new molecular targets for pharmaceutical treatment. In order to construe the structural and functional information of complete proteome, use of computational methods are more relevant since these methods are reliable and provide a solution to the time consuming and expensive experimental methods. Out of 1590 predicted protein coding genes in H. pylori, experimentally determined structures are available for only 145 proteins in the PDB. In the absence of experimental structures, computational studies on the three dimensional (3D) structural organization would help in deciphering the protein fold, structure and active site. Functional annotation of each protein was carried out based on structural fold and binding site based ligand association. Most of these proteins are uncharacterized in this proteome and through our annotation pipeline we were able to annotate most of them. We could assign structural folds to 464 uncharacterized proteins from an initial list of 557 sequences. Of the 1195 known structural folds present in the SCOP database, 411 (34% of all known folds) are observed in the whole H. pylori 26695 proteome, with greater inclination for domains belonging to α/β class (36.63%). Top folds include P-loop containing nucleoside triphosphate hydrolases (22.6%), TIM barrel (16.7%), transmembrane helix hairpin (16.05%), alpha-alpha superhelix (11.1%) and S-adenosyl-L-methionine-dependent methyltransferases (10.7%).
Project description:Methyltransferases (MTases) of procaryotes affect general cellular processes such as mismatch repair, regulation of transcription, replication, and transposition, and in some cases may be essential for viability. As components of restriction-modification systems, they contribute to bacterial genetic diversity. The genome of Helicobacter pylori strain 26695 contains 25 open reading frames encoding putative DNA MTases. To assess which MTase genes are active, strain 26695 genomic DNA was tested for cleavage by 147 restriction endonucleases; 24 were found that did not cleave this DNA. The specificities of 11 expressed MTases and the genes encoding them were identified from this restriction data, combined with the known sensitivities of restriction endonucleases to specific DNA modification, homology searches, gene cloning and genomic mapping of the methylated bases m(4)C, m(5)C, and m(6)A.
Project description:Dear Sir or Madam, we report an in-depth proteogenomics study of Helicobacter pylori strain 26695 and provide the supporting MS data via ProteomExchange. The study includes 2 biological replicates with 6 different datasets: G1: in-gel digestion with trypsin, replicate 1 G2: in-gel digestion with trypsin, replicate 2 T1: SEC fractionation of low molecular weight (LMW) proteins and subsequent trypsin digestion, replicate 1 T2: SEC fractionation of LMW proteins and subsequent trypsin digestion, replicate 2 A1: SEC fractionation of LMW proteins and subsequent AspN digestion, replicate 1 A2: SEC fractionation of LMW proteins and subsequent AspN digestion, replicate 2 L1: SEC fractionation of LMW proteins and subsequent LysC digestion, replicate 1 L2: SEC fractionation of LMW proteins and subsequent LysC digestion, replicate 2 In our proteogenomics approach, we could identify four previously missing protein annotations and were able to correct sequences of six protein coding regions. Furthermore we identified signal peptidase cleavage sites for 72 different proteins. MGFs were generated by Maxquant 1.1 [1] using recalibration of peptide parent masses. For PRIDE (http://www.ebi.ac.uk/pride) submission, we made an additional database search with Mascot and X!Tandem using the SearchGUI [2]. Therefore we searched against a NCBI database of H. pylori strain 26695 complemented with the sequence corrections, signal peptide cleavage sites and missing annotations with the same configurations as described in materials and methods. For pride xml export we used the software PeptideShaker (http://code.google.com/p/peptide-shaker/). The complemented database has entries which will be submitted to the UniProtKB via SPIN. The entries have the according SPIN number as accession number. The NCBI accession numbers for the shortened sequences due to signal peptide cleavage are extended with “_1”. The fasta database is added to the submission. For additional information, please contact me: stephan.mueller@ufz.de Yours sincerely, Stephan Mueller References: [1] Cox J, Neuhauser N, Michalski A, Scheltema RA, Olsen JV, Mann M. Andromeda: a peptide search engine integrated into the MaxQuant environment. Journal of proteome research. 2011;10:1794-805. [2] Vaudel M, Barsnes H, Berven FS, Sickmann A, Martens L. SearchGUI: An open-source graphical user interface for simultaneous OMSSA and X!Tandem searches. Proteomics. 2011;11:996-9.