Project description:PFGRC has developed a cost effective alternative to complete genome sequencing in order to study the genetic differences between closely related species and/or strains. The comparative genomics approach combines Gene Discovery (GD) and Comparative Genomic Hybridization (CGH) techniques, resulting in the design and production of species microarrays that represent the diversity of a species beyond just the sequenced reference strain(s) used in the initial microarray design. These species arrays may then be used to interrogate hundreds of closely related strains in order to further unravel their evolutionary relationships. The Pneumococcus are among most deadly pathogens world-wide. The infections and outbreaks caused by this pathogens is quite frequent despite existing diagnostic network and therapeutic means. Therefore, developing reliable diagnostic tools and efficient (broad-spectrum) therapeutics for Streptococcus pneumoniae remain a public health priority for every country in world today. The comparative genomics study will provide the largest hitherto genomic data sets regarding this pathogen.These large data sets will enable us as well as other members of scientific community to conduct comprehensive data mining in the form of gene association studies with statistical power and significance.
Project description:PFGRC has developed a cost effective alternative to complete genome sequencing in order to study the genetic differences between closely related species and/or strains. The comparative genomics approach combines Gene Discovery (GD) and Comparative Genomic Hybridization (CGH) techniques, resulting in the design and production of species microarrays that represent the diversity of a species beyond just the sequenced reference strain(s) used in the initial microarray design. These species arrays may then be used to interrogate hundreds of closely related strains in order to further unravel their evolutionary relationships. The Pneumococcus are among most deadly pathogens world-wide. The infections and outbreaks caused by this pathogens is quite frequent despite existing diagnostic network and therapeutic means. Therefore, developing reliable diagnostic tools and efficient (broad-spectrum) therapeutics for Streptococcus pneumoniae remain a public health priority for every country in world today. The comparative genomics study will provide the largest hitherto genomic data sets regarding this pathogen.These large data sets will enable us as well as other members of scientific community to conduct comprehensive data mining in the form of gene association studies with statistical power and significance. Two hundread fifty five query strains were investigated in this study, with each query strain hybridized against the reference strain, tigr4. Each strain has a single dye experiment. Each oligo is spotted on the S.pneumoniae species microarray once. Positive controls on the array consist of oligos designed from the sequenced reference genome of S. pneumoniae and negative controls on the array consist of oligos designed from the thale cress plant, Arabidopsis thaliana.The microarrays also had Agilent internal controls.
Project description:Streptococcus pneumoniae (pneumococcus) is a major human respiratory pathogen and the leading cause of bacterial pneumonia worldwide. Small regulatory RNAs (sRNAs), which often act by post-transcriptionally regulating gene expression, have been shown to be crucial for the virulence of S. pneumoniae and other bacterial pathogens. Over 170 putative sRNAs have been identified in S. pneumoniae TIGR4 strain (serotype 4) through transcriptomic studies, and a subset of these sRNAs have been further implicated in regulating pneumococcal pathogenesis. However, there was little overlap in the sRNAs identified among these studies, which indicated that the approaches used for sRNA identification were not sufficiently sensitive and robust and that there were likely many more undiscovered sRNAs encoded in the S. pneumoniae genome. Here, we sought to comprehensively identify sRNAs in Avery's virulent S. pneumoniae strain D39 using two independent RNA-seq based approaches. We developed an unbiased method for identifying novel sRNAs from bacterial RNA-seq data and have further tested the specificity of our analysis program towards identifying sRNAs encoded by both strains D39 and TIGR4. Interestingly, the genes for 15% of the putative sRNAs identified in strain TIGR4 including ones previously implicated in virulence were not present in strain D39 genome suggesting that the differences in sRNA repertoires between these two serotypes may contribute to their strain-specific virulence properties. Finally, this study has identified 67 new sRNA candidates in strain D39, 28 out of which have been further validated, raising the total number of sRNAs that have been identified in strain D39 to 112.