Project description:Hypervariable regions V3-V5 of bacterial 16S rRNA genes. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Project description:16S rRNA gene sequences are commonly analyzed for taxonomic and phylogenetic studies because they contain variable regions that can help distinguish different genera. However, intra-genus distinction using variable region homology is often impossible due to the high overall sequence identities among closely related species, even though some residues may be conserved within respective species. Using a computational method that included the allelic diversity within individual genomes, we discovered that certain Escherichia and Shigella species can be distinguished by a multi-allelic 16S rRNA variable region single nucleotide polymorphism (SNP). To evaluate the performance of 16S rRNAs with altered variable regions, we developed an in vivo system that measures the acceptance and distribution of variant 16S rRNAs into a large pool of natural versions supporting normal translation and growth. We found that 16S rRNAs containing evolutionarily disparate variable regions were underpopulated both in ribosomes and in active translation pools, even for an SNP. Overall, this study revealed that variable region sequences can substantially influence the performance of 16S rRNAs and that this biological constraint can be leveraged to justify refining taxonomic assignments of variable region sequence data. IMPORTANCE This study reevaluates the notion that 16S rRNA gene variable region sequences are uninformative for intra-genus classification and that single nucleotide variations within them have no consequence to strains that bear them. We demonstrated that the performance of 16S rRNAs in Escherichia coli can be negatively impacted by sequence changes in variable regions, even for single nucleotide changes that are native to closely related Escherichia and Shigella species; thus, biological performance is likely constraining the evolution of variable regions in bacteria. Further, the native nucleotide variations we tested occur in all strains of their respective species and across their multiple 16S rRNA gene copies, suggesting that these species evolved beyond what would be discerned from a consensus sequence comparison. Therefore, this work also reveals that the multiple 16S rRNA gene alleles found in most bacteria can provide more informative phylogenetic and taxonomic detail than a single reference allele.
Project description:Cells devote a significant effort toward the production of multiple modified nucleotides in rRNAs, which fine tune the ribosome function. Here, we report that two methyltransferases, RsmB and RsmF, are responsible for all four 5-methylcytidine (m(5)C) modifications in 16S rRNA of Thermus thermophilus. Like Escherichia coli RsmB, T. thermophilus RsmB produces m(5)C967. In contrast to E. coli RsmF, which introduces a single m(5)C1407 modification, T. thermophilus RsmF modifies three positions, generating m(5)C1400 and m(5)C1404 in addition to m(5)C1407. These three residues are clustered near the decoding site of the ribosome, but are situated in distinct structural contexts, suggesting a requirement for flexibility in the RsmF active site that is absent from the E. coli enzyme. Two of these residues, C1400 and C1404, are sufficiently buried in the mature ribosome structure so as to require extensive unfolding of the rRNA to be accessible to RsmF. In vitro, T. thermophilus RsmF methylates C1400, C1404, and C1407 in a 30S subunit substrate, but only C1400 and C1404 when naked 16S rRNA is the substrate. The multispecificity of T. thermophilus RsmF is potentially explained by three crystal structures of the enzyme in a complex with cofactor S-adenosyl-methionine at up to 1.3 A resolution. In addition to confirming the overall structural similarity to E. coli RsmF, these structures also reveal that key segments in the active site are likely to be dynamic in solution, thereby expanding substrate recognition by T. thermophilus RsmF.
Project description:Contaminated aquifer (Dusseldorf-Flinger, Germany) templates extracted from 5 sediment depths ranging between 6.4 and 8.4 m below ground and over 3 years of sampling were amplified for amplicon pyrosequencing using the primers Ba27f (5’-aga gtt tga tcm tgg ctc ag-3’) and Ba519r (5’- tat tac cgc ggc kgc tg-3’), extended as amplicon fusion primers with respective primer A or B adapters, key sequence and multiplex identifiers (MID) as recommended by 454/Roche. Amplicons were purified and pooled as specified by the manufacturer. Emulsion PCR (emPCR), purification of DNA-enriched beads and sequencing run were performed following protocols and using a 2nd generation pyrosequencer (454 GS FLX Titanium, Roche) as recommended by the developer. Quality filtering of the pyrosequencing reads was performed using the automatic amplicon pipeline of the GS Run Processor (Roche), with a slight modification concerning the valley filter (vfScanAllFlows false instead of TiOnly) to extract the sequences. Demultiplexed raw reads were furhter trimmed for quality and lenght (>250 bp). 15 samples examined in total from important plume zones of the aquifer sampled in Feb. 2006, Sep. 2008 and Jun. 2009 (5 every year of sampling).
Project description:Contaminated aquifer (Dusseldorf-Flinger, Germany) templates extracted from 5 sediment depths ranging between 6.4 and 8.4 m below ground and over 3 years of sampling were amplified for amplicon pyrosequencing using the primers Ba27f (5’-aga gtt tga tcm tgg ctc ag-3’) and Ba519r (5’- tat tac cgc ggc kgc tg-3’), extended as amplicon fusion primers with respective primer A or B adapters, key sequence and multiplex identifiers (MID) as recommended by 454/Roche. Amplicons were purified and pooled as specified by the manufacturer. Emulsion PCR (emPCR), purification of DNA-enriched beads and sequencing run were performed following protocols and using a 2nd generation pyrosequencer (454 GS FLX Titanium, Roche) as recommended by the developer. Quality filtering of the pyrosequencing reads was performed using the automatic amplicon pipeline of the GS Run Processor (Roche), with a slight modification concerning the valley filter (vfScanAllFlows false instead of TiOnly) to extract the sequences. Demultiplexed raw reads were furhter trimmed for quality and lenght (>250 bp).
Project description:Microbial functions in the host physiology are a result of co-evolution between microbial communities and their hosts. Here we show that cold exposure leads to marked shift of the microbiota composition, referred to as cold microbiota. Transplantation of the cold microbiota to germ-free mice is sufficient to increase the insulin sensitivity of the host, and enable complete tolerance to cold partly by promoting the white fat browning, leading to increased energy expenditure and fat loss. During prolonged cold however, the body weight loss is attenuated, caused by adaptive mechanisms maximising caloric uptake and increasing intestinal, villi and microvilli lengths. This increased absorptive surface is promoted by the cold microbiota - effect that can be diminished by co-transplanting the most downregulated bacterial strain from the Verrucomicrobia phylum, Akkermansia muciniphila, during the cold microbiota transfer. Our results demonstrate the microbiota as a key factor orchestrating the overall energy homeostasis during increased demand.
Project description:Clostridium difficile infection is an increasing problem in hospitals worldwide, mainly due to the recent emergence of a hypervirulent C. difficile strain. C. difficile PCR ribotyping, based on size variation of the 16S-23S rRNA intergenic spacer region (16S-23S ISR), is widely used in Europe for molecular epidemiological investigation. The mechanism underlying the 16S-23S ISR size variations in the genome of C. difficile is currently not completely understood. To elucidate this mechanism, isolates of six different PCR ribotypes were analysed by cloning and sequencing the 16S-23S ISR. A direct repeat, IB, of 9 bp was detected up to five times in the 16S-23S ISR in all 47 clones investigated. Thirty-five clones displayed differences either by ribotype or by nucleotide sequence. The sequences of the 16S-23S ISR of C. difficile showed a uniformly organized structure, composed of a tRNA(Ala) gene and spacers of 33 and 53 bp separated by the 9 bp direct repeat IB. The results of the study support the hypothesis that this composition is responsible for the length variations seen in the 16S-23S ISR, and indicate that these length variations result from slipped-strand mispairing and intra- and possibly interchromosomal homologous recombination.
Project description:Proper preservation of stool samples to minimize microbial community shifts and inactivate infectious agents is important for self-collected specimens requiring shipment to laboratories when cold chain transport is not feasible. In this study, we evaluated the performance of six preservation solutions (Norgen, OMNI, RNAlater, CURNA, HEMA, and Shield) for these aspects. Following storage of human stool samples with these preservatives at room temperature for 7 days, three hypervariable regions of the bacterial 16S rRNA gene (V1-V2, V3-V4, and V4) were amplicon sequenced. We found that samples collected in two preservatives, Norgen and OMNI, showed the least shift in community composition relative to -80°C standards compared with other storage conditions, and both efficiently inhibited the growth of aerobic and anaerobic bacteria. RNAlater did not prevent bacterial activity and exhibited relatively larger community shift. Although the effect of preservation solution was small compared to intersubject variation, notable changes in microbiota composition were observed, which could create biases in downstream data analysis. When community profiles inferred from different 16S rRNA gene hypervariable regions were compared, we found differential sensitivity of primer sets in identifying overall microbial community and certain bacterial taxa. For example, reads generated by the V4 primer pair showed a higher alpha diversity of the gut microbial community. The degenerate 27f-YM primer failed to detect the majority of Bifidobacteriales. Our data indicate that choice of preservation solution and 16S rRNA gene primer pair are critical determinants affecting gut microbiota profiling. IMPORTANCE Large-scale human microbiota studies require specimens collected from multiple sites and/or time points to maximize detection of the small effects in microbe-host interactions. However, batch biases caused by experimental protocols, such as sample collection, massively parallel sequencing, and bioinformatics analyses, remain critical and should be minimized. This work evaluated the effects of preservation solutions and bacterial 16S rRNA gene primer pairs in revealing human gut microbiota composition. Since notable changes in detecting bacterial composition and abundance were observed among choice of preservatives and primer pairs, a consistent methodology is essential in minimizing their effects to facilitate comparisons between data sets.